Signal Over Noise: How AI Finds the Drugs Hiding in Plain Sight
A preview of the Solix EAI Pharma white paper — “Signal Over Noise: A Framework for AI-Driven Drug Repurposing and Target Identification in Pharma R&D.”
In January 2020, as SARS-CoV-2 spread and no specific treatment existed, a research team put a single question to an AI-enhanced biomedical knowledge graph: among the drugs already approved and sitting on pharmacy shelves, was there one that could block the mechanism the virus used to enter human cells?
The system surfaced an unexpected name — baricitinib, a rheumatoid arthritis drug.
The logic was not obvious, and that is the entire point. Baricitinib inhibits an enzyme called AAK1, a regulator of the cellular process SARS-CoV-2 was using to enter lung cells. AAK1’s role in viral entry had been documented across papers on dengue, Ebola, HCV, and HIV — literature that is almost never read alongside rheumatology research. The connection was not missing from science. It was scattered across domains no single researcher reads in combination.
By November 2020, baricitinib had an FDA Emergency Use Authorization.
The story is usually told as a COVID story. It is really a target identification story — and it carries a lesson that applies to almost every R&D pipeline in the industry today.
The connection was never the problem. The noise was.
It is tempting to read the baricitinib case as serendipity, or as a uniquely clever algorithm. It is neither. The biology had not changed. AAK1’s role was knowable from existing science. What changed was the ability to connect it — to traverse compound mechanism, viral biology, and pharmacokinetic data simultaneously, and surface one defensible hypothesis faster than manual literature review ever could.
Consider what the system actually did. It found 378 AAK1 inhibitors in the knowledge graph. Of those, 47 were FDA or EMA approved. Most were eliminated on pharmacokinetic grounds — their plasma concentrations at safe doses were simply too low to inhibit AAK1 meaningfully. Baricitinib stood out because its standard dose achieved sufficient concentration, and its years of rheumatoid arthritis use provided a known safety baseline, and its JAK1/2 anti-inflammatory action addressed the cytokine storm driving COVID mortality. A dual-mechanism candidate, surfaced and filtered from a field of hundreds.
This is the central problem of modern drug discovery, and it is not a shortage of data — it is the opposite. Genomic sequencing has produced enormous target landscapes. High-throughput screening has generated billions of compound-activity data points. The scientific knowledge base has never been larger. Yet the number of drugs approved per billion dollars of R&D spend has roughly halved every nine years since the 1950s, a paradox known as Eroom’s Law.
The bottleneck has quietly shifted. It is no longer generating knowledge. It is finding the signal inside it.
Why this matters now
The economics make the case unavoidable. Bringing a new molecule to market costs around $2.6 billion, and fewer than 11% of Phase I candidates ever reach approval — nearly nine in ten fail.
Repurposing changes the math. The average repurposing programme costs roughly $300 million — about a tenth of de novo discovery — because the compound already carries safety and pharmacokinetic data. It is no accident that an estimated 35% of transformative FDA-approved drugs emerged through repurposing pathways.
The opportunity is not theoretical. It is most likely sitting inside your own pipeline right now — in a shelved asset, a Phase 2 disappointment, or a compound whose full mechanism was never fully explored. The question is whether your infrastructure is built to find it.
From serendipity to system
Inspiration does not scale. A single lucky connection, however valuable, is not a strategy. The real prize is turning that kind of discovery into a repeatable process — one that can interrogate an entire asset base, again and again, and surface the connections worth pursuing.
That requires moving past the idea of AI as a black box. In the baricitinib case, the reasoning was fully traceable: AAK1 inhibition blocks viral entry, combined with JAK1/2 anti-inflammatory action, produces a dual therapeutic rationale. The system made the inference chain visible, not just the output. This is not a compliance nicety. No R&D team makes a $50 million clinical investment — and no regulator accepts a submission — on a candidate no one can explain.
Signal over noise, made systematic and made explainable.
What the full white paper covers
This preview makes the case for why. The white paper lays out the how — a vendor-neutral framework R&D teams can apply to their own pipelines, regardless of platform:
- The four-phase implementation framework — Data Integration & Semantic Indexing → Hypothesis Generation → Multi-Signal Validation → Clinical Translation — with the discrete prerequisites and common failure modes at each gate.
- The multi-signal data model — why no single data layer is ever sufficient, and how compound history, disease biology, scientific literature, and clinical evidence converge to produce a credible signal.
- The baricitinib case, fully deconstructed — the 378-to-1 filtering logic, the dual-mechanism rationale, and how a multi-signal approach compresses hypothesis generation from months into hours.
- The strategic role of your proprietary data — why internal assay results, ELN observations, and shelved clinical study reports, indexed alongside public databases, create an intelligence advantage competitors cannot replicate.
- Explainability as a scientific requirement — why a defensible, evidence-backed candidate matters more than a confident one.
Read the full white paper: Signal Over Noise — A Framework for AI-Driven Drug Repurposing and Target Identification in Pharma R&D.
Download the white paper | Book a consultation call
Our scientific and technical team can walk you through how the four-phase framework applies to your specific pipeline. | eai@solix.com · https://pharma.solix.com/
The oldest compound in your pipeline may have a mechanism worth re-examining. The clinical failure from three years ago may have been asking the wrong biological question.
Meet the Solix EAI Pharma team at BIO International 2026, Booth #5252 — June 22–25, San Diego. We’ll be running this live.
