AI Drug Discovery
Software that finds new medicines faster
Thesis
AI drug discovery is two industries pretending to be one. The first is platform companies — Recursion, Insitro, Xaira, Isomorphic — building large-scale data and model infrastructure to industrialize early discovery. The second is asset companies dressing up traditional pipelines with AI marketing. Knowing which is which is the entire game.
What's crowded: structure prediction wrappers, generic generative chemistry plays, "AlphaFold for X" pitches. What's underexplored: data generation moats (the wet-lab side of the loop), translational AI between preclinical and clinical, and trial design optimization. The next wave of winners will be vertically integrated — owning the data they train on.
Watch: Xaira's first clinical readouts, Isomorphic's pipeline disclosures, and any AI-discovered molecule reaching Phase III. The thesis lives or dies on Phase II data over the next 18 months.
Last updated 2026-04-01
Featured Models
Google DeepMind
Structure Prediction, Complex Prediction, Ligand Docking
MIT
Structure Prediction, Complex Prediction
MIT
Structure Prediction, Molecular Docking, Complex Prediction
Chai Discovery
Structure Prediction, Complex Prediction
Companies
Xaira Therapeutics
Building the operating system for AI-native drug discovery.
The most ambitious bet in the space — $1B to build a vertically integrated AI drug company from scratch. The team is elite (ex-DeepMind, ex-Genentech leadership), but the thesis requires everything to work: data generation, model training, and clinical translation. No one has pulled that off yet.
Isomorphic Labs
Google DeepMind's drug discovery spinoff.
Has the best foundation models in the world and the worst incentive to share them. Isomorphic's moat is AlphaFold, but their challenge is proving that structure prediction translates to clinical outcomes. Eli Lilly and Novartis partnerships are the first real test.
Recursion
Industrializing drug discovery with biology-first AI.
The only public AI drug discovery company with a genuine data moat — their phenomic dataset is unmatched. But being public means quarterly pressure on a 10-year thesis. The Roche-Genentech partnership validates the platform; pipeline clinical readouts will validate the company.
Insitro
Machine learning for drug discovery and development.
Daphne Koller's bet that the bottleneck isn't compute but data quality. Insitro generates its own cellular data at scale, which is the right instinct — but the gap between cellular models and human biology is where most drug discovery AI fails.
Generate Biomedicines
Generative AI for protein therapeutics.
The leading generative protein design company, now backed by Novartis. Their Chroma model is impressive, but the real question is whether computationally designed proteins can survive the clinic. First IND expected 2026.
Cradle
AI-powered protein engineering platform.
Smart positioning — instead of discovering new drugs, Cradle helps biotech teams engineer better versions of proteins they already have. Lower risk, faster feedback loops, clearer path to revenue. The un-sexy approach that might actually work.
Iambic Therapeutics
AI-first small molecule drug discovery.
Focused on the unsexy-but-critical middle of drug discovery: lead optimization and ADMET. Less flashy than de novo design, but this is where 80% of programs actually fail. Strong team, realistic scope.
Terray Therapeutics
Chemistry at trillion-molecule scale.
Terray's bet is that better chemistry data beats better AI. Their microfluidic platform generates binding data at a scale no one else can match — and in drug discovery, data wins. The anti-thesis to pure computational approaches, and it might be right.
Funding
Cradle
Series B · 2025-02-10
Led by Index Ventures, Benchmark
The sleeper pick. Cradle is doing for protein engineering what GitHub Copilot did for code — making the bench scientist 10x more productive. The SaaS model means they capture value across the entire protein engineering market without owning any single therapeutic. Smart capital-light strategy in a capital-heavy industry.
Generate bio
Series B · 2025-01-22
Led by Lux Capital, Flagship Pioneering
Generative biology is the most intellectually honest framing in this space — they're not pretending to have solved drug discovery, they're building a generative engine for protein therapeutics. The Novartis partnership de-risks the platform, but the real test is whether diffusion models for proteins can outperform directed evolution at scale.
Insitro
Series C · 2024-11-05
Led by a16z, Canada Pension Plan Investment Board
Daphne Koller has quietly built one of the most defensible data assets in the industry. The iPSC-derived cellular models are the moat — every other AI biotech is training on the same public datasets. This is the one to watch if you believe data quality beats model architecture.
Isomorphic
Series B · 2024-09-12
Led by Alphabet, Thrive Capital
Alphabet is essentially funding its own subsidiary, but the pharma partnerships with Lilly and Novartis give this credibility that most DeepMind spinouts never achieve. The question is whether AlphaFold's structural insights translate to clinical candidates — structure is necessary but nowhere near sufficient.
Recursion
Secondary Offering · 2024-07-18
Led by Baillie Gifford, GV
The public markets are finally giving Recursion credit for the data flywheel, but the stock is still priced on vibes, not pipelines. They need a Phase II readout that proves phenomics-driven discovery actually works in patients, not just in pretty embeddings.
Xaira
Series A · 2024-04-23
Led by ARCH Venture Partners, Foresite Capital, a16z
A billion dollars for a company with no clinical assets — this is either the most important bet in biotech or the most expensive science experiment in history. The team justifies the valuation; the timeline doesn't.
Signals
Zichen Wang, Joshua Meier, Shuai Yuan et al. · 2025-03-05
Phase I/II Study of TRY-3388 in Patients with KRAS G12D-Mutant Non-Small Cell Lung Cancer
Terray Therapeutics · Phase I/II · Recruiting
A Phase I Study of XAI-0291 in Patients with Moderate-to-Severe Inflammatory Bowel Disease
Xaira Therapeutics · Phase I · Recruiting
From billion-dollar launch to Phase I in under a year — that is either terrifyingly fast or exactly what happens when you hire half of the structural biology talent in the industry. IBD is a crowded space, but they claim their AI-designed biologic hits a conformational epitope that existing therapies miss. Bold claim. Clock starts now.
Jacob Berlin, Michael Kossler, Lara Malins et al. · 2025-01-08
Phase II Study of ISO-LLY-4481 for Treatment of Moderate-to-Severe Atopic Dermatitis
Isomorphic Labs / Eli Lilly · Phase II · Active, not recruiting
The first jointly developed molecule from the Isomorphic-Lilly partnership entering Phase II. Lilly would not have pushed this into mid-stage trials unless the preclinical package was genuinely differentiated. Atopic derm is a competitive market, but if AlphaFold-derived compounds show superior selectivity profiles, it rewrites the playbook for structure-based drug design.
Berton Earnshaw, Imran Hasan, Mason Victors et al. · 2024-11-02
The scale of this dataset is the moat — nobody else has 2 billion annotated cell images. But the benchmark tasks are all retrospective. The real question is whether these embeddings can prospectively identify targets that traditional screening misses. The REC-4881 trial is that test.
Daphne Koller, Luke O'Connor, Theofanis Karaletsos et al. · 2024-09-27
This is the most convincing data asset paper in AI bio. Period. The scale of the iPSC library, the depth of the multi-omic characterization, and the causal inference framework are all best-in-class. If you want to understand why Insitro raised $400M, read Table 3.
Phase I Dose-Escalation Study of IAM-1412, a CDK2 Inhibitor, in Patients with Advanced Solid Tumors
Iambic Therapeutics · Phase I · Recruiting
Josh Abramson, Jonas Adler, Jack Dunger et al. · 2024-08-14
This is the paper Isomorphic will use to justify its next fundraise. The hit rates are genuinely impressive, but the authors quietly bury the fact that 9 of the 12 targets are well-characterized pockets with abundant co-crystal structures. Try this on a novel target with no structural data and the magic fades fast.
John Ingraham, Max Barber, Gevorg Grigoryan et al. · 2024-06-19
Generate Bio finally published the details behind their platform, and the results are strong — designed binders with sub-nanomolar affinity in 60% of cases, compared to ~15% for physics-based approaches. But the paper conveniently avoids discussing developability, immunogenicity, and manufacturability. Binding is the easy part.
A Phase I Study Evaluating ISM8207 in Subjects with MASH-Related Fibrosis
Insitro · Phase I · Active, not recruiting
MASH is a graveyard of failed clinical programs, but Insitro claims their iPSC-derived hepatocyte models identified a novel target that prior screens missed entirely. If they can show fibrosis regression at Phase I doses, this will be the most important proof point for ML-driven target discovery to date.
A Phase I/II Study of REC-4881 in Patients with Familial Adenomatous Polyposis
Recursion Pharmaceuticals · Phase I/II · Recruiting
The first real clinical test of whether phenomics-driven target discovery produces better candidates than traditional approaches. FAP is a shrewd indication choice — small patient population, clear biomarkers, and enormous unmet need. If this works, it validates the entire Recursion thesis.