Computational Drug Metabolism · Boston, MA

Drug Metabolism Prediction, Grounded in Quantum Mechanics

Reference Biosciences builds physics-based tools that predict how drugs are metabolized — with the accuracy of quantum chemistry and the speed of machine learning. Know your metabolic liabilities before you synthesize.

CYP3A4 · METABOLIC SITE PREDICTION N 12.8 C3-H 14.2 C5-H 18.5 C7-H 24.1 C1-H 22.7 C8-H CH₃ F DRUG SUBSTRATE H-abstraction N N N N Fe IV O ²⁻ S Cys CYP3A4 COMPOUND I ACTIVATION BARRIER ΔG‡ (kcal/mol) 10 (fast oxidation) 25+ (stable) Fe Iron(IV) O Ferryl oxo N Pyrrole N Coordinate bond Fe=O double bond

Metabolic Surprises Kill Drug Programs

Every year, drug candidates fail because of unexpected metabolism. Soft spots missed in early screening become toxic metabolites in the clinic. The current tools — rule-based predictions and empirical models — top out at ~75–80% accuracy and offer no insight into why a prediction was made or how much to trust it.

The result: expensive experimental cycles, late-stage surprises, and clinical holds that cost $50K–$500K per day.

We took a different approach. We started from the physics.

75–80%
Accuracy ceiling of current prediction tools
$50K–$500K
Daily cost of a clinical hold from missed metabolites
60%
Compounds that fail metabolic screening after synthesis

From First Principles to
30-Second Predictions

Reference Biosciences built the world's largest dataset of quantum-mechanically computed activation barriers for drug metabolism — over 32,000 barrier calculations across cytochrome P450, aldehyde oxidase, and FMO enzymes. We use this dataset to train delta-learning models that correct fast semi-empirical estimates to DFT-level accuracy.

Physics-Based

Every prediction traces back to quantum mechanical transition states — not pattern matching on historical data. When chemistry changes, our predictions change with it.

Uncertainty-Aware

Every prediction comes with a calibrated confidence interval and a transparency dashboard. You know when to trust the model and when to run the experiment.

Fast Enough to Matter

Screen 5,000 compounds across multiple CYP enzymes in 24 hours. Get per-molecule predictions in under 30 seconds. Fast enough to fit inside your design-make-test cycle.

MetaboQM: Predict. Screen. Validate.

One platform, three tiers — from quick predictions to regulatory-grade analysis.

MetaboQM Predict

Predict

Per-molecule metabolism prediction in seconds

Paste a SMILES string. Select your enzymes. In 30 seconds, get a full regioselectivity map, ranked metabolite predictions, and a transparency dashboard showing transition state geometries, spin state analysis, and uncertainty decomposition.

  • Regioselectivity heatmap for every hydrogen-bearing site
  • Predicted activation barriers with confidence intervals
  • Interactive 3D transition state visualization
  • Covers CYP3A4, CYP2D6, CYP2C9, CYP2C19, CYP1A2, AO, FMO
Learn more →
MetaboQM Screen

Screen

Library-scale metabolic liability screening

Upload a compound library. Get back a ranked report: metabolic liability scores, site-specific soft spot analysis, predicted metabolites, and a prioritized list of compounds to validate experimentally.

  • Metabolic liability score (0–100) for every compound
  • Actionable soft-spot identification for medicinal chemistry
  • Optimized experimental validation list
  • Delivered as interactive dashboard + Excel + PDF
Request a pilot →
MetaboQM Validate

Validate

Regulatory-grade metabolism analysis

Comprehensive QM-level characterization of all metabolic sites for a single drug candidate. Full DFT and CASSCF transition states for the highest-risk sites, with uncertainty quantification suitable for IND/NDA filings.

  • Full quantum chemistry on the top 10 metabolic sites
  • Multi-reference treatment for complex electronic structure
  • Regulatory-formatted report for FDA submission
  • 1–2 week turnaround
Contact us →

No Black Boxes

We believe predictions without transparency are just guesses. Every MetaboQM result traces back to real quantum mechanical calculations — and we show our work.

32,000+

QM-Computed Barriers

The largest dataset of its kind — activation barriers computed across three enzyme classes using validated DFT methods with DLPNO-CCSD(T) anchoring.

±

Calibrated Uncertainty

Every prediction includes a confidence interval with uncertainty decomposition. You always know when to trust the model and when to run the experiment.

Open

Methodology

We validate against established benchmarks including the Sheridan drug-like set. Publications and benchmark results will be shared as they become available.

Under the Hood

For the computational chemist who wants to know.

01

Structure Parsing

SMILES → 3D conformer generation → identification of all C–H sites and heteroatom lone pairs.

02

Baseline Features

GFN2-xTB single points at each site — barrier estimates, Mulliken charges, bond orders, Wiberg indices, HOMO–LUMO gaps.

03

Molecular Descriptors

ECFP4 fingerprints, pharmacophore features, electronic descriptors from xTB.

04

Delta-Learning Prediction

A gradient-boosted ensemble predicts the correction from xTB to DFT-quality barriers, trained on 32,000+ data points with active learning.

05

Automatic Triage

A classifier flags sites with likely multi-reference character (DFT spread > 3.0 kcal/mol) and applies CASSCF corrections where needed.

06

Visualization & Output

Template-based transition state rendering and metabolite prediction using established P450 reaction rules.

Built for How You Actually Work

Lead Optimization

"We have 500 compounds in our SAR campaign. Which ones will have metabolic problems?"

MetaboQM Screen ranks your library and identifies exact soft spots so your medicinal chemists can fix them before synthesis.

Hit-to-Lead

"This scaffold looks promising but it gets cleared too fast."

MetaboQM Predict shows which sites are vulnerable and how blocking them — fluorine, deuterium, steric shielding — would change the metabolism profile.

Regulatory Filing

"FDA is asking us to characterize all metabolites for our Phase II candidate."

MetaboQM Validate delivers a comprehensive QM-based metabolism report formatted for regulatory submission.

About Reference Biosciences

Reference Biosciences is a Boston-based computational chemistry company focused on one goal: making drug metabolism predictable.

We started with a simple observation — the tools available to DMPK teams rely on empirical pattern matching and offer no insight into prediction confidence. We believed that starting from quantum mechanical first principles and building rigorous uncertainty quantification into every prediction would produce a fundamentally better tool.

So we built one. Over 12 months, we computed more than 32,000 quantum mechanical activation barriers across three major enzyme classes, developed an automated triage system for multi-reference electronic structure, and trained delta-learning models that deliver DFT-quality accuracy in seconds.

The result is MetaboQM — a platform built by scientists, for scientists, designed to fit inside real drug discovery workflows.

Get in Touch

Boston, MA — United States
ReferenceBiosciences.com

Ready to See What MetaboQM Can Do?

We offer risk-free pilots: send us 50 compounds where you know the experimental metabolites. We predict blind. If we beat 85% accuracy, let's talk about a license. If not, you owe us nothing.