Grand Challenges in Computational Small Molecule Drug Discovery
This work, a massive undertaking two years in the making, surveys scientific and technical problems where better prediction would materially improve drug discovery outcomes.
Benchmarks of methods or models are certainly useful, but we've still not agreed which problem spaces AI can be applied to actually mean "better" drug discovery.
Chemistry: synthesis planning, process chemistry, covalent inhibitor design, chemical stability/degradation
Structure: crystal packing/polymorphism, protein structure, protein dynamics, protein–ligand pose prediction, cryptic pocket discovery
Energy: binding affinity, selectivity, kinetics, allostery/agonism
Pharmacology: pKa, solubility/aggregation/permeability, plasma protein binding and volume of distribution, clearance, oral bioavailability, metabolism, toxicity, dose prediction, PK/PD
The authors propose that the AI-led transformation will come from solving specific, measurable problems as opposed to fully end-to-end black box solutions. Even in the world where the latter comes true, these challenges are highly valuable evaluations for the efficacy of future protocols.
For each challenge the authors outline:
• the underlying physical problem
• why it matters
• the current state of the field
• inputs, outputs, and data types
• metrics that would define meaningful progress
Congratulations on the preprint Woody Sherman, Connor W. Coley, and co-authors.
-----
@tamarindbio is a collection of 250 molecular design tools such as AlphaFold and most of current best solutions to the grand challenges discussed in the paper, accessible via web interface and API.