Scalable Agentic Reasoning for Designing Biologics Targeting Intrinsically Disordered Proteins
1. StructBioReasoner is a scalable multi-agent system aimed at designing biologics against intrinsically disordered proteins (IDPs), treating “what to design next” as an autonomous, HPC-orchestrated reasoning problem rather than a single-model generation task.
2. The key algorithmic idea is a tournament-based reasoning loop: specialized agents propose competing hypotheses (interfaces/hotspots, scaffolds, design methods), evaluate them with structure prediction simulation energetics, then promote winners and cull/annotate failures for the next round.
3. The system is explicitly built for IDP complexity: instead of assuming one stable target structure, it reasons over conformational heterogeneity and uses interactome-scale simulation to identify which interfaces are most plausible and therapeutically relevant to disrupt.
4. A major practical contribution is end-to-end tool orchestration on supercomputers via a federated agentic middleware (Academy), integrating workflow execution (Parsl) and remote/federated execution (Globus Compute) to scale design campaigns across heterogeneous HPC resources.
5. Literature grounding is treated as first-class infrastructure: HiPerRAG builds target-specific corpora (e.g., 1520 full-text papers for NMNAT-2), parses PDFs at scale (AdaParse), indexes embeddings (FAISS), and converts retrieved evidence into structured assertions a shared knowledge graph to reduce hallucinations.
6. Toolchain integration spans: structure prediction (Chai-1, Boltz-2x, plus PDB lookup), molecular dynamics (OpenMM; explicit/implicit solvent decisions), analysis (RMSD/RMSF/SASA/RoG; interaction energies), and binding estimation (MM-PBSA as a throughput/accuracy compromise for large campaigns).
7. Der f 21 benchmark (structured target): from 842 designed binders, 787 passed QC/verification; after MD MM-PBSA, 50.98% beat a literature human-designed reference binder (BindCraft binder 10) by a defined free-energy threshold, while most sequences remained novel (<30% identity vs prior BindCraft designs).
8. Der f 21 mechanistic consistency: simulations highlighted frequent contacts near known IgE-relevant epitope residues; designs often formed salt-bridge interactions (e.g., targeting E7), aligning computationally inferred interfaces with experimentally implicated allergen epitopes.
9. NMNAT-2 benchmark (IDR-rich, interactome-driven): the system simulated 18 interactome partners identified via RAG, prioritized interfaces dominated by IDR interactions, and recovered the well-studied NMNAT2:p53 interface as one binding mode among large-scale designed candidates.
10. NMNAT-2 scale and modes: 97,066 binders passed QC/structural validation (out of 266,606 generated), each run through explicit-solvent MD; analyses revealed three major binding modes, two targeting the IDR, and 84.5% of successful binders contacted at least one IDR residue—mirroring native interactome contact patterns.
11. Optimization is incorporated as an agent: preference pairs are constructed from multi-metric scoring (free energy, stability via RMSD/RMSF, developability features), enabling direct preference optimization (DPO) to bias future generation toward better candidates without full retraining.
12. Scaling results on Aurora (exascale-class system): MD agent showed robust weak scaling to 256 nodes (~26.6 µs/hour aggregate, ~80% efficiency vs 64-node baseline); binder design scaled to 512 nodes (efficiency drops beyond 256 nodes due to filesystem I/O); MM-PBSA became I/O-bound beyond 64 nodes, motivating staged filtering before expensive energetics.
💻Code:
github.com/IDeA-ANL-ORNL/Str…
📜Paper:
arxiv.org/abs/2512.15930
#ComputationalBiology #ProteinDesign #IDP #IntrinsicallyDisorderedProteins #AgenticAI #MultiAgentSystems #RAG #MolecularDynamics #HPC #Exascale #Biologics #DrugDiscovery