Introducing Zatom-1, the first end-to-end, fully open-source foundation model for 3D chemistry! This was a great collaborative effort with many brilliant scientists. I'm grateful to have played a small part.
Paper: arxiv.org/abs/2602.22251
Code: github.com/Zatom-AI/zatom
ALT Architecture of the Zatom-1 foundation model for 3D molecules and materials. A unified Transformer trunk processes (noisy) atomic, structural, and conditional inputs. The model features two training stages: (1) multimodal flow pretraining using the final trunk layer L’s representations to create new molecules or materials; and (2) multi-task finetuning of additional downstream task backbones and heads that use a specified trunk layer K’s representations to predict properties, energies, and forces.
Interested in learning more about how flow matching has begun to advance bioinformatics and computational biology? And how it has already started making strides towards the development of an AI-based virtual cell?
Paper: authorea.com/users/637193/ar…
Code: github.com/amorehead/awesome…
🔬Interested in training AlphaFold3 faster, at scale, and beyond NVIDIA GPU? Now you can.
AlphaFold3 is a major leap in biomolecular modeling, but behind the scenes, it introduces severe system bottlenecks:
🧠 2D EvoAttention spikes memory usage
📉 Retrieval-augmented training pipeline causes long GPU idle time
⛔ Frequent but memory-intensive ops slow everything down
Today, I'm excited to announce MegaFold, a fully open-source system to make AlphaFold3 training fast, scalable, and cross-platform on both NVIDIA and AMD GPUs.
MegaFold delivers:
⚡ Up to 1.73x / 1.62x faster training on NVIDIA H100 / AMD MI250
🧬 Up to 1.35× longer sequences compared to PyTorch baseline
Key features:
🚀 Memory-Efficient EvoAttention via portable Triton kernels
💡 Ahead-of-Time Caching to eliminate GPU idle time in retrieval pipelines
🔗 DeepFusion for reducing overhead of small but frequent memory-intensive AF3 ops
📘 Project page: supercomputing-system-ai-lab…
📄 Paper: arxiv.org/pdf/2506.20686
💻 Code: github.com/Supercomputing-Sy…
🤝 MegaFold is developed in collaboration between UIUC SSAIL Lab and researchers from University of Missouri and Lawrence Berkeley National Laboratory.
Kudos to the brilliant team: Hoa La, Ahan Gupta, Alex Morehead, Jianlin Cheng
#AlphaFold3#AI#ProteinFolding#Bioinformatics#AMD#Triton#CrossPlatform#OpenSource
v0.6.0 of PoseBench is now available, featuring (1) results for AlphaFold 3, the new PLIF-WM metric, and (3) the new DockGen-E dataset of challenging docking targets. See the GitHub release below for more details.
Paper: arxiv.org/abs/2405.14108
Code: github.com/BioinfoMachineLea…
Excited to release FlowDock, an all-atom flow matching model for generative protein-ligand docking and affinity prediction (ranked as a top method in CASP16)!
Paper: arxiv.org/abs/2412.10966
Code: github.com/BioinfoMachineLea…
PoseBench v0.5.0 is now released, featuring (1) docking results with AlphaFold 3's predicted protein structures, (2) Chai-1's benchmarking results, and (3) support for running exhaustive HPC benchmarking sweeps. 🧪
Paper: arxiv.org/abs/2405.14108
Code: github.com/BioinfoMachineLea…
At @icmlconf this week where I'm presenting PoseBench at the AI4Science workshop as a spotlight. I'll also give an oral presentation on RNA-FrameFlow at the SPIGM workshop (AI4Science spotlight as well!) on behalf of many amazing collaborators including @rishabh16_ and @chaitjo.
Introducing PoseBench, the first deep learning (DL) benchmark for practical protein-ligand docking, which provides actionable insights for the development of future docking methods. 🧵
Paper: arxiv.org/abs/2405.14108
Code: github.com/BioinfoMachineLea…
Unfortunately can't join in-person @icmlconf 🇦🇹 but our awesome co-author @MoreheadAlex will be there!!!
Check out our Oral presentation @ SPIGM Workshop on 26 July and Spotlight poster @AI_for_Science Workshop on 27 July ✨🥳
See our poster schedule below 👀👇🏻
🧬🤖 Introducing RNA-FrameFlow –– an unconditional generative model for 3D RNA backbone design!
📑: arxiv.org/abs/2406.13839
🧰: github.com/rish-16/rna-backb…
Our method generates ≥ 40% self-consistent *all-atom* RNA backbones that are globally and locally realistic 💪🏻
1/9
Introducing PoseBench, the first deep learning (DL) benchmark for practical protein-ligand docking, which provides actionable insights for the development of future docking methods. 🧵
Paper: arxiv.org/abs/2405.14108
Code: github.com/BioinfoMachineLea…
LoG Conference 2024 is back !!!👉 We are looking for more reviewers! We have a special emphasis on review quality via monetary rewards, a more focused conference topic, and low reviewer load (max 3 papers). But for this we need your help! Sign up here: forms.gle/Nuff4ndVZDFTisb38!