Computational Structural Biologist |

Joined April 2017
99 Photos and videos
Jun 13
Deep learning based design of buried hydrogen bond networks with HBDesigner ift.tt/HqefsZ6 #biorxiv_bioE
3
18
1,553
Jun 10
ipSAE @RolandDunbrack can distinguish correct heterocomplex pairs when multiple functionally homologous proteins are present within a BGC.
高効率なタンパク質間相互作用予測による生合成遺伝子クラスターのネットワーク解析 AlphaFold3の多重配列アラインメントをMMSeqs2に置き換えることで、約50万組のタンパク質ペアから、機能未知タンパク質も含む約15000組の信頼性の高いヘテロマー相互作用を予測した #BNTNJC biorxiv.org/content/10.1101/…
6
36
6,285
Jun 9
From Caleb, ipsae_max 👉🏻correlation with kD. biorxiv.org/content/10.64898…
ipTM and ipSAE don't predict binding affinity. A-alpha Bio measured 7M interactions: almost no correlation. Dug into this with Michael Holden in Ep 1 of Protein Engineering in Practice (by @ranomics): youtu.be/cVmGeFGsVA0
1
3
179
sk retweeted
Antibody LMs learn what looks antibody-like, but not how selection turns naive germline antibodies into strong binders. @aakarshv1 and I are excited to share CoSiNE, a model that learns this germline-to-mature process for variant effect prediction and antibody design. (1/8)
8
50
221
41,495
sk retweeted
At @Biohub, our goal is to build models that accelerate scientific discovery and progress toward the cure to disease. We’re releasing all of this under MIT license allowing commercial and non-commercial use. Read more here: biohub.ai/esm/protein/
2
32
173
29,976
sk retweeted
Today we're announcing ESMFold2, an open scientific engine to power prediction, design, and discovery across protein biology. The new model delivers state of the art performance on protein interactions, especially antibodies, a critical modality for therapeutics. We have designed and validated miniprotein binders and single chain antibodies across five therapeutic targets that are important in cancer and immunology. We are seeing very high success rates, and affinities at levels consistent with therapeutic activity. We’re also releasing an atlas of 6.8 billion proteins, and 1.1 billion predicted structures. ESMFold2 is built on a state of the art language model that has been trained on billions of protein sequences. A world model of protein biology emerges through language modeling. We’ve used the techniques of mechanistic interpretability developed to understand large language models to understand the concepts ESM uses to represent proteins. The model’s representation space has a compositional organization of features across scales, levels of complexity, and abstraction, that reflects and mirrors the understanding of protein biology developed through a century of empirical science. This understanding emerges without prior knowledge, just from language modeling of protein sequences. Language models are becoming a powerful substrate to understand and program biology. The design of protein interactions is one of the most fundamental problems in biophysics, and has critical implications for the discovery of new medicines. A simple gradient based search with the model was able to discover high-affinity protein binders. I'm excited by the potential this has to accelerate basic science and the understanding of proteins. And especially for the new avenues it opens up for therapeutic design and medicine.
74
447
1,612
599,056
sk retweeted
May 20
- Protein Data Bank (PDB) - Foldseek - AlphafoldDB - Uniprot - Interpro - Jaspar - protein-sequence-similarity-search - protein-sequence-msa - Ensemble - Pubchem - Chembl - pubmed - unibind-database - clinical-trials-database - opentargets-database Finally Pymol OMG!!
1
1
185
May 20
Antigravity 2.0 - Computational Biologist. Check it out and thank me later😉#phd #postdoc
1
1
7
430
May 20
- Protein Data Bank (PDB) - Foldseek - AlphafoldDB - Uniprot - Interpro - Jaspar - protein-sequence-similarity-search - protein-sequence-msa - Ensemble - Pubchem - Chembl - pubmed - unibind-database - clinical-trials-database - opentargets-database Finally Pymol OMG!!
1
1
185
May 9
repo now available on deepwiki: deepwiki.com/aqlaboratory/ge…
Introducing Genie 3, a generative protein model that substantially advances the state-of-the-art for binder design, increasing in silico success rates by up to 20x on hard multimeric targets. It also debuts a form of inference-time scaling unobserved in other design models. 🧵1/8
1
133
sk retweeted
AI can now design antibodies that bind with atomic precision, but not ones that cells can produce. Our preprint closes this gap, delivering a structural principle, an AI-guided rescue pipeline, and adalimumab variants with 20-100x in vivo potency. biorxiv.org/content/10.64898…

7
38
196
11,374
Every time I tell AI utopianists that biology is too complex for AI to "solve", they cite the success of AlphaFold. No, AlphaFold did not "solve" protein folding. It gets broad structures correct ~70-88% of the time (depending on evaluation), enabling useful but flawed statistical guesses. True "solving" would require ~99.9% accuracy, practically zero meaningful edge cases, and high confidence across fine details like side chains and conformations. Even then, this is just one narrow slice of the complexities of proteomics. The persistent gap between the "AlphaFold solved protein folding" claim and reality is a perfect example of AI overhype in biology.
89
134
837
122,128
Apr 22
If I had $10M–$1B? I'd go all-in on one modality. One of these; minibinders, cyclic peptides, nanobodies Pick one. Master it. Skip the "let's do everything" trap. Our designs were quietly excluded from the recent RBX1 competition—no feedback, no documented reason nothing! 👇
1
2
337
Apr 22
Despite the silence from one corner, the response elsewhere has been incredible. Several companies academic groups reached out🤝
1
114
Apr 22
And yes—our 10 designs? They're now shipping for expression & BLI affinity testing. 🚀
100
Mar 22
sorry for typo
1
85
Mar 14
Here's an illustration from Brian explaining why Contact Molecular Surface(CMS) is better than Rosetta SASA or Shape Complementarity. You can install it using: pip install py-contact-ms
1
4
210