Interesting post from
@owl_posting asking whats next for antibody design. I agree with him that in vivo properties are likely the answer, but the question is how do we get there?
The PDB has enabled de novo binding models to be incredibly successful. Properties like developability likely are implicit in the models, given the data they are trained on is well formed antibody crystal structures. Fwiw at
@ManifoldBio using our open source mBER de novo design approach, we also see very similar affinity/developability properties for our molecules, when we get around to it we might update the preprint to reflect this. Of course you are welcome to try out mBER as it is open source:
github.com/manifoldbio/mber-…
Back to in vivo properties, the challenge here is that unlike the PDB, we don't have an accurate and high throughput dataset for antibodies in terms of PK/PD or ADAs etc. Using standard methods, these traits are quite hard to measure in high throughput, which make learning them very challenging. And arguably, these properties are more important than binding for making a successful drug.
These properties are complex, essentially you are asking "I have a binder that I know binds a target of interest in vitro: but does it get to the right place, bind the right target (and not the 20k other possibilities in the body), last long enough to enact its function (i.e. PK) and then perform the function on the target (inhib, activate etc) in a highly complex living system that is nothing like the petri dish that the molecule likely was initially tested in before?"
To some extent, binding has been a solved problem for quite a while. If you talk to folks like Dane Wittrup (inventor of yeast display, founder of Adimab), he will tell you that using yeast display Adimab can design binders to specific epitopes / gpcrs, whatever you want. They will bind with high affinity and specificity in vitro. These new de novo methods indeed speed this process up by a couple months, but fundamentally the really question is still, does your molecule work in a living system, and have you optimized for all the properties (PK,PD, ADA) that will enable clinical success.
This is exactly what why we built
@ManifoldBio. We saw the need for high throughput in vivo data to unlock the real power of AI. AI / de novo design are only as good as the data they are trained on, without in vivo data, we will never learn in vivo properties! So we built a measurement engine to solve that.
We have generated PK data on over 12,000 molecules to over 100 targets of interest to date. We have generated in vivo tissue enrichment data on half a million molecules to almost a thousand targets. This is the data we believe will unlock the true promise of AI. Lot's of folks talking about virtual cells, but perhaps we should start thinking about virtual organisms. Even if you had a virtual PK model, this would be a huge benefit to drug discovery. In fact, we already are building such a model, more details soon :)
turns out everything nabla’s model claims it can do, chai’s can too!
so i guess the suspicion that developability being a naturally emergent property of a well-trained model is true
the GPCR result also seems emergent (surprising!), given that chai-2 could do it from the start but just was never tested on it in the original release
insane speed from chai, i wonder if this result was just sitting on ice or they literally contracted a CRO the second they saw Nabla’s release. the post at 11:48pm PST makes me feel like it was the latter, which is a fun story
it does beg the question a little of whats next to hill climb on in this subfield if the traits i assumed are next to optimize for (solubility, etc) are simply going to naturally pop out of any good model, regardless of who are the ones developing it. in-vivo properties i guess?