The reason large AI companies with virtually unlimited capital, compute, and computer science talent have not transformed biology is very simple.
They do not understand biology.
In biology, the hardest problem is not accessing information. We already have enormous amounts of publicly available data, papers, databases, omics datasets, and experimental results. Much of it remains underutilized. Most of it can be converted into computable formats straightforwardly.
The difficult part is knowing which questions matter.
What do we actually understand? What don’t we understand? Which hypotheses are worth testing? Which experiments would meaningfully reduce uncertainty? Where are the conceptual bottlenecks preventing progress?
These are scientific judgment problems, not database problems.
What biology lacks is not data. It lacks enough biologists with strong quantitative and computational skills who can identify important questions and use available tools to answer them.
Biology is different from coding. The challenge is not retrieving information. The challenge is deciding what information is worth generating in the first place.
Once AI companies recruit enough biologists who know how to identify important questions and how to use quantitative tools to address them, the opportunities will be enormous.
The bottleneck is not the databases. The bottleneck is biological insight.
New Science Blog: Why has AI advanced faster in coding than in biology?
To agents, bio databases are like cities built before cars—maddening to drive in because they're designed for different traffic.
How do we build infrastructure agents can use?
anthropic.com/research/agent…