Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology

Large language models (LLMs) perform strongly on biological benchmarks, raising concerns that they may help novice actors acquire dual-use laboratory skills. Yet, whether this translates to...

arxiv.org

Active Site @ActiveSiteBio

Feb 19

We ran a randomized controlled trial to see whether LLMs can help novices perform molecular biology in a wet lab. The results: LLMs may help with some aspects, but we found no significant improvement on the core tasks end to end. That's lower than what experts predicted. Our findings 🧵

Shout out to @ShenZhouHong, @alexjkleinman, @alymathiowetz, @adamhowes, @xave_rg, @lucafrighetti, Joe Torres, Julian Cohen, Suveer Ganta, Deepika Pahari, and Alex Letizia. Thank you to @fmf_org, @ClaireQureshi, and @PackardFdn for supporting our work, and to our advisory board.

You can read more here:
📝 arXiv preprint: arxiv.org/abs/2602.16703
📔 Blog post: activesite.substack.com/p/rc…
🔮 Predictions from @Research_FRI: forecastingresearch.substack…

Importantly: this is a snapshot of mid-2025 novice and LLM performance. Results could change as new LLMs become more capable and easier to use in the lab, and as average elicitation skill improves. As models evolve, we aim to continue tracking how people use frontier AI in biology.

We're actively hiring scientists and operators! We especially want to find a Head of Ops to help build an engine for repeating this study regularly and developing entirely new ones. jobs.ashbyhq.com/activesite
