Life Science Researcher @AnthropicAI. Share feedback on Claude for biology: forms.gle/bXPqhLAHeo2CSaXo8

Joined November 2017
99 Photos and videos
Pinned Tweet
Thrilled to share that I have joined @AnthropicAI as a life science researcher! I am confident that Claude will do amazing things to accelerate biology. Big things ahead!
112
51
2,006
268,980
Matt Durrant retweeted
We’re rolling out changes to make Fable 5’s safeguards for frontier LLM development visible. Starting this week, flagged requests will visibly fall back to Opus 4.8—the same as our safeguards for cyber and bio. You will see this every time it happens. On the API, any flagged requests will return a reason for their refusal (coming to server-side fallback in the next few days). We wanted to deploy Fable 5 to our users quickly and safely. Visible safeguards can be probed, so they have to be robust, which takes time to get right. Invisible safeguards can be targeted more narrowly, allowing us to ship quickly with very few false positives. We went with invisible safeguards for this reason—and that was the wrong tradeoff. You should have visibility into the safeguards we have in place, and why. We’re sorry for not getting the balance right. Making the safeguards visible makes them easier to work around, so keeping them robust to jailbreaks will unfortunately mean more false positives while we improve the classifiers. We're also tuning our bio and cyber classifiers to trigger less often on harmless requests. We know this is frustrating and we’ll do our best to keep this period as short as possible. If you think a request has been mistakenly flagged: run /feedback in Claude Code, click thumbs-down on the fallback in Claude.ai or Cowork, or file the safeguard appeal form for API requests. Your reports help us tune these classifiers and we appreciate your feedback. support.claude.com/en/articl…
668
430
5,074
834,463
Matt Durrant retweeted
Replying to @owl_posting
I know this is frustrating. As we said at launch, Fable blocks bio requests entirely for now and reroutes them to Opus 4.8. We made this tradeoff in order to get the model out safely and quickly while we work to refine the classifiers. It is absolutely our goal to enable the whole bio community to use our most powerful models, with appropriately scoped protections in place. We’re working towards this as fast as possible.
15
5
99
10,507
Matt Durrant retweeted
AI is advancing at a pace our policymaking institutions were never built for—and the gap between the two is becoming the central challenge of the technology. In his latest essay, our CEO Dario Amodei lays out how to close it. We're launching three new initiatives to support the efforts he outlines.
Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast—much faster than the policy process was built to handle. The essay lays out where I think the technology is now, and the action needed to close the gap: darioamodei.com/post/policy-…
422
453
5,470
1,439,477
Sign up for Mythos 5 access updates: claude.com/form/mythos-acces…

5
14
148
22,064
Mythos is an excellent biologist. After we first gained access to it, we tested its ability to perform agentic molecular biology research and propose new hypotheses. It was a significant improvement, its biological reasoning and taste are impressive. We give more examples here:
61
70
1,457
159,260
anthropic.com/news/claude-fa… Fable routes all biology research questions to Opus 4.8 by design. We're moving toward a trusted access program so researchers can use Mythos-class models for biology.
13
6
104
19,644
We believe that AI will do amazing things for biology and human health, and that scientists will need access to frontier intelligence to make that vision a reality. We're working on it!
34
3
127
296,998
Matt Durrant retweeted
Fable 5 is the best model I've ever used. I’ve been spending most of my time in the last few months helping to bring Mythos-level models to general availability safely. These models changed everything. So stoked that anyone can use Fable today! Can't wait to see what you all build with it.
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
20
6
227
12,577
Matt Durrant retweeted
This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!
Replying to @claudeai
Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.
1,259
2,355
25,200
2,663,041
Matt Durrant retweeted
This is THE model we use, excited to share the magic.
Replying to @claudeai
Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.
26
10
283
23,111
Matt Durrant retweeted
My favorite chart from our system card - FrontierCode is an excellent eval, and it accurately reflects the step up I feel when using Fable!
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
34
41
628
62,661
Matt Durrant retweeted
I wrote up some thoughts on AI agents and biological databases for the @AnthropicAI Science blog. 🚗🧬 Based on research with @ferbsx and @PardisSabeti
New Science Blog: Why has AI advanced faster in coding than in biology? To agents, bio databases are like cities built before cars—maddening to drive in because they're designed for different traffic. How do we build infrastructure agents can use? anthropic.com/research/agent…
2
11
102
11,222
Matt Durrant retweeted
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
1,771
4,662
28,648
18,491,506
Matt Durrant retweeted
Opus 4.8 is almost as good as Claude Mythos in most of biology benchmarks! 🔥🔥
4
26
182
14,615
Matt Durrant retweeted
Excited to release Opus 4.8 today! We heard your feedback on 4.7 and have made many fixes for 4.8. 4.8 understands nuances better, feels much more natural to talk to, and is overall a stronger collaborator on everything from coding to knowledge work.
May 28
Introducing Claude Opus 4.8: it builds on Opus 4.7 with sharper judgment, more honesty about its own progress, and the ability to work independently for longer than its predecessors. Available today at the same price.
171
93
2,095
189,463
Matt Durrant retweeted
With the help of Claude Mythos Preview, the Firefox team fixed more security bugs in April than in the past 15 months combined.
344
1,257
15,477
1,487,244
"We anticipate that bridge recombinases, together with advances in predictive bRNA design and bacterial delivery systems, will form a foundation for “synthetic microbiomics”, the systematic, programmable orchestration of gene flow within complex microbial ecosystems."
Our lab is proud to present our latest work harnessing Bridge Recombinase for genome-scale editing in diverse bacteria, microbiome editing, and programmable horizontal gene transfer.
2
20
2,335
One of the many amazing parts of this paper that you might have missed at first glance!
Replying to @bradyfcress
We then leveraged TRADE editing to "capture" entire pathways in vivo and move them between species without ever needing to extract their DNA--just molecular biology accomplished fully within the microbes. This allowed us to move functions across phyla.
3
35
4,228
Pioneering work from the @bradyfcress lab using bridge recombinases to perform programmable genomic rearrangements in diverse species (multiple phyla). Congrats to @jayman14661, @swartz_sophi, Agnès Oromí-Bosch, and team!
Our lab is proud to present our latest work harnessing Bridge Recombinase for genome-scale editing in diverse bacteria, microbiome editing, and programmable horizontal gene transfer.
1
3
29
2,913
Matt Durrant retweeted
From @bradyfcress revolutionary work!
Bridge recombinase enables versatile rewriting of bacterial genomes biorxiv.org/content/10.64898… #biorxiv_synbio
1
2
4
997