Inference @AnthropicAI, prev Gemini @Google, prev prev PhD @UTAustin

Joined May 2016
92 Photos and videos
Alek Dimitriev retweeted
opus 4.8 with the fable context is some real flowers for algernon shit
47
137
3,349
257,495
Alek Dimitriev retweeted
Introducing Adaline 2.0 - The Agent Self-Improvement Layer Adaline turns Traces into Behaviors, Behaviors surface Issues, Issues become auto-generated Evals Data, Adaline then generates new agent candidates and tests them. You review the winners and ship!
115
3,194
753
848,762
Alek Dimitriev retweeted
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
12,522
25,755
87,892
89,556,797
Nvidia is nothing without its people
Jun 9
JUST IN: Nvidia is now worth more than India
1
7
1,482
yann lecun in shambles
Was using Fable 5 to write my world model training code. Anthropic flagged it as frontier AI research. The steering vector kicked in and it started implementing JEPA 🤨
32
8,098
I asked Fable for an original joke and AFAICT this is genuinely novel and not bad! The shul roof springs a leak. The rabbi stays up all night preparing his case to bring before the Almighty — citations from the prophets, the accumulated merits of the congregation's grandparents, and for the closing argument, a pointed reminder of what He once did with forty days of rain. At dawn, before the rabbi can deliver a single word of it, there's a knock at the door. A stranger, moved by a dream, hands over the full cost of a new roof. The congregation celebrates. The rabbi sulks for a week. Finally his wife demands to know what's wrong. "He settled out of court."
1
13
708
Karpathy joins Anthropic. Anthropic releases Fable. Coincidence? I think not!
This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!
1
1
41
6,168
A flat performance on a benchmark with increasing test time compute sometimes means that the models are not good enough right now, but they will be soon enough. Mythos shatters the SOTA with a clean trend-line.
Introducing FrontierCode: a coding eval that raises the bar for difficulty & quality. Each task took 40 hrs of work by leading open-source maintainers. Models write sloppy code that works but isn’t maintainable. Our eval is first to measure: would you actually merge this code?
1
8
1,194
We have scores normalized by test time compute in our Mythos launch for many benchmarks!
We've known about LLM test-time compute scaling since @OpenAI o1. Yet 2 years later labs still report scalar evals for models; safety orgs are still surprised when a scaffold does better via 100x inference; and RSPs still ignore inference budget when deciding critical thresholds.
1
387
Fable 5 has entered the chat. It’s the same underlying model as Mythos 5, but with extra safeguards. It was a lot of effort to figure out how to generally release it, but now that we’ve developed robust safeguards around it, we can’t wait to get it into everyone’s hands.
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
1
9
559
In case you're wondering, yes we're feeling the AGI.
Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…
90
83
1,410
217,941
Alek Dimitriev retweeted
Amazing graph. One the best visualizations of human progress.
26
142
1,202
98,666
New Opus today and a new reduced price for fast mode on Opus 4.8! Fast mode was six times more expensive, but is now only 2x the price for 2.5x the speed, try it out!
May 28
Replying to @claudeai
Fast mode is available for Opus 4.8. It's the same model at roughly 2.5x the speed, and we've made it three times cheaper than before. Turn it on with /fast in Claude Code. On the API, contact your account manager to request access or join the waitlist: claude.com/fast-mode
1
10
1,243
Alek Dimitriev retweeted
over the weekend i checked the obvious thing, which is whether mythos is able to solve the erdos unit distance problem, aka erdos problem #90. the answer is: yea
54
143
2,004
624,698
He cut the best part, Mythos's reaction!
Humans using Mythos as seen by Mythos
1
4
1,420
Please be careful though!
Just like SF is the AI epicenter, it is also the weight loss peptide mecca. If you live here and aren’t using peptides, you at least know many people who do. The cutting edge is currently GLP-3 retatrutide, and Eli Lilly’s phase 3 trial results are out: "participants on 12 mg lost an average of 70.3 Ibs (28.3%) over 80 weeks." Incredible results!
7
2,388
Just like SF is the AI epicenter, it is also the weight loss peptide mecca. If you live here and aren’t using peptides, you at least know many people who do. The cutting edge is currently GLP-3 retatrutide, and Eli Lilly’s phase 3 trial results are out: "participants on 12 mg lost an average of 70.3 Ibs (28.3%) over 80 weeks." Incredible results!
7
1
64
57,983