Joined May 2009
14,315 Photos and videos
Pinned Tweet
🚨I have a new book coming out October 20: Co-Existence! It is about how we live & work with AIs that are sometimes (but not always) smarter than we are. And it has a cool cover. You can pre-order: co-existence.ai/ And here is a post with context: oneusefulthing.org/p/co-exis…
67
84
729
69,290
I think the assumption that you should use smaller models for less important tasks is flawed (or at least deserves much more careful consideration). Big models are generally better at everything but cost, so it is worth considering whether gains in non-key tasks would be valuable
46
12
413
28,878
In fact, areas that are not important in your organization could benefit even more from applying smarter models, since the organizational capabilities & human capital in those areas are weak, and good AI might be a cheap way to increase capacity!
6
6
72
8,334
Has there been anything good written about the failure of Mistral to keep up with both the Big Three and Chinese labs? They have talent and national backing, but despite being Europe’s only frontier lab (Google Deepmind’s UK lab aside), they haven’t been able to close the gap
67
31
705
73,818
I get that Mistral has largely given up on building frontier models, as many comments have pointed out, but it absolutely did not start out that way. (& a bet on being a vendor of frontier open weights models is only a good bet as long as Chinese labs are allowed to release them)
6
4
114
13,417
I don’t think this is going to result in more open weights models, as I wrote before the Anthropic news, if Mythos-level models are considered risky, China will also not want them to be open. And you can’t build a Mythos-class model without a very regulatable compute footprint
Replying to @emollick
I honestly don't understand the assumption that there will be continued open weights models. At some point, China will regulate release of Mythos-class models or the companies making open weights models will switch to closed. There will still be open weights, just not frontier.
50
41
484
56,265
And by regulatable amounts of compute, I mean training a Mythos class model uses enough power and chips that national governments will obviously notice. No ine is training a model of that size without permission
4
2
67
10,403
I wrote this a few months ago right after the Anthropic/DoW conflict & Citrini & Block: “But I think that single week is a good illustration of what the near future will feel like… as the stakes go up, it is likely things will feel even more unstable..” oneusefulthing.org/p/the-sha…
11
18
171
15,810
Well, this situation is confusing.
44
14
958
83,725
(As of now, I have no additional information beyond what has publicly been released. Any additional speculation on my behalf would be based on the public statement from Anthropic and rumors, so I can't really add anything useful on X tonight beyond "this situation is confusing")
6
3
146
19,521
Fable: "create a visually interesting shader that can run in twigl-dot-app make it like an infinite city of neo-gothic towers partially drowned in a stormy ocean with large waves." "Make it better" All of this is procedurally generated.
I had early access to Opus 4.8. Was impressed by it. Here is Opus 4.8's one shot of "create a visually interesting shader that can run in twigl, make it like an infinite city of neo-gothic towers partially drowned in a stormy ocean with large waves" (this is all done with math)
43
69
1,237
1,678,127
10 months later, I gave Claude Code with Fable the same brief, asking it to construct SimRefinery from surviving screenshots and documentation. Fully playable, with a learning mode & all sorts of sophistication. Look at the difference from the old version! simrefinery.netlify.app/
27 Sep 2025
I gave ChatGPT Codex an article & screenshot from a famous, lost Maxis simulation, SimRefinery, and asked it to create it for me. It built a playable prototype for me while I did other stuff, never touching any code once, instead just occasionally poking Codex for small changes.
37
41
767
212,948
Here is the old GPT-5.1 version in contrast: emollick.github.io/simrefine…

1
12
8,011
The shape of the graph is getting very familiar.
Claude Fable 5 scores very well on FrontierMath: Tiers 1–4 (v2), reaching 87% on Tiers 1–3 and 88% on Tier 4. This continues a streak of Anthropic models improving rapidly at math.
30
37
1,182
221,218
There has been a push to use OpenEvidence AI for doctors. But this paper suggests general models are much better: “Frontier LLMs outperformed clinical AI tools in all three evaluations. Clinical AI tools performed comparably to auto-enabled Google Search AI Overview on the RCQ.”
For medical information, general AI frontier models (Google, OpenAI, Anthropic) outperformed specialized @EvidenceOpen and @UpToDate as assessed by 12 US clinicians, randomized and blinded to which model and extensive testing/benchmarks. This was not anticipated. @NatureMedicine nature.com/articles/s41591-0…
41
44
422
68,471
Why this is a big deal.
Replying to @EricTopol
>65% of US physicians use OpenEvidence, with 27 million prompts in April nbcnews.com/tech/tech-news/o…
3
3
31
11,994
Not having access to native imagegen does hold Fable back somewhat. It is really good at making PNGs, etc, but there are lots of areas (including commercially valuable ones like presentations) where having the ability to have multimodal output would be helpful/token efficient.
24
1
151
15,655
Are there toolkits (or skillsets) being created specifically for AIs to use for building games? They default to 3js, reinvent how to make sprites from scratch each time, test technical issues but not gameplay loops, etc. It would help to point AIs at some tools to focus them.
28
3
170
20,502
A funny thing is that I now get more insightful comments on my posts about obscure science fiction authors or modernist poets or Bauhaus architecture than ever before, but that is entirely due to AI powered slop accounts. Soon broad cultural interest is a sign of AI writing?
28
2
141
25,435
Out of curiosity, what do you think Cordwainer Smith would have thought of all of this?
4
2
35
13,651
It worked! (Also this is a very bad interpretation of Cordwainer Smith, the underpeople felt more than humans! Its the whole reason for The Rediscovery of Man.)
1
4
30
9,138