Writing up my thoughts on the AI transformation

Joined February 2007
Pinned Tweet
Finally got round to starting a blog. If you're interested in AI and software engineering, I hope you enjoy it: martinalderson.com. And feel free to subscribe to my once-a-month (max) newsletter there too!
Martin Alderson retweeted
As a result of a US government directive, we are suspending access to Claude Fable 5 for all users. You can continue to use all other Claude models. Here’s what this means for you: Across Claude products, new sessions will run on your selected default model or Opus 4.8, and existing Fable 5 sessions will end with an error. On the Claude Platform, requests to Fable 5 will also return an error. Please update your integrations to other Claude models. We know this is a disruption to your workflows; we appreciate your patience and support.
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
Enjoyed chatting with @pascalfinnete - give it a listen here: rdcl.is/a-podcast-with/marti…

Yeah, not a good launch for Opus 4.8 at all. Been seeing loads of this: it spams a huge number of tool calls, with intermittent auto mode failures on top. I've seen this across multiple sessions, even brand new ones (asking it to translate a bunch of markdown files with parallel subagents). Opus 4.7 gets it in one, and then Opus 4.8 implodes trying to OCR markdown files (!?) with a bash tool storm. cc @trq212 @bcherny
Is it just me or is Opus 4.8 in CC sometimes just absolutely useless? In this session it just got stuck in a loop calling "echo" and checking the date 20 times in a row... This has been happening very regularly since the 4.7 --> 4.8 update. WTF? @claudeai @bcherny
.@ClaudeDevs auto mode constantly flaking out on opus 4.8. seems worse on longer sessions (been like this for a while...)
So much for the frontier labs subsidising inference!
ANTHROPIC EXPECTS A 130% REVENUE SURGE TO $10.9 BILLION IN THE JUNE QUARTER AND ITS FIRST OPERATING PROFIT - WSJ
Martin Alderson retweeted
Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques... Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries. Google's distillation sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.
This is what I cannot understand about Anthropic's pricing changes. Many people love Conductor, but if you are a heavy user of Claude Code via Conductor, you're going to be running up $1k/month in additional costs. And given Conductor makes it very easy to switch to Codex (one button), it's just a huge churn incentive AND a marketing exercise for OpenAI. The only thing that makes sense (to me) is that Anthropic is still very compute-constrained and the SpaceX partnership only buys them some time...
Here's what Anthropic pricing updates mean for Conductor users:
- You can officially use your Claude sub with Conductor
- If you're on a Max subscription you get $200 in credits and then can pay at API costs
- If you use Big Terminal Mode you won't be affected
We're going to keep building the best interface for the best coding agents! Excited to show you what we've been cooking🫡
The new Claude Agent View feature is incredible. Cannot believe how far these agents have come in not much more than 12 months.
Very interesting
We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity. This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.
Martin Alderson retweeted
I waver between thinking the AI security problem is huge but manageable and thinking it's huge and unmanageable. This piece by @martinald makes a very convincing case for the latter martinalderson.com/posts/aug…
Martin Alderson retweeted
what. what. what. gpt-image-2 almost passes the pelican test...in a screenshot of a code editor.
looking like Mythos is a step change and not just marketing hype...
We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵
Resuming ~400k tokens on Max 20 uses ~4% of your 5h limit at peak times. That means the 5h usage limits are probably counted in uncached input tokens:
Pro (1x): 500k input tokens / 5h
Max 5 (5x): 2.5M input tokens / 5h
Max 20 (20x): 10M input tokens / 5h
I assume these increase (double?) at off-peak times. Obviously you'll have output and cache reads on top of this, but it feels like the uncached input tokens are what's really burning through people's limits recently.
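The estimate above can be reproduced with a quick back-of-envelope sketch. The two inputs (~400k tokens, ~4%) come from the tweet; the assumption that the plan multipliers (1x, 5x, 20x) scale the limit linearly is the tweet's guess, not a published Anthropic figure, and the function name is purely illustrative.

```python
# Back-of-envelope check of the estimate; not official Anthropic limits.
RESUMED_TOKENS = 400_000     # ~400k tokens resumed (from the tweet)
PERCENT_OF_LIMIT_USED = 4    # ~4% of the 5h limit on Max 20 (from the tweet)

# Assumption: plan multipliers scale the 5h limit linearly.
PLAN_MULTIPLIERS = {"Pro": 1, "Max 5": 5, "Max 20": 20}


def implied_limits(resumed_tokens, percent_used, observed_plan="Max 20"):
    """Infer per-plan 5h limits from one observation on one plan."""
    # Integer arithmetic avoids floating-point noise from dividing by 0.04.
    observed_limit = resumed_tokens * 100 // percent_used
    base_limit = observed_limit // PLAN_MULTIPLIERS[observed_plan]
    return {plan: base_limit * mult for plan, mult in PLAN_MULTIPLIERS.items()}


limits = implied_limits(RESUMED_TOKENS, PERCENT_OF_LIMIT_USED)
for plan, tokens in limits.items():
    print(f"{plan}: {tokens:,} uncached input tokens / 5h")
```

Since both inputs are rough ("~"), the results are order-of-magnitude estimates at best.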
Martin Alderson retweeted
The super important thing I haven’t seen mentioned yet as an upshot of this: It’s not just that people won’t HAVE to write code anymore, IT'S THAT LITERALLY IT WILL BE UNSAFE TO DO SO
Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing
Martin Alderson retweeted
As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.
Agreed. This is a great summary of how confusing things are right now. I'm actually a bit lost as well: with this weekend's price changes, I'm not sure what the state of play is with Codex now either.
I don't know what the fuss is about. Anthropic's rules on using subscriptions are very simple:
Claude Code = OK
Claude's online platform = OK
Agent SDK running in personal software = OK... ish?
Agent SDK running in commercial software = NOT OK
Claude Code running in CI = ??
Oh, maybe it's not so simple...
Agent SDK running in CI = ??
claude -p running in CI = ??
claude -p running in personal software = OK
claude -p running on open source software, but run on my personal computer = ??
claude -p running on distributed sandboxes, kicked off by me = ??
Distributing open source software which relies on claude -p, and documenting how to use your subscription with it = ??
A thousand other edge cases = ??
Let me be clear. I have never before experienced, from any developer tool, such a frustrating lack of clarity over the basic terms of usage. I personally asked, 3 weeks ago, and have received nothing but delays. The recent @bcherny announcement did absolutely nothing to clarify things. I say this as someone who just released a Claude Code course - my incentives all align with supporting Anthropic.
Second order effects of coding agents. Incredible.
Yup, platform activity is surging. There were 1 billion commits in 2025. Now, it's 275 million per week, on pace for 14 billion this year if growth remains linear (spoiler: it won't.) GitHub Actions has grown from 500M minutes/week in 2023 to 1B minutes/week in 2025, and now 2.1B minutes so far this week. So we're pushing incredibly hard on more CPUs, scaling services, and strengthening GitHub’s core features. And as a fine purveyor of hand-crafted shit code for many years, I'm not gonna weigh in on that. 🤣
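The "14 billion this year" figure can be sanity-checked with a short sketch. All numbers are taken from the tweet; the assumption that the current weekly rate simply holds flat for 52 weeks is one reading of "if growth remains linear", not something the tweet spells out.

```python
# Sanity check of the commit projection, using only figures quoted in the tweet.
COMMITS_2025 = 1_000_000_000          # total commits in 2025
COMMITS_PER_WEEK_NOW = 275_000_000    # current weekly rate
WEEKS_PER_YEAR = 52                   # assumption: rate holds flat all year

annualized_commits = COMMITS_PER_WEEK_NOW * WEEKS_PER_YEAR
growth_vs_2025 = annualized_commits / COMMITS_2025

# GitHub Actions minutes per week, as quoted (the last value is a partial week).
ACTIONS_MINUTES_2023 = 500_000_000
ACTIONS_MINUTES_THIS_WEEK_SO_FAR = 2_100_000_000
actions_multiple = ACTIONS_MINUTES_THIS_WEEK_SO_FAR / ACTIONS_MINUTES_2023

print(f"Annualized commits: {annualized_commits:,}")
print(f"Multiple of 2025 total: {growth_vs_2025:.1f}x")
print(f"Actions minutes vs 2023 weekly rate: {actions_multiple:.1f}x")
```

At a flat 275 million per week the total lands at roughly 14.3 billion, consistent with the quoted "on pace for 14 billion".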
ultraplan here!