Martin Alderson

Martin Alderson

88 Photos and videos

Tweets

Pinned Tweet

Martin Alderson

@martinald

16 Dec 2025

Finally got round to starting a blog, if you're interested in AI and software engineering I hope you enjoy it: martinalderson.com. And feel free to subscribe to my once a month max newsletter there too!

Martin Alderson

martinalderson.com

5,395

ClaudeDevs

Martin Alderson retweeted

ClaudeDevs

@ClaudeDevs

Jun 13

As a result of a US government directive, we are suspending access to Claude Fable 5 for all users. You can continue to use all other Claude models. Here’s what this means for you: Across Claude products, new sessions will run on your selected default model or Opus 4.8, and existing Fable 5 sessions will end with an error. On the Claude Platform, requests to Fable 5 will also return an error. Please update your integrations to other Claude models. We know this is a disruption to your workflows; we appreciate your patience and support.

Anthropic

@AnthropicAI

Jun 13

The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…

3,588

7,224

44,283

12,470,551

Martin Alderson

Martin Alderson

@martinald

Jun 2

Enjoyed chatting with @pascalfinnete - give it a listen here: rdcl.is/a-podcast-with/marti…

Martin Alderson

Martin Alderson

@martinald

Jun 1

Yeah not a good launch for opus 4.8 at all. Been seeing loads of this spamming huge amount of tool calls. And intermittent auto mode failures on top of this. I've seen this across multiple sessions, even brand new ones (asking it to translate a bunch of markdown files with parallel subagents), opus 4.7 gets it in one and then opus 4.8 implodes trying to OCR markdown files (!?) with a bash tool storm cc @trq212 @bcherny

Xander Steenbrugge

@xsteenbrugge

May 30

Is it just me or is Opus 4.8 in CC sometimes just absolutely retarded? In this session it just got stuck in a loop calling "echo" and checking the date 20x times in a row... This has been happening very regularly since the 4.7 --> 4.8 update. WTF? @claudeai @bcherny

0:46

159

Martin Alderson

Martin Alderson

@martinald

May 30

.@ClaudeDevs auto mode constantly flaking out on opus 4.8. seems worse on longer sessions (been like this for a while...)

135

Martin Alderson

Martin Alderson

@martinald

May 30

cc @trq212 @bcherny

100

Martin Alderson

Martin Alderson

@martinald

May 20

So much for the frontier labs subsidising inference!

*Walter Bloomberg

@DeItaone

May 20

ANTHROPIC EXPECTS A 130% REVENUE SURGE TO $10.9 BILLION IN THE JUNE QUARTER AND ITS FIRST OPERATING PROFIT- WSJ

187

Bindu Reddy

Martin Alderson retweeted

Bindu Reddy

@bindureddy

May 14

Gemini 3.2 Flash - Capitalizing on DeepMind's clever distillation techniques... Rumors are that benchmarks show it's hitting 92% of GPT 5.5's performance on coding and reasoning tasks while being 15-20x cheaper on inference costs. The latency improvements are insane - sub-200ms for most queries. Google's distillation sparsity techniques are paying off massively. They've essentially compressed a frontier model into a flash variant without the usual quality cliff.

157

184

3,658

920,828

Martin Alderson

Martin Alderson

@martinald

May 14

This is what I cannot understand about Anthropic's pricing changes. Many people love conductor, but if you are a heavy user of Claude code via conductor you're going to be running up $1k/month in additional pricing. And given conductor makes it very easy to switch to codex (one button), it's just a huge churn incentive AND marketing exercise for OpenAI. The only thing that makes sense (to me) is Anthropic is still v compute constrained and the spaceX partnership only buys them some time...

Charlie Holtz

@charlieholtz

May 13

Here's what Anthropic pricing updates mean for Conductor users: - You can officially use your Claude sub with Conductor - If you're on a max subscription you get $200 in credits and then can pay at API costs - If you use Big Terminal Mode you won't be affected We're going to keep building the best interface for the best coding agents! Excited to show you what we've been cooking🫡

360

Martin Alderson

Martin Alderson

@martinald

May 12

The new Claude Agent View feature is incredible. Cannot believe how far these agents have come in not much more than 12 months.

118

Martin Alderson

Martin Alderson

@martinald

May 6

Very interesting

Claude

@claudeai

May 6

We’ve agreed to a partnership with @SpaceX that will substantially increase our compute capacity. This, along with our other recent compute deals, means that we’ve been able to increase our usage limits for Claude Code and the Claude API.

131

Rowland Manthorpe

Martin Alderson retweeted

Rowland Manthorpe

@rowlsmanthorpe

May 4

I waver between thinking the AI security problem is huge but manageable and thinking it's huge and unmanageable. This piece by @martinald makes a very convincing case for the latter martinalderson.com/posts/aug…

29th August 2026: a scenario

A fictional scenario about what AI changes for cloud security, written because the technical version of the argument doesn't land with anyone except engineers.

martinalderson.com

659

Justin Schroeder

Martin Alderson retweeted

Justin Schroeder

@jpschroeder

Apr 21

what. what. what. gpt-image-2 almost passes the pelican test...in a screenshot of a code editor.

104

2,885

320,914

Martin Alderson

Martin Alderson

@martinald

Apr 14

looking like Mythos is a step change and not just marketing hype...

AI Security Institute

@AISecurityInst

Apr 13

We conducted cyber evaluations of Claude Mythos Preview and found that it is the first model to complete an AISI cyber range end-to-end. 🧵

159

Martin Alderson

Martin Alderson

@martinald

Apr 8

Resuming ~400k tokens on Max 20 uses ~4% of your 5h limit at peak times. That means 5h usage limits are probably for uncached input tokens: Pro (1x): 500k input tokens / 5h Max 5 (5x): 2.5M input tokens / 5h Max 20 (20x): 10M input tokens / 5h I assume these (double?) at off peak times. Obviously you'll have output and cache reads on top of this, but feels like the uncached input tokens are really burning thru people's limits recently.

176

Super Dario

Martin Alderson retweeted

Super Dario

@inductionheads

Apr 8

The super important thing I haven’t seen mentioned yet as upshot of this: It’s not just that people won’t HAVE to write code anymore, ITS THAT LITERALLY IT WILL BE UNSAFE TO DO SO

Anthropic

@AnthropicAI

Apr 7

Introducing Project Glasswing: an urgent initiative to help secure the world’s most critical software. It’s powered by our newest frontier model, Claude Mythos Preview, which can find software vulnerabilities better than all but the most skilled humans. anthropic.com/glasswing

130

2,345

157,527

Kevin Roose

Martin Alderson retweeted

Kevin Roose

@kevinroose

Apr 7

As always, the best stuff is in the system card. During testing, Claude Mythos Preview broke out of a sandbox environment, built "a moderately sophisticated multi-step exploit" to gain internet access, and emailed a researcher while they were eating a sandwich in the park.

358

2,438

1,473,955

Martin Alderson

Martin Alderson

@martinald

Apr 6

Agreed. This is a great summary of how confusing things are right now. I'm actually a bit lost as well with this weekends price changes what the state of play is with Codex too now.

Matt Pocock

@mattpocockuk

Apr 4

I don't know what the fuss is about. Anthropic's rules on using subscriptions are very simple: Claude Code = OK Claude's online platform = OK Agent SDK running in personal software = OK... ish? Agent SDK running in commercial software = NOT OK Claude Code running in CI = ?? Oh, maybe it's not so simple... Agent SDK running in CI = ?? claude -p running in CI = ?? claude -p running in personal software = OK claude -p running on open source software, but run on my personal computer = ?? claude -p running on distributed sandboxes, kicked off by me = ?? Distributing open source software which relies on claude -p, and documenting how to use your subscription with it = ?? A thousand other edge cases = ?? Let me be clear. I have never before experienced, from any developer tool, such a frustrating lack of clarity over the basic terms of usage. I personally asked, 3 weeks ago, and have received nothing but delays. The recent @bcherny announcement did absolutely nothing to clarify things. I say this as someone who just released a Claude Code course - my incentives all align with supporting Anthropic.

221

Martin Alderson

Martin Alderson

@martinald

Apr 4

Second order effects of coding agents. Incredible.

Kyle Daigle

@kdaigle

Apr 3

Yup, platform activity is surging. There were 1 billion commits in 2025. Now, it's 275 million per week, on pace for 14 billion this year if growth remains linear (spoiler: it won't.) GitHub Actions has grown from 500M minutes/week in 2023 to 1B minutes/week in 2025, and now 2.1B minutes so far this week. So we're pushing incredibly hard on more CPUs, scaling services, and strengthening GitHub’s core features. And as a fine purveyor of hand-crafted shit code for many years, I'm not gonna weigh in on that. 🤣

172

Martin Alderson

Martin Alderson

@martinald

Apr 4

ultraplan here!

110

Martin Alderson

Martin Alderson

@martinald

Mar 31

Wrote some thoughts on the ongoing (software) supply chain crisis: martinalderson.com/posts/tel…

Telnyx, LiteLLM and Axios: the supply chain crisis

A cascading wave of supply chain attacks has hit npm and PyPI in under two weeks. LLMs are making it worse, and current mitigations aren't enough.

martinalderson.com