ClaudeDevs

ClaudeDevs

95 Photos and videos

Tweets

Boris Cherny retweeted

ClaudeDevs

@ClaudeDevs

Jun 11

/goooooal ⚽

0:07

227

480

9,991

561,435

Aaron Li

Boris Cherny retweeted

Aaron Li

@aaronli

Jun 11

claude fable 5 has solved CAD I asked it to make a model of a V8 engine It came back to me with a fully working model in under 10 minutes

0:11

231

503

6,944

816,251

Ara Kharazian

Boris Cherny retweeted

Ara Kharazian

@arakharazian

Jun 10

NEW: Ramp AI Index for June 2026 1. We expected OpenAI to gain on the launch of Codex. It held flat in business adoption last month. 2. Anthropic grew 2.5% points to 41% of firms. It's now driving new AI adoption with never-adopters. We also made methodological updates to better capture spend on bill pay. OpenAI adoption was higher over the 2023-2025 period, but Anthropic remains the most popular model among businesses today.

269

56,763

Boris Cherny

Boris Cherny

@bcherny

Jun 11

Hello from Code with Claude Tokyo!!

123

119

3,044

115,245

Dario Amodei

Boris Cherny retweeted

Dario Amodei

@DarioAmodei

Jun 10

Today I'm publishing a new essay, Policy on the AI Exponential. AI is progressing extremely fast—much faster than the policy process was built to handle. The essay lays out where I think the technology is now, and the action needed to close the gap: darioamodei.com/post/policy-…

Dario Amodei — Policy on the AI Exponential

darioamodei.com

1,297

2,394

13,363

6,322,458

Boris Cherny

Boris Cherny

@bcherny

Jun 9

Enjoy!

ClaudeDevs

@ClaudeDevs

Jun 9

We've reset 5-hour and weekly rate limits for all users. Enjoy Fable 5!

282

2,980

202,832

Boris Cherny

Boris Cherny

@bcherny

Jun 9

Fable 5 is the biggest step up I’ve felt in our models since Opus 4.5 back in November. After 4.5 came out I uninstalled my IDE when I realized that I’d been doing 100% of my coding in a terminal for a few weeks. With Fable, it’s felt like Claude has stepped up from being a coding agent to a thought and design partner in building the product. Fable has judgement, taste, and dimensionality in a way that previous models didn’t, leading me to trust it more with the most complex work. I think the first time I had this realization was when I asked Fable to debug something. It is the first model I have used that was so methodical and precise, taking measurements and adding logs then verifying that it truly fixed the issue before declaring victory. There’s nothing in claude code’s prompting telling the model to do that, it’s just part of its personality. It really has this “big model smell” that I haven’t felt before.

651

598

10,626

888,065

Boris Cherny

Boris Cherny

@bcherny

Jun 9

We talk a lot about how important it is to set up self-verification loops. Especially in the age of powerful models that can run for long periods of time, self-verification is a key ingredient that enables the model to run for much longer, delivering a result that is closer to what you intended, so you can do more without having to constantly check in on Claude as it works. @delba_oliveira gives a great breakdown of what that looks like and why it matters

ClaudeDevs

@ClaudeDevs

Jun 2

How do you get Claude Code to check its own work before handing it back? Watch how you can encode your manual checks so Claude closes its own feedback loop:

5:57

242

3,026

414,711

Andrej Karpathy

Boris Cherny retweeted

Andrej Karpathy

@karpathy

Jun 9

This is a super exciting release - Claude Fable 5 is the same underlying model as Mythos but with added safeguards. The benchmarks are great and it's SOTA on everything by a margin but I'll add that *qualitatively* also, this is a major-version-bump-deserving step change forward (imo of the same order as Claude 4.5 was in November), peaking especially for long problem-solving sessions on very difficult problems. You can give it a lot more ambitious tasks than what you're used to, the model "gets it" and it will just go, and it's never felt this tempting to stop looking at the code at all (but don't do this in prod!). The model still has quirks that people will run into and the safeguards are configured to be a little too trigger happy for launch, which can hopefully be tuned over time. I feel a lot of things changing as working software increasingly comes out on a tap. The Jevon's paradox kicks in and I feel my own demand for software growing substantially. You can ask for anything - explainers, visualizers, dashboards, bespoke single-use apps (e.g. a full wandb that is hyper-specific just for your project), you can 10X your test suite, auto-optimize code, run giant research projects with custom HTML for the results, anything! "Free your mind" (Matrix ref). Really looking forward to all the things people build!

Claude

@claudeai

Jun 9

Replying to @claudeai

Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.

Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

ALT Benchmark table titled Mythos 5 & Fable 5, comparing Claude Mythos 5 and Fable 5 against Claude Mythos Preview, Claude Opus 4.8, GPT 5.5, and Gemini 3.1 Pro.

1,254

2,355

25,190

2,657,726

Aaron Levie

Boris Cherny retweeted

Aaron Levie

@levie

Jun 9

If you thought AI progress was slowing down, well here's the immediate answer to that. Huge jump in capability across the board. This is going to deliver major improvement in agents across almost all knowledge work categories.

Claude

@claudeai

Jun 9

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

0:20

670

144,223

Boris Cherny

Boris Cherny

@bcherny

Jun 9

Fable 5 is now available in Claude Code and Cowork Fable is the best model I have used for coding, by a wide margin. It is a big step up, enabling less prompts and steers, more efficient token use, better code quality, better tool use, more intelligent self-verification, longer running sessions, and higher trust & autonomy. Happy coding!

Claude

@claudeai

Jun 9

Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.

0:20

428

301

4,375

366,073

Boris Cherny

Boris Cherny

@bcherny

Jun 9

Just landed nested subagent support in Claude Code Starting to experiment more with agents kicking off agents as a way to better manage context. Capped at depth=5 to start, going out in today’s release. Lmk what you think!

501

294

5,643

469,790

Anthony Morris ツ

Boris Cherny retweeted

Anthony Morris ツ

@amorriscode

Jun 9

maximum intelligence

0:06

423

57,785

Mikhail Parakhin

Boris Cherny retweeted

Mikhail Parakhin

@MParakhin

Jun 7

Have been extensively testing Claude Workflows this weekend, with the best model possible. Threw it at my whole code base, combing for bugs. 144 found and fixed! Geez... It is a large code base, for sure, but 144?!! Some are very impactful, some are downright embarrassing...

Mikhail Parakhin

@MParakhin

Apr 3

I keep predicting software quality will improve. I keep being wrong. Models write better-than-average code, yet we use them to write more code - not better code (shoutout to the unmovable, always-on-top Claude Code download and install window).

543

177,660

Jarred Sumner

Boris Cherny retweeted

Jarred Sumner

@jarredsumner

Jun 8

Tokyo this week

ALT Code w/ Claude 15:30 - 16:00 Rewriting Bun in Rust Jarred Sumner

1,145

110,490

Boris Cherny

Boris Cherny

@bcherny

Jun 8

When we first demoed Claude Code internally, it got two reactions on Slack. A year after GA, @_catwu and I sat down to talk about what's changed: why I use auto mode instead of plan mode, how routines fix bugs before I see them, why I do most of my coding from my phone now, and where the product is going

ClaudeDevs

@ClaudeDevs

Jun 8

Claude Code's first demo got two Slack reactions. One year after GA, @bcherny and @_catwu look back: verification best practices, why we built auto mode, routines and loops, and what's next. youtube.com/watch?v=Hth_tLaC…

131

123

2,208

355,960

Boris Cherny

Boris Cherny

@bcherny

Jun 8

Seeing a number of benchmarks showing Opus is the best model for long-running work. Five tips for running Opus autonomously for hours/days: 1. Use auto mode for permissions, so Claude doesn’t ask for approval 2. Use dynamic workflows, to have Claude orchestrate hundreds/thousands of agents to get a task done 3. Use /goal or /loop, to nudge Claude to keep going until it’s done 4. Use Claude Code in the cloud, so you can close your laptop (easiest way is the desktop or mobile app) 5. Make sure Claude has a way to self-verify its work end to end: Claude in Chrome browser extension for web, iOS/Android sim MCP for mobile, a way to start the full web server or service for backend work

Rishi Desai

@rishi_desai2

Jun 5

Can coding agents stay coherent over a 1 billion token budget? Can they build Slack from scratch? Rewrite a JAX codebase in PyTorch? Build a C compiler in Rust? Enter SWE-Marathon: a benchmark for autonomous long-horizon software work.

313

278

3,477

638,540

Anthropic

Boris Cherny retweeted

Anthropic

@AnthropicAI

Jun 4

Today, Anthropic engineers on average ship 8x as much code per quarter as they did compared to 2021-2025.

210

380

5,035

2,032,724

Anthropic

Boris Cherny retweeted

Anthropic

@AnthropicAI

Jun 4

Our internal data shows Claude is accelerating AI development—a possible path to recursive self-improvement, or AI autonomously building a more capable successor. It’s happening faster than we thought, and the implications deserve greater attention. anthropic.com/institute/recu…

When AI builds itself

Our progress toward recursive self-improvement, and its implications.

anthropic.com

1,771

4,662

28,646

18,488,299

Boris Cherny

Boris Cherny

@bcherny

Jun 5

We doubled Claude Cowork usage limits for the next month. This applies to your 5-hr rate limits. If you’ve been saving up a big messy project, now’s the time.

Claude

@claudeai

Jun 5

We've doubled usage limits in Claude Cowork for the next month. Delegate bigger, more complex tasks to Claude.

325

200

4,488

533,623

Boris Cherny

Boris Cherny

@bcherny

Jun 5

Cowork is at its best on work that’s too big for a chat: research across dozens of accounts, recurring reports, triaging my inbox and drafting replies. If you’ve been curious, this is a good month to find out what it can take off your plate. Can’t wait to see what you do.

210

32,062