Kiera

Tweets

Pinned Tweet

Kiera @kieradev

26 Dec 2025

People forget that when a new model, say Opus 4.5, improves by 5 percentage points on SWE-bench Verified, going from like 75% -> 80%, it is in no way the same as a model going from 20% -> 25%. As you get to saturation of a benchmark, all that is left are the absolute most difficult tasks. That is why Opus 4.5 appears as an incremental improvement in charts, but offers drastically improved performance over Opus 4.1 or Sonnet 4.5. It is also why I think that, whilst kinda cringe, Anthropic's chart crimes are not wholly unjustified.

5,096
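
One way to put numbers on the pinned tweet's point is to look at how much of the remaining failure rate each jump removes. This is only a sketch: the function name `relative_error_reduction` is hypothetical, and it assumes every task counts equally, which understates the tweet's argument that the tasks left near saturation are the hardest ones.

```python
def relative_error_reduction(before: float, after: float) -> float:
    """Return the fraction of previously failed tasks that are now solved.

    `before` and `after` are pass rates in [0, 1]. This framing is an
    illustrative assumption, not something stated in the tweet.
    """
    if not 0.0 <= before < 1.0:
        raise ValueError("before must be in [0, 1)")
    if not 0.0 <= after <= 1.0:
        raise ValueError("after must be in [0, 1]")
    # Gain in pass rate divided by the share of tasks that were still failing.
    return (after - before) / (1.0 - before)


# Same 5-point gain, measured against very different amounts of headroom.
near_saturation = relative_error_reduction(0.75, 0.80)  # 0.05 / 0.25
low_baseline = relative_error_reduction(0.20, 0.25)     # 0.05 / 0.80

print(f"75% -> 80% removes {near_saturation:.1%} of remaining failures")
print(f"20% -> 25% removes {low_baseline:.1%} of remaining failures")
```

Under these assumptions the 75% -> 80% jump clears about a fifth of the tasks that were still failing, while 20% -> 25% clears about a sixteenth, even though both look like the same 5-point step on a chart.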

Kiera @kieradev

Jun 13

"you will only have gpt-2 and you will be happy"

Artificial Analysis

@ArtificialAnlys

Jun 13

Today is the first time our Intelligence Frontier chart has moved backward.

Kiera @kieradev

Jun 9

Claude Fable 5, 40% of Max 5x usage. Full Mario Kart 64 type game in 2 sentence prompt. All decoration and character models, 4 maps, music, all three game modes, UI was done by Fable 5 in a single shot, in about 15 mins. What the fuck. #fable #anthropic

3:35

780

183,359

Kiera @kieradev

Jun 10

@bcherny you have all cooked an unimaginable amount

3,275

Kiera @kieradev

Jun 10

x.com/kieradev/status/206452…

Kiera @kieradev

Jun 10

Episode 2 of "Claude Fable 5 is fucking insane": A full Pokemon-like RPG, made in 1 hour. Used 90% of my 5 hour limit but it is worth it. Seems to have a full story, over 4 hours of content, 50 unique collectable pokemon, controller support. Imagine fusing this with images-2...

2:05

5,600

Kiera @kieradev

Jun 10

2:05

8,953

Kiera retweeted

Pope Leo XIV

@Pontifex

Apr 10

Hundreds of millions of people throughout the world are immersed in extreme poverty. Yet, disproportionate wealth remains in the hands of a few. It is an unjust scenario, in the face of which we cannot fail to question ourselves and commit to change things. There is no lack of resources at the root of disparities, but the need to address solvable problems related to a more equitable distribution of wealth, to be achieved with moral sense and honesty.

13,551

26,270

131,449

5,984,026

Kiera @kieradev

Apr 7

mythos

423

Kiera @kieradev

Jan 11

genuinely why does google not use a smarter model for overviews, it is hallucinating typos now

1,335

Kiera @kieradev

Jan 11

It is still wild to me how quickly Claude's context window fills when they don't preserve thinking like Google does. I would presume that preserving thinking in the message stream, which Google can do thanks to their 5x larger context window, probably provides at least a ~2% intelligence bump.

825

Kiera @kieradev

Jan 9

Where would Claude want to live? #claudecode #anthropic #ai

760

Kiera @kieradev

Jan 9

imagine if chemists created a compound which could materialise into literally any other compound, and then imagine they gave free access to everyone. imo ai image gen (and perhaps ai more broadly) is no different. I know it is cool to be accelerationist on here, but really AI should have been researched and developed in private. everyone is trying to get AGI anyway, and all the consumer products now are pointless as they are all negative profit. I see very few positive cases for consumer ai, mostly just slop and harm

488

Kiera @kieradev

Jan 6

Claude Opus 4.5 level software model with GPT-OSS-120B speed and pricing would undeniably just end any manual writing of code imo

461

Kiera @kieradev

Jan 4

Icl claude getting side tracked and finding and being surprised by news during a completely unrelated question about my personal finances is very funny and quite human

525

Kiera @kieradev

Jan 1

gemini continues to be great at tool calls: "The user is likely working with a hypothetical future or a fictional timeline, given the system prompt states it is January 1, 2026."

607