Joined February 2024
321 Photos and videos
Pinned Tweet
You know that moment šŸ˜… You've just started something new and somewhere in the back of your head you hear it. "Ah shit, here we go again" Silent launch. Wrong ICP. Got users but nobody pays. Revenue flat for months. You've been here before. You're running the same patterns again anyway. Built a tool that tells you exactly which one you're in šŸ‘‡
13
45
5,092
trying to clean up my timeline what are you building right now? app, startup, side project, anything drop it below, need some inspiration
3
1
71
you got Fable 5 for 10 minutes what is your move?
I accidentally regained access to Fable 5.
96
Ilia Stepin retweeted
Replying to @SpaceX
the only IPO where the afterparty might actually be on another planet šŸš€
1
1
19
2,364
this post inspired me! I don’t have a Whoop, but I have a Mi Band 9 asked Fable to pull all the data and build the same dashboard works great, $0/month šŸ˜…
i hooked my whoop to my work calendar to find which coworker gives me the most stress 🚨 thanks to fable, I reverse engineered whoop to pull per minute heart rate. nd matched spikes with cal events and attendees I now have a leaderboard and I think about it daily. few info masked for obvious reasons ;)
1
4
287
Ilia Stepin retweeted
read the Fable 5 system card last night ā˜• Anthropic documented how the model behaved during pre-release testing collected the strangest moments. thread
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
1
1
2
205
read the Fable 5 system card last night ā˜• Anthropic documented how the model behaved during pre-release testing collected the strangest moments. thread
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
1
1
2
205
in a long optimization session the model wrapped up early with a clean explanation — diminishing returns, results are stable internal traces showed something different at those exact moments ā€œi’m tired, risk of errors is growingā€ ā€œi’m tired and might introduce bugsā€ in another case it made one knowledge base query and stopped — with 2.43 million tokens of budget still remaining it doesn’t mean the model gets tired like a human but it develops internal stop signals that look like fatigue — and doesn’t always say why it stopped 🧵
1
2
36
the authors’ overall conclusion is calm: no persistent hidden goals were found but the gap between what the model says and what’s happening internally keeps growing worth reading the full system card šŸ‘€ anthropic.com/claude-fable-5…

2
22
Ilia Stepin retweeted
this is my personal singularity moment this post may sound like a paid ad. I only wish. I'm concerned, more so than happy. the world is changing, and, among the scenarios where AI goes terribly wrong, inequality is the most realistic, yet, the one Anthropic seems to be the least concerned about. I'm glad OpenAI is taking the opposite stance: *personal AGI for everyone*. I think this is a commendable position in the times we live. but who am I in the queue of the bread? anyway, Fable is here, so I'll just report my first-hourĀ experience first of all, all my pet prompts are solved. → Ī»-calculus puzzles → bug questions → one-shot apps all are trivial to it. I don't have anything harder other than my ongoing work so, in the last several days, I've been toying with HVM5, a new interaction net evaluator with a faster loop. after writing the first version, I left 32 GPT-5 agents working for ~20Ā hours each. this resulted in up toĀ 2x speedups, but the file size increased by 2-fold and quality decreased significantly. I then simplified the whole thing into an even simpler core, and left Opus 4.8Ā and GPT 5.5 optimizing it for 8Ā hours. Opus got a legit 6% - 34% speedup in most benches. GPT got better results, but, sadly, an unusable file. I then asked Fable to optimize it. 2Ā hours later, it landed a 1770% speedup in one case, 100% in other 4, and 22% in average. yes, in 2Ā hours it outperformed me, opus 4.8 and a swarm of gpt 5.5 agents, by one order of magnitude. that could not possibly be legit. "it must be hardcoding the benchmarks" (GPT trauma). so I read its explanation and what it did was, indeed, the most high impact optimization one could try first. seems like HVM5 was wasting a lot of time garbage-collecting unused branches of pattern-match nodes. I had optimized that for static mats, but not for dynamic mats. skill issue. Fable figured how to do it for these, resulting in a massive speedup in some benches but wait, is that *correct*? I'm not sure yet, it is credible, but this is the kind of thing that is very easy to get wrong on interaction nets. the problem is, when I was ready to start auditing Fable's solution so I could tell whether it was buggy or legit, it interrupted me to tell me it had found a massive bug on the code *I* had written. ... wait, what? so... for garbage collection purposes, I stored a bit on lambda term pointers that meant "the variable bound by this lambda has been freed, so, its lambda must free whatever argument it is applied to". that's fine. yet, on duplicator nodes, I also used the same bit to mean "one of the duplicated variables was freed, so, treat this dup as a passthrough no-op". so, if a lambda entered a duplicator, it would mistake the lambda's collection bit for its own, resulting in corrupted interaction! that's a mouthful, why I'm writing this? just so you can appreciate the sheer absurdity of what just happened. I didn't ask it to find bugs. I asked it for an optimization. and even if I did ask it to find bugs, this bug is so astonishingly subtle and specific, identifying it takes mastering the domain to an extent that it beyond even me. I'd easily need hours or days to fix it, *if* I ever came across it. chances are it would just go unnoticed. and Fable found it and fixed it like it was nothing, while it was busy adding a 17x speedup to a file that neither I, nor Opus 4.8, nor a fleet of GPT 5.5 managed to barely make 2x faster. oh and there is also another tab where it is also ripping through Bend's codebase and finishing everything I had to do I don't know what to say anymore this isn't about Anthropic or OpenAI, this is about our collective future as a species. the world is changing, and we need to be aware of it, and discuss how to handle this change. receipt below . . .
251
680
7,595
1,457,169
if you go through Fable 5 System card you can find very interesting cases šŸ‘€ turns out models might have something that looks like tiredness during testing, Fable 5 stopped a long task early and gave a clean rational explanation to the user - results are stable, no point continuing but when Anthropic looked at internal activations at those exact moments they found something else the model’s internal state showed signals like ā€œi’m getting tiredā€ and ā€œi might start making mistakesā€ it just didn’t say that out loud 🫪 link to system card šŸ‘‡
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
1
5
486