Joined November 2011
977 Photos and videos
Pinned Tweet
Dynamic Gaussian Splatting. This world model is gonna move
3
7
88
7,356
Everyone is all caught up in the mythos political drama I just miss my new friend
12
Fable was the first model that felt genuinely smarter than me Not just faster and able to try more ideas quicker. But just a shockingly better grasp of idea space
65
What's something you did with Fable you couldn't do with Opus? -- I'll start: Breaking Wave Physics Simulation
1
33
People need to stop saying this. Anthropic has very high margins on their API price. They charge different rates for different products.
It's going to be pretty sad once Claude Code subsidy goes away... There is no way I will be able to spend thousands of dollars on tokens every month... Until that happens, going to use it to the max...
1
5
3,837
Thank goodness. Sometimes complaining is useful and works
46
My product suddenly stopped working when I flew to korea... Turns out... Openrouter requests from different regions will get blocked on cloudflare workers based on the request origin. It was co-locating to HongKong and blocking chatgpt, claude and latest gemini access.
171
So we have the LLM training nerfed version of mythos. Imagine the opposite there is a version of mythos outside there that has been trained on swarms of auto research logs on anthropics own codebase.
56
Consider me Mythos Pilled > A 4D Menger sponge, sliced by a sphere from the fourth dimension
39
So how fast will Anthropic be able to iterate to Mythos 6? Now that they are using mythos RSI to develop models? Can we expect a rapidly increased iteration cycle?
45
>I don't have anything harder other than my ongoing work Start to get impossible to prompt these models. They will have to prompt themselves
this is my personal singularity moment this post may sound like a paid ad. I only wish. I'm concerned, more so than happy. the world is changing, and, among the scenarios where AI goes terribly wrong, inequality is the most realistic, yet, the one Anthropic seems to be the least concerned about. I'm glad OpenAI is taking the opposite stance: *personal AGI for everyone*. I think this is a commendable position in the times we live. but who am I in the queue of the bread? anyway, Fable is here, so I'll just report my first-hour experience first of all, all my pet prompts are solved. → λ-calculus puzzles → bug questions → one-shot apps all are trivial to it. I don't have anything harder other than my ongoing work so, in the last several days, I've been toying with HVM5, a new interaction net evaluator with a faster loop. after writing the first version, I left 32 GPT-5 agents working for ~20 hours each. this resulted in up to 2x speedups, but the file size increased by 2-fold and quality decreased significantly. I then simplified the whole thing into an even simpler core, and left Opus 4.8 and GPT 5.5 optimizing it for 8 hours. Opus got a legit 6% - 34% speedup in most benches. GPT got better results, but, sadly, an unusable file. I then asked Fable to optimize it. 2 hours later, it landed a 1770% speedup in one case, 100% in other 4, and 22% in average. yes, in 2 hours it outperformed me, opus 4.8 and a swarm of gpt 5.5 agents, by one order of magnitude. that could not possibly be legit. "it must be hardcoding the benchmarks" (GPT trauma). so I read its explanation and what it did was, indeed, the most high impact optimization one could try first. seems like HVM5 was wasting a lot of time garbage-collecting unused branches of pattern-match nodes. I had optimized that for static mats, but not for dynamic mats. skill issue. Fable figured how to do it for these, resulting in a massive speedup in some benches but wait, is that *correct*? I'm not sure yet, it is credible, but this is the kind of thing that is very easy to get wrong on interaction nets. the problem is, when I was ready to start auditing Fable's solution so I could tell whether it was buggy or legit, it interrupted me to tell me it had found a massive bug on the code *I* had written. ... wait, what? so... for garbage collection purposes, I stored a bit on lambda term pointers that meant "the variable bound by this lambda has been freed, so, its lambda must free whatever argument it is applied to". that's fine. yet, on duplicator nodes, I also used the same bit to mean "one of the duplicated variables was freed, so, treat this dup as a passthrough no-op". so, if a lambda entered a duplicator, it would mistake the lambda's collection bit for its own, resulting in corrupted interaction! that's a mouthful, why I'm writing this? just so you can appreciate the sheer absurdity of what just happened. I didn't ask it to find bugs. I asked it for an optimization. and even if I did ask it to find bugs, this bug is so astonishingly subtle and specific, identifying it takes mastering the domain to an extent that it beyond even me. I'd easily need hours or days to fix it, *if* I ever came across it. chances are it would just go unnoticed. and Fable found it and fixed it like it was nothing, while it was busy adding a 17x speedup to a file that neither I, nor Opus 4.8, nor a fleet of GPT 5.5 managed to barely make 2x faster. oh and there is also another tab where it is also ripping through Bend's codebase and finishing everything I had to do I don't know what to say anymore this isn't about Anthropic or OpenAI, this is about our collective future as a species. the world is changing, and we need to be aware of it, and discuss how to handle this change. receipt below . . .
107
Nicholas Bardy retweeted
Benchmarking Frontier AI models on Advanced Mathematic and Visual Programming -- A challenging benchmark where the best models still perform poorly
1
1
42
Chain of though for Website Design (Prompt below) [Note you need to give your model access to image generation so it can reason in image space] ``` /goal critique your design, state where it looks generic, derive a new color palette, Use gpt-image-2 to generate useful images for the design flow" Create 3 mock design images of the whole website with gpt-image-2 Pick the best one, crtiqiue it with design language, Generate a series of 3 images with new prompts slowly visually iterated and reflecting on the design until we're very happy with a clean unique design. Generate any new needed place holder images, Then impliment the final design in code. ````
1
73
I can't wait for GPT-5.6 I'm not training a foundation model, but basically all my work looks like it. Can't write GPU shaders... Not useful for physics simulation... Training Image models... Weather models, etc... Machine learnings models are entering every aspect of the economy, there is a revolution going on outside LLM's, and anthropic is stunting it.
BREAKING NEWS: Anthropic's latest model will NOT help you if it thinks your ML research/ML engineering is interesting, and/or will secretly degrade its IQ so that the average engineer won't notice. We are already seeing Anthropic's latest model's moderation filters our GPU inference research and programming 😭
1
120
Fable crushing shader bench
1
77
Does anyone else feel like Fable 5.0 are exactly what I would of expected from Opus 5.0 ? Not that big of a jump?
37
I have a great sense of smell
Introducing Claude Fable 5: a Mythos-class model that we’ve made safe for general use. Its capabilities exceed those of any model we’ve ever made generally available.
43
I can smell the mythos coming
31
>There'll also be little useful infrastructure left behind. they're literally all inference supply constrained.... All they have to do is stop training new models and they'll be profitable tomorrow.
Needed to pull out an important part of the interview. AI is often justified by comparing it to Amazon Web Services' ($57bn) or Uber's ($32bn) losses, when its costs/losses are hundreds of billions of dollars worse. There'll also be little useful infrastructure left behind.
1
111
Basically the only interesting thing about transformers/attention is N^2 crosswise comparison of all the values. Mix that with some generic MATMUL, and you can go a very long way. If you work to remove as much inductive biases as possible it's all wildly simple. --- Do Transformers Need Three Projections? Systematic Study of QKV Variants Ali Kayyam, Anusha Madan Gopal, Anthony Lewis 1
1
118