Joined November 2012
91 Photos and videos
Pinned Tweet
This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not... I've been waiting for someone to show this result for like 4 years... this is a huge deal. The short reason: combinations of models will *always* outperform individual models The long reason: this is the gateway to a million times more data... and huge leaps in compute efficiency. The AI scaling laws always win. More in article below 👇
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
88
97
1,439
277,690
This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not... I've been waiting for someone to show this result for like 4 years... this is a huge deal. The short reason: combinations of models will *always* outperform individual models The long reason: this is the gateway to a million times more data... and huge leaps in compute efficiency. The AI scaling laws always win. More in article below 👇
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
88
97
1,439
277,690
BREAKING: American in-fighting vs Chinese focus
Kimi 2.7 ranked 2nd after Fable 5 and before GPT-5 xhigh We have re-run our ErdosBench smoke test on 14 problems with Kimi 2.7, Qwen 3.7 Max, Grok 4.3 and compared it with the top performers from previous runs. Kimi 2.7 is amazingly good. More below.
3
1
27
7,097
Seriously tho. lol. America seems *quite* confident in their supremacy… surprising since the *literally* free... option is ~6 months behind. And sponsored by the 2nd largest economy in the world
5
728
⿻ Andrew Trask retweeted
348
2,388
21,663
1,375,082
⿻ Andrew Trask retweeted
FYI: all closed-source AI products degrade performance based on the prompt w/o telling the user. There are layers of "filters" on the output. Most obvious is a "recitation filter"... checks every LLM output to prevent an AI product from accidentally outputting exact copies of information from its training data. The problem: exact-quoting high quality sources is often the most accurate response to give... but it can be a liability for closed-source products. So the filter catches it... and a less-exact output is produced instead. Open source models don't do this. This is also why attribution-based control is such a crucial feature to long-term trustworthy AI. Attribution (and corresponding credit/payment layers) would allow frontier AI systems to more reliably exact-quote sources. Important research area.
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”
8
5
54
53,030
FYI: all closed-source AI products degrade performance based on the prompt w/o telling the user. There are layers of "filters" on the output. Most obvious is a "recitation filter"... checks every LLM output to prevent an AI product from accidentally outputting exact copies of information from its training data. The problem: exact-quoting high quality sources is often the most accurate response to give... but it can be a liability for closed-source products. So the filter catches it... and a less-exact output is produced instead. Open source models don't do this. This is also why attribution-based control is such a crucial feature to long-term trustworthy AI. Attribution (and corresponding credit/payment layers) would allow frontier AI systems to more reliably exact-quote sources. Important research area.
NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”
8
5
54
53,030
- OpenAI Filters: arxiv.org/html/2509.13608v1

1
423
Fwiw - I understand that this is the concensus view, but I think history will look back with surprise that it didn't bear out in the end. In the 1960s, an employee at IBM or Bell Labs would have said the same thing about the Mainframe computer... and they were incredible (and many are still in use today). But it wasn't just "bigger mainframes forever" anymore than the library was just "bigger library of alexandria forever". I say that as someone who joined DeepMind nearly 10 years ago doing language modeling research. I have had access to large scale and small scale compute during that time. I personally think there's an enormous amount of low-hanging fruit which doesn't require. The future is networks of neural networks: - better routers - better benchmarks - better access to non-public / niche information - better pricing mechanisms - better source attribution - better unlearning - ... There's so much great research to be done. And much of it remains low-hanging because there are some subtle reasons why highly resourced orgs don't tend to pursue them.
If you want to work on pretraining-for-AGI, join OpenAI, Google, Meta or the Anthropic/XAI/Cursor supergroup. The bitter truth of the widening compute gap is that all the problems which are actually on the critical path to AGI now demand that level of compute.
10
11
150
69,774
One final thing - I think the biggest barrier to breakthrough research is allowing yourself to subscribe to industry groupthink (or to the polar opposite of that groupthink). Go in a 3rd direction. Follow the scaling laws. Look for bridges across fields (especially deep learning, cryptography, and distributed systems). It's never been a better time to do research.
1
8
993
If none of this makes sense and you just need to read "a completely different way to think about AI progress" and a bunch of launch points for research in AI, a few links: - attribution-based-control.ai… - github.com/iamtrask/abcGPT - openmined.org/blog/what-is-b… - x.com/stanislavfort/status/1… - openmined.org/blog/secure-en…

I found the Git Re-Basin paper (arxiv.org/abs/2209.04836) by @SamuelAinsworth, J. Hayase & @siddhss5 *really* intriguing. So I made a replication in Colab reusing bits of their code but unfortunately couldn't reproduce the key conclusion 🚨😱 🖥️Colab github.com/stanislavfort/dis… 1/5
9
1,257