⿻ Andrew Trask

⿻ Andrew Trask

91 Photos and videos

Tweets

Pinned Tweet

⿻ Andrew Trask

@iamtrask

This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not... I've been waiting for someone to show this result for like 4 years... this is a huge deal. The short reason: combinations of models will *always* outperform individual models The long reason: this is the gateway to a million times more data... and huge leaps in compute efficiency. The AI scaling laws always win. More in article below 👇

OpenRouter

@OpenRouter

16h

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

1,439

277,690

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

OpenRouter

@OpenRouter

16h

Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇

1,439

277,690

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

andrewtrask.substack.com/p/b…

6,387

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

12h

BREAKING: American in-fighting vs Chinese focus

Przemek Chojecki | PC

@prz_chojecki

Jun 13

Kimi 2.7 ranked 2nd after Fable 5 and before GPT-5 xhigh We have re-run our ErdosBench smoke test on 14 problems with Kimi 2.7, Qwen 3.7 Max, Grok 4.3 and compared it with the top performers from previous runs. Kimi 2.7 is amazingly good. More below.

7,097

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

12h

Seriously tho. lol. America seems *quite* confident in their supremacy… surprising since the *literally* free... option is ~6 months behind. And sponsored by the 2nd largest economy in the world

728

jack

⿻ Andrew Trask retweeted

jack

@jackbutcher

Jun 13

348

2,388

21,663

1,375,082

⿻ Andrew Trask

⿻ Andrew Trask retweeted

⿻ Andrew Trask

@iamtrask

Jun 11

FYI: all closed-source AI products degrade performance based on the prompt w/o telling the user. There are layers of "filters" on the output. Most obvious is a "recitation filter"... checks every LLM output to prevent an AI product from accidentally outputting exact copies of information from its training data. The problem: exact-quoting high quality sources is often the most accurate response to give... but it can be a liability for closed-source products. So the filter catches it... and a less-exact output is produced instead. Open source models don't do this. This is also why attribution-based control is such a crucial feature to long-term trustworthy AI. Attribution (and corresponding credit/payment layers) would allow frontier AI systems to more reliably exact-quote sources. Important research area.

Max Zeff

@ZeffMax

Jun 11

NEW: Anthropic is walking back Claude Fable 5's policy to covertly degrade performance for competing AI researchers, after facing fierce backlash. “We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic tells WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”

53,030

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 11

Max Zeff

@ZeffMax

Jun 11

53,030

more replies

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 11

- Anthropic on Filters: support.claude.com/en/articl… - Microsoft on Filters: learn.microsoft.com/en-us/az…

Our Approach to User Safety | Claude Help Center

support.claude.com

488

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 11

- OpenAI Filters: arxiv.org/html/2509.13608v1

423

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 10

Fwiw - I understand that this is the concensus view, but I think history will look back with surprise that it didn't bear out in the end. In the 1960s, an employee at IBM or Bell Labs would have said the same thing about the Mainframe computer... and they were incredible (and many are still in use today). But it wasn't just "bigger mainframes forever" anymore than the library was just "bigger library of alexandria forever". I say that as someone who joined DeepMind nearly 10 years ago doing language modeling research. I have had access to large scale and small scale compute during that time. I personally think there's an enormous amount of low-hanging fruit which doesn't require. The future is networks of neural networks: - better routers - better benchmarks - better access to non-public / niche information - better pricing mechanisms - better source attribution - better unlearning - ... There's so much great research to be done. And much of it remains low-hanging because there are some subtle reasons why highly resourced orgs don't tend to pursue them.

Aidan Clark @_aidan_clark_

May 24

If you want to work on pretraining-for-AGI, join OpenAI, Google, Meta or the Anthropic/XAI/Cursor supergroup. The bitter truth of the widening compute gap is that all the problems which are actually on the critical path to AGI now demand that level of compute.

150

69,774

more replies

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 10

One final thing - I think the biggest barrier to breakthrough research is allowing yourself to subscribe to industry groupthink (or to the polar opposite of that groupthink). Go in a 3rd direction. Follow the scaling laws. Look for bridges across fields (especially deep learning, cryptography, and distributed systems). It's never been a better time to do research.

993

⿻ Andrew Trask

⿻ Andrew Trask

@iamtrask

Jun 10

If none of this makes sense and you just need to read "a completely different way to think about AI progress" and a bunch of launch points for research in AI, a few links: - attribution-based-control.ai… - github.com/iamtrask/abcGPT - openmined.org/blog/what-is-b… - x.com/stanislavfort/status/1… - openmined.org/blog/secure-en…

Stanislav Fort

@stanislavfort

16 Sep 2022

I found the Git Re-Basin paper (arxiv.org/abs/2209.04836) by @SamuelAinsworth, J. Hayase & @siddhss5 *really* intriguing. So I made a replication in Colab reusing bits of their code but unfortunately couldn't reproduce the key conclusion 🚨😱 🖥️Colab github.com/stanislavfort/dis… 1/5

1,257