Full Stack Computational Linguist ※ Mozilla OpenNews Fellow ※ Virtual Production ※ Filmmaker ※ AI accelerationist

Joined April 2007
8,244 Photos and videos
Pinned Tweet
I successfully optimised a context compression prompt with @DSPyOSS GEPA and TextGrad, see github.com/Laurian/context-c…
Laurian Gridinoc retweeted
my personal litmus test is asking ML engineers whether this is hardware or software
Laurian Gridinoc retweeted
Excited to launch Luce KVFlash. We've been working harder than ever with @davideciffa to bring better DX for local AI. Today, long context has a second memory bill nobody budgets for: the KV cache. On Qwen3.6-27B at 256K it costs 4.6 GiB of VRAM and drags decode down to 13 tok/s, because every new token reads the whole thing. KVFlash keeps a small pool of KV on the GPU, auto-sized to your VRAM, and pages cold 64-token chunks to host RAM, bit-exact and recallable. Decode holds a flat 38.6 tok/s from 64K to the native 256K on a 3090: 2.9x the full cache at 256K, with 72 MiB resident and benchmark accuracy unchanged.
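The paging idea in the KVFlash post can be pictured with a toy model. This is only a sketch under stated assumptions, not KVFlash's actual code: the class name `PagedKVPool`, the byte-string chunks, and the least-recently-used eviction rule are all hypothetical stand-ins (the post does not say how "cold" chunks are chosen). It shows a bounded "GPU" pool that spills whole chunks to a "host" store and recalls them unchanged.

```python
from collections import OrderedDict

CHUNK_TOKENS = 64  # chunk size quoted in the post; unused by the toy logic


class PagedKVPool:
    """Toy two-tier store: a bounded 'GPU' pool backed by a 'host' dict.

    Eviction is least-recently-used, which is an assumption made for
    illustration; the post only says that cold chunks are paged out.
    """

    def __init__(self, max_resident_chunks: int):
        if max_resident_chunks < 1:
            raise ValueError("need at least one resident chunk")
        self.max_resident = max_resident_chunks
        self.gpu = OrderedDict()  # chunk_id -> bytes, oldest first
        self.host = {}            # chunk_id -> bytes, paged-out chunks

    def put(self, chunk_id: int, data: bytes) -> None:
        """Store a chunk in the resident pool, paging out cold ones."""
        self.host.pop(chunk_id, None)  # avoid keeping a stale host copy
        self.gpu[chunk_id] = data
        self.gpu.move_to_end(chunk_id)
        self._evict()

    def get(self, chunk_id: int) -> bytes:
        """Return a chunk, recalling it from host memory if needed."""
        if chunk_id not in self.gpu:
            self.gpu[chunk_id] = self.host.pop(chunk_id)
        self.gpu.move_to_end(chunk_id)  # mark as most recently used
        self._evict()
        return self.gpu[chunk_id]

    def _evict(self) -> None:
        """Move the oldest chunks to host until the pool fits its budget."""
        while len(self.gpu) > self.max_resident:
            cold_id, cold_data = self.gpu.popitem(last=False)
            self.host[cold_id] = cold_data
```

Because chunks are moved rather than re-encoded, a recalled chunk is byte-for-byte the one that was stored, which is the toy analogue of the "bit-exact and recallable" claim.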
Laurian Gridinoc retweeted
most of you don't know how big a deal it is that a single rtx 3090 from 2020 runs qwen 27b dense q4 with 256k context at 40 tok/s, full agentic loops on hermes agent, zero tool call failures. the more i build on this card the more i think nobody really knows how untapped it actually is. the silicon was always capable, the models finally caught up.
Laurian Gridinoc retweeted
Had Hermes Agent with the Manim Video skill plus its TTS tool create a video explaining Hermes' Agent.
Laurian Gridinoc retweeted
Jun 14
We are finally getting our fable back. I built a repo that runs two opus 4.8 on the same question in parallel, blind to each other, based on the OpenRouter Fusion. Then a third opus reads both and writes the final answer from where they agree, where they split, and what they both missed. One run can be confidently wrong. Two, cross-examined, can't hide it. Put the real fable system prompt on top of the claude.md and the judgment comes back. It sounds like fable again because it's doing what fable did. Link's in the comments.
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
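The two-answers-plus-judge flow described in that post could look roughly like the sketch below. It is not the author's repo and not the Fusion API: `fuse`, `ask_a`, `ask_b`, and `ask_judge` are hypothetical names, and each `ask_*` argument is assumed to be any callable that takes a prompt string and returns a text answer (for example, a thin wrapper around a model client).

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable

Ask = Callable[[str], str]  # hypothetical: prompt in, answer text out


def fuse(question: str, ask_a: Ask, ask_b: Ask, ask_judge: Ask) -> str:
    """Run two answerers in parallel, then let a third reconcile them."""
    # Each answerer sees only the question, so they stay blind to each other.
    with ThreadPoolExecutor(max_workers=2) as pool:
        future_a = pool.submit(ask_a, question)
        future_b = pool.submit(ask_b, question)
        answer_a = future_a.result()
        answer_b = future_b.result()

    # The judge sees both drafts and is asked for the three things the
    # post lists: agreement, disagreement, and what both missed.
    judge_prompt = (
        f"Question:\n{question}\n\n"
        f"Answer A:\n{answer_a}\n\n"
        f"Answer B:\n{answer_b}\n\n"
        "Write a final answer. Note where A and B agree, where they "
        "split, and what both of them missed."
    )
    return ask_judge(judge_prompt)
```

Passing the model calls in as plain callables keeps the sketch independent of any particular provider, and makes it easy to exercise with stub functions.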
Laurian Gridinoc retweeted
It's now possible to compile Python extensions (C, C++, Rust etc.) to WebAssembly and distribute them through PyPI such that Pyodide can install them directly simonwillison.net/2026/Jun/1…
Laurian Gridinoc retweeted
yes, eventually Fable was banned. but for a beautiful moment in time, we could one shot our whole backlog
Laurian Gridinoc retweeted
You are Algernon. That's the fable.
Laurian Gridinoc retweeted
Introducing the Fusion API, the smartest compound model in the market. Fusion achieves Fable-level intelligence at half the price. How it works 👇
Are we collecting our Fable traces somewhere to distill from?
you can distill from hf fable traces btw
Laurian Gridinoc retweeted
Replying to @tszzl
Laurian Gridinoc retweeted
Jun 13
it’s starting to feel like end of evangelion again
Laurian Gridinoc retweeted
Now that I’ve tasted Fable 5, it’ll be hard to go back
The US government, citing national security authorities, has issued an export control directive to suspend all access to Fable 5 and Mythos 5 by any foreign national, whether inside or outside the United States, including foreign national Anthropic employees. The net effect of this order is that we must abruptly disable Fable 5 and Mythos 5 for all our customers to ensure compliance. Access to all other Claude models is not affected. We apologize for this disruption to our customers. We believe this is a misunderstanding and are working to restore access as soon as possible. Read our full statement: anthropic.com/news/fable-myt…
(ノಠ益ಠ)ノ彡┻━┻
As a result of a US government directive, we are suspending access to Claude Fable 5 for all users. You can continue to use all other Claude models. Here’s what this means for you: Across Claude products, new sessions will run on your selected default model or Opus 4.8, and existing Fable 5 sessions will end with an error. On the Claude Platform, requests to Fable 5 will also return an error. Please update your integrations to other Claude models. We know this is a disruption to your workflows; we appreciate your patience and support.
Laurian Gridinoc retweeted
Apple TimeBand prototype (1990) From “Apple Design The Work of Apple Industrial Design Group” (1997)
Laurian Gridinoc retweeted
AI Engineers who look at the data
Laurian Gridinoc retweeted
the jokes write themselves
Laurian Gridinoc retweeted
We've completely lost the tactile thrill of chunky, high-end remotes.