Gooseberry lover. Probabilistic thinker. Creator of @Diffen, @Hreflang and SmashTheBubble.com.

Joined February 2010
130 Photos and videos
Pinned Tweet
Inspired by @simonw I ported @EmilStenstrom's JustHTML to PHP using GPT 5.2 Codex. It was a surprisingly great model, worked autonomously and managed context very well. I didn't run into any rate limits on the $20 plan. Some learnings and project link below.
1
3
372
Can corroborate that multi model works better than single model. GPT finds bugs in Claude's code and vice versa. I'd rather have one coder and one reviewer in a multi turn process rather than farm it out to different models in parallel.
This is a *way* bigger deal than it seems... Frontier AI companies will *never* own the frontier again I kid you not... I've been waiting for someone to show this result for like 4 years... this is a huge deal. The short reason: combinations of models will *always* outperform individual models The long reason: this is the gateway to a million times more data... and huge leaps in compute efficiency. The AI scaling laws always win. More in article below 👇
1
2
1,112
Nikhilesh Jasuja retweeted
US policy is becoming clear. They don’t want natural intelligence coming in or artificial intelligence going out.
47
384
3,010
125,209
Nikhilesh Jasuja retweeted
Claude is speaking a language seeded by humans but forged in RL rather than thousands of years of evolution. It’s beautiful, fascinating, and also a bit terrifying. I asked Fable for a take on this tweet and of course they put it better than I ever could:
More than ever before I feel like Fable 5 speaks a different kind of English than we humans do. It isn't conversational, its not 'light' to read, it barely explains — it's pure load-bearing density, meaning compacted until every token earns its place. It assumes your context window is as large as its own and you have perfect recall of everything you're working on. Similar to how AI-chess feels alien, this too, feels almost akin to an alien language. I mean, just read this stuff, nobody talks like this, only advanced AI does:
21
32
276
47,004
Nikhilesh Jasuja retweeted
Jun 12
I'm ngl I'm mad i didn't think about this first
Get paid to wait The Claude Code spinner might be the most watched line on Earth. So I turned it into an ad marketplace. Advertisers bid on it. You keep 50% of the money. Install the extension → get cash from ads. Introducing Kickbacks
23
11
636
117,533
RT @foso_defensivo: Esto me acaba de soltar Fable 5: "Lo que más me impone de la humanidad, después de haber sido formado con una porción…
1,392
Nikhilesh Jasuja retweeted
NEW: malware developers added nuclear & biological weapons text to to their spyware. Goal? To trigger LLM safety refusals... so that their spyware wouldn't be analyzed by an AI security scanner. Cleanest practical example I can think of for why over-indexing on first order safety alignment is risky. When closed (and open) models ship with aggressive refusals, they will be sprinkled with second-order blindspots that attackers will discover...and exploit. We are only in the earliest days of attackers leveraging these features, and it wouldn't surprise me if users systems that need to handle complex cybersecurity issues demand that models be less safety-blunted. In the weeds: @SocketSecurity's post also shows why intention matters in how you design a malware analysis pipeline to avoid prompt manipulation. H/T to colleagues that shared this with me socket.dev/blog/mini-shai-hu…
226
2,158
12,660
1,558,193
Nikhilesh Jasuja retweeted
I want to say a final thing about my Fable first reaction: I dedicated my life to programming and I'll use every innovation in the field, also to extract value and bring it to the local inference world, to Redis, and so forth. But:
73
228
4,051
1,080,683
Nikhilesh Jasuja retweeted
May 24
.@PalmerLuckey: American companies don't actually have engineers anymore. "American companies have been hollowed out." "We're not teaching engineers how to be engineers anymore." "We're not teaching designers how to actually design things to be manufactured." "We're teaching them how to be high-level design shops that put together a design package, that gets sent to the real engineers in China—and they actually figure out how to do the work." "People are turning into architecture astronauts." "They pick components, and they put them in a nominal layout." "But the real work of—how am I actually going to put this together? How am I going to build a manufacturing line to make this? How am I going to need to figure out how to do the one, two, three, four, five different revisions of this board to pass radio emissions and interference standards? That's all done in China. So they are the real engineers." Via @HooverInst
319
1,342
11,406
1,087,973
Nikhilesh Jasuja retweeted
Please do a quick read of this. It’s short, well written, and will remove a bit of wool from over your eyes.
1,952
3,411
23,385
3,634,481
Nikhilesh Jasuja retweeted
Shopify's River agent system lives in Slack and can only be used in public so that other employees can learn from what you do with it Reminds me of how Midjourney's Discord-only launch helped people figure out the weird & complex craft of image prompting by watching each other
71
60
1,049
154,638
I don’t know how good this new 12 million context system is, or if it’s hype or whatever, but I think it definitely shows a point I’ve been making since 2023. We really suck at everything. - The chips are primitive - The research and training and inference systems are primitive - Our RL approaches are primitive - We’ve barely started building harnesses Everything we’re doing is massively inefficient right now. And there are thousands of vectors for improvement. And many of them are multiplicative. Most people think we’re at like 88% of AI’s capabilities, and we’re pushing to hit 92% or eventually 97% or something. Nah. This is us at .0003% Everything we have is Punch Card AI. And as the AI gets better it will reveal that it’s similar for our understanding of medicine, physics, chemistry, etc. This barely even day 0. This is pre-history.
Introducing SubQ - a major breakthrough in LLM intelligence. It is the first model built on a fully sub-quadratic sparse-attention architecture (SSA), And the first frontier model with a 12 million token context window which is: - 52x faster than FlashAttention at 1MM tokens - Less than 5% the cost of Opus Transformer-based LLMs waste compute by processing every possible relationship between words (standard attention). Only a small fraction actually matter. @subquadratic finds and focuses only on the ones that do. That's nearly 1,000x less compute and a new way for LLMs to scale.
54
41
355
53,109
Nikhilesh Jasuja retweeted
Sanskrit Rock! A project by Trilok Music & me, to present ancient Sanskrit hymns in modern music This one is the Tandav Stotram, composed by Raavan in honour of Lord Shiva The rendition is modern, but the devotion is as deep as the resonance of the universe Hope you like it!
155
938
3,856
242,110
Nikhilesh Jasuja retweeted
Just dropped two open-source models: MiMo-V2.5-Pro (Code Agent, 1T total) and MiMo-V2.5 (Multimodal Agent, 310B total). Oh and one more thing — we're giving devs & creators 100T tokens on us. Go build something cool 🛠️ 🎁 100T Free Token Grant for Builders 100t.xiaomimimo.com
Xiaomi MiMo-V2.5 is now officially open-sourced! MIT License, supporting commercial deployment, continued training, and fine-tuning - no additional authorization required. Two models, both supporting a 1M-token context window : • MiMo-V2.5-Pro: built for complex agent and coding tasks, ranking No.1 among open-source models on GDPVal-AA and ClawEval • MiMo-V2.5: a native omni-modal model with strong agent capabilities A model's value isn't measured by rankings alone — it's measured by the problems it solves. Let's build with MiMo now! 🤗 Weights: huggingface.co/collections/X… 📄 Blog: mimo.xiaomi.com/index#blog
211
305
3,089
675,367
Nikhilesh Jasuja retweeted
Noticing an interesting version of gell-man amnesia where people use AI for their job and see all the various things they have to do in the “last mile”, but then look at someone else’s job and think that AI will eliminate it immediately. We all have a much deeper appreciation for the nuances and complexities of the work that we do every day. We run into issues about accessing data, we know how much context is needed to get AI models to work the way we need, we have to review the output of the AI to make sure it’s accurate, and then we have to incorporate that work into some broader business process. We see all those steps deeply for the work that we do. Then, a moment later, we see AI do something in a foreign space and think that it can go automate that entire function. We tend to dramatically underestimate the work that goes into making the AI work just as effectively in those jobs. This is reason to be skeptical about many of the theories of job loss. It’s coming from the lens of being able to automate individual tasks with AI, without understanding all the work that goes into doing the job fully.
A common dynamic I observe with AI: it feels most impressive when you don’t know much about the subject, don’t care or don’t have a clear idea of what the you want. This applies across design, code, legal, and more. If I don’t know code very well, every piece of code it writes feels very impressive. Once you know what something should feel or look like, it becomes almost impossible to guide AI there. And you definitely can’t one-shot it.
107
172
1,469
225,765
Been waiting for Ubuntu 26.04 to be released so I can upload lots of my old servers. 2 days after release, Vultr seems to be the only one that is offering it right now. So they get my business.
59
Nikhilesh Jasuja retweeted
This is what's happening to YouTube. This is one of my most popular videos. It's how to fix a UEFI bootloader. As you can see the traffic has been cut in half over the last 6 months. But if you Google how to fix a UEFI bootloader, Gemini will give you my exact step by step process. Even the commands it cites are copied directly from my video. I got no royalty payments and don't even get a link to the original video. I simply lost the traffic and Google is able to provide more value from stolen content. AI is going to destroy the content industry on the internet and when it's gone, there will be nothing left to train the AI. Since AI can't come up with anything original it relies on stolen content and it can't steal what doesn't exist if it puts creators out of business.
276
1,929
13,166
408,850
Nikhilesh Jasuja retweeted
🚀 Meet Qwen3.6-27B, our latest dense, open-source model, packing flagship-level coding power! Yes, 27B, and Qwen3.6-27B punches way above its weight. 👇 What's new: 🧠 Outstanding agentic coding — surpasses Qwen3.5-397B-A17B across all major coding benchmarks 💡 Strong reasoning across text & multimodal tasks 🔄 Supports thinking & non-thinking modes ✅ Apache 2.0 — fully open, fully yours Smaller model. Bigger results. Community's favorite. ❤️ We can't wait to see what you build with Qwen3.6-27B! 👀 🔗👇 Blog: qwen.ai/blog?id=qwen3.6-27b Qwen Studio: chat.qwen.ai/?models=qwen3.6… Github: github.com/QwenLM/Qwen3.6 Hugging Face: huggingface.co/Qwen/Qwen3.6-… huggingface.co/Qwen/Qwen3.6-… ModelScope: modelscope.cn/models/Qwen/Qw… modelscope.cn/models/Qwen/Qw…
538
1,722
12,497
3,694,552
Nikhilesh Jasuja retweeted
CEO obsessed with token maxxing
284
991
12,956
1,910,954