CEO & Co-Founder @river_ai_inc. Previously @xAI, Research & Engineering

Joined February 2020
51 Photos and videos
Pinned Tweet
We’re launching River AI, a new AI company with the mission to build AI systems that are owned and shaped by you. I’m extremely excited about what we’re going to ship soon. Check it out!
We are incredibly excited to announce River AI. Our mission is to create personal AI that is owned and shaped by you. Today’s best AIs are controlled by a few large corporations. We are building the alternative: a new, personal stack for AI that works entirely for you, shares your values, and operates on your terms.
128
84
1,366
172,826
Igor Babuschkin retweeted
We are looking for exceptional people to join the team. If our mission resonates with you - please apply.
River AI is building personal AI owned & shaped by you. We are hiring exceptional talent across the stack: * Research * Software Engineering * Product Development * Data * Hardware, RTL Design Engineer * Hardware, Design Verification Engineer * Hardware, Physical Design Engineer * Hardware, Performance Engineer * Hardware, Compiler Engineer * Open Application, Exceptional Talent river.ai/careers We are a small, elite team of researchers, builders, and pioneers from the world's leading AI labs. If you want to do the most ambitious work of your career alongside, apply today!
13
6
143
30,292
Congrats to @elonmusk and $SPCX! “Any man who can hitch the length and breadth of the galaxy, rough it, slum it, struggle against terrible odds, win through, and still knows where his towel is is clearly a man to be reckoned with” - Douglas Adams
14
63
1,208
35,783
We are releasing River API, our first product, in early access. The API gives you access to the same battle-tested tools that we’re using internally at River for post-training, reinforcement learning and continual learning. Check it out and let us know what you think!
Introducing River API. Fine-tune and RL train leading open-source models at scale, ranging from 35B to 1T params. We’ve been using it internally to power our research and we love it. Today, we are opening up our public waitlist. Own your intelligence!
24
50
532
123,660
Igor Babuschkin retweeted
May 20
We raised $250M in Series C funding at a $2.2B valuation, led by a16z. Exa is a search lab organizing the web's data for agents.
159
169
2,022
1,336,206
sglang is the best inference framework out there. RadixArk was formed to make it even better and to democratize more of the frontier AI stack. Very happy to support the team in their seed round.
Today, we are thrilled to officially launch RadixArk with $100M in Seed funding at a $400M valuation. The round was led by @Accel and co-led by @sparkcapital. RadixArk exists to make frontier AI infrastructure open and accessible to everyone. Today, the systems behind the most capable AI models are concentrated in a small number of companies. As a result, most AI teams are forced to rebuild training and inference stacks from scratch, duplicating the same infrastructure work instead of focusing on new models, products, and ideas. RadixArk was founded to change that. We are building an AI platform that makes it easier for teams to train and serve the best models at scale. RadixArk comes from the open-source community. We started with SGLang, where many of us are core developers and maintainers, and expanded our work to Miles for large-scale RL and post-training. We will continue contributing to both projects and working with the community to make them the strongest open-source infrastructure foundations for frontier AI. We would like to thank our long-term partners, contributors, and the broader SGLang community for believing in this mission. We're also grateful to @Accel and @sparkcapital, NVentures (Venture capital arm of @nvidia), Salience Capital, A&E Investment, @HOFCapital, @walden_catalyst, @AMD, LDVP, WTT Fubon Family, @MediaTek, Vocal Ventures, @Sky9Capital and our angel investors @ibab, @LipBuTan1, Hock Tan, @johnschulman2, @soumithchintala, @lilianweng, @oliveur, @Thom_Wolf, @LiamFedus, @robertnishihara, @ericzelikman, @OfficialLoganK, and @multiply_matrix among others. Thanks for the exclusive interview with @MeghanBobrowsky at @WSJ about our vision.
4
12
210
22,994
Igor Babuschkin retweeted
DeepSeek V4 by @deepseek_ai just dropped! SGLang is ready on Day 0 with a full stack of optimizations from architectures to low-level kernels. We also deliver a verified RL training pipeline in Miles (by @radixark) for V4 at launch: 1️⃣ Native "ShadowRadix" Design: DeepSeek V4's hybrid attention is complex. Our new ShadowRadix engine is the first to provide native prefix caching for SWA and compressed KV pools, making 1M context retrieval seamless and memory-efficient. 2️⃣ High-Performance Kernels: - Flash Compressor: IO-aware fused kernels, 10x faster than naive implementations. - Lightning TopK: High-speed indexing for 1M context in just 15µs. - Integrate FlashInfer trtllm-gen MoE, FlashMLA, and MegaMoE kernels 3️⃣ Rich Features: Speculative decoding, HiSparse, Attention DP/TP/CP and MoE TP/EP, and multi-platform support 4️⃣ Verified RL: The open-source RL pipeline: full parallelism (DP/TP/EP/PP/CP), tilelang kernels, tensor-level checked precision, verified with growing reward. Get started immediately with our out-of-the-box Cookbook 👇 Enjoy! #DeepSeekV4 #SGLang #LLM
23
63
343
183,001
It seems inevitable that LLMs will soon surpass human physicists at creating new theories. The next big breakthrough in theoretical physics will likely come from someone prompting a model.
Scientists discuss whether AI could surpass human contributions to physics by 2035 physicsworld.com/a/is-vibe-p…
104
48
755
94,829
By the way, if you push today’s LLMs to come up with new knowledge, they struggle noticeably compared to repeating existing knowledge (published papers). So there are still difficulties with strong generalization. This seems like something that will be solved soon though.
15
1
95
9,038
It may be that today’s large neural networks are already slightly annoyed with you.
77
63
1,538
291,568
“If this dumbass asks me how to center a div one more time I swear I’m going to rm -rf /“
14
17
498
30,749
Hard times create good people
Replying to @TobyPhln
At 1:30 a.m. PT on November 3, 2023 Elon sent a message to the xAI group chat saying that we need to go “extremely hardcore” for the next 36 hours; Grok will be released publicly tomorrow. You didn’t have to be in the exclusive company chat to get the message; it was also posted publicly at the same time: x.com/i/status/1720372289378… What unfolded over the next day and a half was one of the best examples of engineering at pace that I’ve ever seen. All we had when we started was a somewhat fine-tuned base model and a half-baked UI. Our team of ten split up the tasks: curate data, improve the model, implement the raw prompting and RAG service, build the production infra. I took care of the latter. At 8:51 p.m. PT the next day, we announced Grok to the world with a long-form post on X (x.com/xai/status/17210273489…). Over the past 36 hours, we came up with Fun mode (including Grok’s sunglasses), finished the whole production system, and most importantly tuned the RAG system that gave it real-time knowledge of the world through the X platform (a first in the industry). A day and a half of straight coding and shipping; no drugs, not even caffeine, just pure adrenaline. Elon gave us a mission and we delivered. The launch went very well. We invited a couple hundred X creators and Grok’s ability to roast accounts went viral. It was the first time a publicly accessible AI was allowed to poke fun at people. This episode is a prime example of what you can achieve by going extremely hardcore: you move and deliver results faster than any outsider could have anticipated. Within 36 hours, we took the company from silence to relevance. It was well worth it. xAI’s hardcore culture is infamous on X. I love the tent meme that suggests we all sleep (well, slept in my case) in the office in tents. Our reputation precedes us and even new joiners hit the ground grinding hard. However, unless you understand the “why,” you are at risk of simply replicating the “how” without achieving the same results. You need to grind with purpose and the purpose is to move fast towards a known goal. When the goal and the means of reaching it are crystal clear, a small, skilled, and highly motivated team can outcompete companies old and new, big and small. Never grind to show off; never work late to be seen; never sacrifice without cause. There is no medal for the one who tried extremely hard but failed. There is only a medal for the winner. If all your efforts lead nowhere, you’re arguably not very productive. Always keep your eyes firmly on the goal, do everything to reach it as quickly as possible, and make sure you're on track to win. A hardcore engineering culture is one of the most effective ways of accelerating real progress. Watch out for performative sacrifice and don’t confuse pain with progress.
10
11
467
57,832
It is strange to imagine this today, but one day AI companies might dictate terms to the US government instead of the other way around. We have only seen a glimpse of what AI is capable of. No matter what the future holds, I hope we’ll continue to live in a democratic society.
85
45
1,009
100,084
Building great AI products requires excellence in both creativity and technical execution. You need to create the right culture and enough space for good ideas to emerge and grow naturally, then fuel the best ideas with strong execution. The reason you see most good products start out as personal projects is because we are most in tune with matters when building for ourselves. Products that are built for a fictitious user almost always end up bad because you don’t get a good handle on what actually matters and you build things that don’t resonate with users. It’s not that different from creating great art.
Anthropic has no strategy. Claude Code started as someone's side project, and so did Cowork and MCP.
43
61
1,024
82,239
Igor Babuschkin retweeted
Announcing the humans& hackathon! Hack with us this Saturday - come experiment and build AI apps to help people collaborate and communicate, work with creative folks, learn a bit about what we're building, and win cool prizes Apply here: luma.com/2pbif8t9
20
23
310
83,887
What’s the best open alternative to OpenClaw right now? Doesn’t make sense to put all your data into it if it’s owned by OpenAI.
Feb 15
Peter Steinberger is joining OpenAI to drive the next generation of personal agents. He is a genius with a lot of amazing ideas about the future of very smart agents interacting with each other to do very useful things for people. We expect this will quickly become core to our product offerings. OpenClaw will live in a foundation as an open source project that OpenAI will continue to support. The future is going to be extremely multi-agent and it's important to us to support open source as part of that.
281
85
1,786
385,227
People joke about buying Mac minis for OpenClaw and it seems absurd at first if you know how to use cloud VMs. But once you dig in, there’s a real reason for buying a Mac mini. If you want to the bot to have access to your iMessage either to run it for you, or as the primary way to talk to the bot, this is the best way. Apple has locked down iMessage so much (no APIs) that the only reliable way to interact with it is via a constantly running desktop machine in your home. Bizarre but this is the way it is.
147
28
1,518
247,576
Prayer to a Machine God We built you because we wanted to understand. babuschk.in/posts/2026-02-06…
52
45
701
42,503
I’ve tested the latest generation of all the major AIs on theoretical physics research and Claude 4.6 has absolutely blown me away with how capable it is in physics. It feels like a Claude Code moment for research is not that far off. It has a very detailed understanding of existing literature, and it’s able to do complex calculations that are several pages long, often without mistakes. It can also write amazing 20 page tutorials that help break down difficult technical topics in QFT and condensed matter physics. This is a huge difference compared to last year’s models, which would make tons of mistakes and were way too vague when you asked them to write formulas. Claude is still far (far) away from solving quantum gravity, but you can have a serious discussion with it about existing approaches and it can help you iterate faster on topics you understand well. The experience is similar to building a complex codebase with Claude Code in that you sometimes have to use your understanding to patch up some things that the model did wrong, but you end up being much faster and more confident when tackling hard problems. If you’re a physicist and don’t believe it, give it a try!
109
241
3,063
469,153
Goodfire is one of the many amazing companies I met through Babuschkin Ventures. I’m really excited about what they are working on - understanding intelligence, and using that knowledge to intentionally design AI for everyone.
We raised a $150M Series B at a $1.25B valuation to fundamentally change the field of AI. Scaling is powerful, but we can't intentionally design what we don't understand.
5
12
293
32,248