Research & Engineering @togethercompute; Ex @YandexResearch; Building Products; Sapere aude!

Joined June 2020
Pinned Tweet
New research from @couplefire12 and me on training LLMs to reason from expert demonstrations — with no verifiers and no preference labels. We do a GAN-like training via Inverse Reinforcement Learning. Promising results. Take a look!
11 Dec 2025
RL for reasoning often relies on verifiers: great for math, but tricky for creative writing or open-ended research. Meet RARO: a new paradigm that teaches LLMs to reason via adversarial games instead of verification. No verifiers. No environments. Just demonstrations. 🧵👇
10
764
Ivan Provilkov retweeted
Mar 25
🇪🇺eu/acc After 2 years in existence, the first 4 points of the @euacc manifesto, which was crowdsourced by all of you, are now passed as laws. That means 1/3rd of eu/acc's points is complete:
✅ 1. Reduce regulatory burden for startups
✅ 2. Make skilled immigration easier, unskilled harder
✅ 3. Repeal the cookie law
✅ 4. European Inc: a single pan-EU business entity
Now for the next 8 objectives:
🔲 6. Tax discount during startup phase
🔲 7. Tax stock options when sold, not when exercised
🔲 8. Embrace AI and technology, don't fight it
🔲 9. Champion free speech, don't censor it
🔲 10. Reform bankruptcy laws to empower entrepreneurs
🔲 11. Make English the primary language of the European Union
🔲 12. Teach AI and tech in European schools and universities
34
49
470
39,169
Ivan Provilkov retweeted
Together Fine-tuning now supports tool calling, reasoning, and vision-language model fine-tuning. Train models up to 1T parameters with up to 6x higher throughput on MoE architectures.
2
6
14
2,520
A really nice deep dive showing how fine-tuning using Together AI can produce a model that outperforms GPT-5.2 on a given task, while also being 10× cheaper and 15× faster.
How to fine-tune OS LLM judges to outperform GPT-5.2! 🔥 We trained GPT-OSS 120B on 5,400 preference pairs to beat GPT-5.2's accuracy
> superior performance
> 15x lower cost
> 14x faster speeds
Code deepdive below👇
4
7
473
Ivan Provilkov retweeted
2/ Together Evaluations is a unified framework for assessing LLM quality. Teams can:
• compare open models to OpenAI/Anthropic/Google
• make better decisions on prompting vs. fine-tuning
• track quality improvements over time
Read more: together.ai/blog/together-ev…
1
1
4
1,320
A cool project for video generation and editing from my friend
Introducing vargai/sdk - JSX for AI Video. Declarative programming language for Claude Code. AI Agent writes JSX, you get videos ✦ 🧵
1
52
Ivan Provilkov retweeted
I used to think Sapiens was a great book. Sweeping, provocative, the kind of book that makes you feel like you finally understand the big picture of human history. It's on every CEO's bookshelf, assigned in universities, praised as a masterwork of synthesis. Yuval Noah Harari is treated as one of the serious thinkers of our time.
But something nagged at me. Some passages felt off. Claims that human rights are just figments of our collective imagination, not real things, just stories we tell ourselves. That nations, laws, money, and justice don't exist outside our heads. That meaning itself is a delusion we've invented to cope. That we're far more powerful than ever before but not happier. That hunter-gatherers had it better because they had no dishes to wash, no carpets to vacuum, no nappies to change, no bills to pay.
That sounded depressing to me, but was it perhaps just the realistic scientific worldview? What it meant to see the world clearly, without comforting illusions.
Then I read The Beginning of Infinity by @DavidDeutschOxf. Deutsch has a concept he calls 'bad philosophy.' Not philosophy that's merely false, but philosophy that actively prevents the growth of knowledge. Ideas that close doors rather than open them. That make problems seem unsolvable by design.
After soaking in Deutsch's framework (it's dense, a bit like digesting a delicious whale), it becomes clear: Harari's books are riddled with bad philosophy. They're smuggling nihilism in under the guise of scientific objectivity. Some examples:
On meaning: "Human life has absolutely no meaning. Humans are the outcome of blind evolutionary processes that operate without goal or purpose... any meaning that people ascribe to their lives is just a delusion."
On human rights: "There are no gods in the universe, no nations, no money, no human rights, no laws, and no justice outside the common imagination of human beings."
On free will: "Humans are now hackable animals. The idea that humans have this soul or spirit and they have free will, that's over."
On progress: "We thought we were saving time; instead we revved up the treadmill of life to ten times its former speed." The Agricultural Revolution? "History's biggest fraud." We didn't domesticate wheat, "it domesticated us."
On our cosmic significance: "If planet Earth were to blow up tomorrow morning, the universe would probably keep going about its business as usual. Human subjectivity would not be missed."
On the future: "Those who fail in the struggle against irrelevance would constitute a new 'useless class.'" Homo sapiens will likely "disappear in a century or two."
This is bad philosophy. It tells us our problems are cosmically insignificant, our solutions are illusions, and that progress is neither desirable nor within our control.
It's also perfect nonsense. No one would ever go back to being hunter-gatherers. Would you rather worry about your kid spending too much time on Roblox, or face the 50% chance she won't reach puberty? And our so-called "fictions"? They ended slavery. They gave women equal rights. They solved hunger. They eradicated smallpox. They turned sand into computer chips. They got us to the moon, and hopefully soon, to Mars and beyond. These "fictions" are already reshaping the universe, and over time they may become the most potent force in it.
Now compare Deutsch:
"Humans, people and knowledge are not only objectively significant: they are by far the most significant phenomena in nature."
"Feeling insignificant because the universe is large has exactly the same logic as feeling inadequate for not being a cow."
"Problems are soluble, and each particular evil is a problem that can be solved."
"We are only just scratching the surface, and shall never be doing anything else. If unlimited progress really is going to happen, not only are we now at almost the very beginning of it, we always shall be."
Where Harari sees a species of deluded apes stumbling toward obsolescence, Deutsch sees universal explainers, the only entities we know of capable of creating explanatory knowledge, solving problems, and potentially seeding the universe with intelligence.
The difference isn't academic. Ideas shape action. If you believe life is meaningless, progress is a trap, and humans are hackable animals with no free will, how does that affect what you build? What you fight for? What you teach your children?
Harari's books sell because they flatter a fashionable pessimism. They let readers feel sophisticated for seeing through the "delusions" everyone else lives by. That smug cynicism is corrosive. And it's everywhere: in schools, in media, in bestselling books. More than half of young adults now say they feel little to no purpose or meaning in life. This is what happens when you teach an entire generation bad philosophy. Less progress, less health, less wealth. Less flourishing. And ultimately, a higher chance that civilization and consciousness go extinct.
Fortunately, there's another equally well-written, but much truer, account of Homo sapiens, appropriately titled 'The Beginning of Infinity'. And this one smuggles no despair in by the backdoor.
But let's give Harari credit where it's due. He is right about one thing: if planet Earth blew up tomorrow, we wouldn't be missed. Because there'd be no one left to miss us, just a careless universe, blindly obeying physical laws. We are the only ones who can miss, but we're not going to. We're going to aim, hit, and keep going.
Full credit for the amazing meme to @Ben__Jeff
862
1,479
9,160
901,882
Ivan Provilkov retweeted
No verifiers? No problem. 🤝 The Together Research team is excited to introduce RARO — a new paradigm that unlocks scalable reasoning. By teaching LLMs to reason through adversarial games, we're seeing promising results where standard RL fails. Check it out now and let us know if you're interested in trying RARO to train reasoning models: forms.gle/Rrrs52MZHJZVuHo49
11 Dec 2025
RL for reasoning often relies on verifiers: great for math, but tricky for creative writing or open-ended research. Meet RARO: a new paradigm that teaches LLMs to reason via adversarial games instead of verification. No verifiers. No environments. Just demonstrations. 🧵👇
2
9
2,827
Thank you! Anything you can demo, you can explain, reason about, and then hillclimb — that’s the high-level idea behind this research direction. However, it still requires a lot of tuning and scaling.
13 Dec 2025
Anything you can demo you can hillclimb now. It's kind of over.
2
59
If you have a good dataset/task in mind that you’re interested in, and it (a) requires reasoning and (b) is hard to build a fast experiment/verification system for, please share a link!
12 Dec 2025
Woke up to some amazing feedback, thanks everyone!! @provilkov and I are working hard to release a plug-and-play RARO repo soon — what domains do you want to see supported? If you have specific model/dataset requests, let us know in the announcement thread! 👇
1
38
Ivan Provilkov retweeted
excited to be partnering with amazing folks @togethercompute, @ZainHasan6 and @provilkov to bring dynamic agent simulations to together evals.
Together AI 🤝@CollinearAI Introducing TraitMix, Collinear’s simulation product empowering teams to generate persona-driven AI agent interactions. 🔌Plug these interactions into your workflows and evaluate their effectiveness with Together Evals. Details: bit.ly/43GHJhR
2
2
22
2,803
Ivan Provilkov retweeted
16 Sep 2025
🚀Now you can fine-tune LLMs from the @huggingface hub using @togethercompute!🔥
• Public + private repos
• CausalLMs <100B params
• Push tuned models back to the Hub
Smaller, open models + smart fine-tuning > bigger closed ones. Link below👇
1
2
7
424
Ivan Provilkov retweeted
🚨 Stop shipping LLMs blind. Together Evaluations is here — fast, flexible, LLM-as-a-judge-based benchmarking to: ✅ Compare model outputs ✅ Score responses against your own criteria ✅ Classify outputs into custom labels — from safety to sentiment Run our early preview today with any serverless model — more support coming soon. Learn more (links below!)
4
4
27
6,358
Ivan Provilkov retweeted
Announcing DeepSWE 🤖: our fully open-sourced, SOTA software engineering agent trained purely with RL on top of Qwen3-32B. DeepSWE achieves 59% on SWEBench-Verified with test-time scaling (and 42.2% Pass@1), topping the SWEBench leaderboard for open-weight models. Built in collaboration with the @Agentica_ team. 💪 DeepSWE is trained with rLLM, Agentica’s modular RL post-training framework for agents. rLLM makes it easy to build, train, and deploy RL-tuned agents on real-world workloads — from software engineering to web navigation and beyond. 🤗 As always, we’re open-sourcing everything: not just the model, but the training code (rLLM), dataset (R2EGym), and training recipe for full reproducibility. 🔥 Train DeepSWE yourself. Extend it. Build your own local agents. No secrets, no barriers. DeepSWE and rLLM mark our major shift: from training language reasoners to building language agents that can truly learn from experience. We believe the future of AI lies in experience-driven learning — and we’re here to democratize it. Welcome to the era of experience. 🌍
7
78
495
270,740
Ivan Provilkov retweeted
.@togethercompute is building 2 gigawatts of AI factories (~100,000 GPUs) in the EU over the next 4 years, with the first phase live in H2 2025. AI compute is at <1% saturation relative to our 2035 forecast, and we are starting early to build a large-scale sustainable AI cloud in the US and abroad with global partnerships. European companies can reserve this capacity today: together.ai/gpu-clusters Our blog: together.ai/blog/together-ai… Data Center Dynamics Coverage: datacenterdynamics.com/en/ne…
6
18
184
34,620
Ivan Provilkov retweeted
🐋 🚨 DeepSeek R1-0528 is now live on Together AI!
4
5
56
20,257
Ivan Provilkov retweeted
🛠️ Your AI models shouldn’t be static—they should evolve with your users. Introducing Together Fine-Tuning with Direct Preference Optimization & Continued Training: build custom models that continuously adapt. Details below 👇
3
32
45
8,055
I'll be there at the beginning of February. Join if you're interested
26 Dec 2024
Welcome to ZuGrama!!! A 6-week residency in Kerala where builders, learners, and dreamers come together to create the future.
🏡 Themes & Tracks:
⚖️ Governance: Jan 6 - Jan 12
🚚 Impact & Public Goods: Jan 13 - Jan 19
⚕️ Longevity (Biotech, DeSci): Jan 20 - Jan 26
💻 Deeptech: Jan 27 - Feb 2
🤖 AI: Feb 3 - Feb 9
💸 Cryptography: Feb 10 - Feb 16
⏰ Applications close in just 5 days! So hurry up, those of you still on the sidelines. 🔗 zugrama.org/
If you had 6 weeks to innovate on a problem, what would you build? 🛠️ Reply to us in the comments below ⬇️ The best idea is in for a surprise 🎁
🌴✨ Cultural Amalgamation Meets Tech Innovation! For 6 weeks, Kerala’s serene beauty 🌊 will host the brightest minds in tech:
👩‍🎓 Students: Build, innovate, & learn.
💼 Professionals: Brainstorm, connect with founders, & launch impactful projects.
A blend of tradition & tech awaits. Join us at #ZuGrama! 🚀 #Kerala #AI #DeSci #CryptoCommunity #Biotech #longevity #Governance
Also, do check with communities in your region to learn more about us. Thanks to them for their support in helping spread the word on innovation & technology. @FoundershipHQ @blockchainedind @pune_dao @Web3_kerala @Web3Panjab @0xRabble @hyderabaddao @Lucknow_DAO @Web3Chennai @web3meetups @Web3Assam @ActualOnexyz
We'd appreciate some 🫶: please share it in your network & circles.
1
10
717