Joined January 2026
2 Photos and videos
Pinned Tweet
We’ve redesigned our docs with easy access to SDK reference, tutorials, support, and our newly updated cookbook---v0.3.0! Whether you’re writing your first training loop in Tinker or debugging async RL, we want to make it easier to find what you need.
8
23
276
42,307
Nemotron 3 Ultra from @nvidia is out today and available on Tinker day one! The flagship from the Nemotron family is built for long-running agents; @trajectorylabs have been using it in early access to power continual learning workflows.
Today we're shipping Nemotron 3 Ultra. A 550B MoE frontier-intelligence open model built for long-running agents. It delivers 5x faster inference and lowers the cost of complex agentic tasks by up to 30% versus other open frontier models.
5
8
66
7,299
Continual learning on real user data has been a major capability gap in AI. @trajectorylabs launched to bring continual learning to production, with Tinker part of what they're building on. Congratulations to Ronak, Michael, Arjun and the team!
Today, @MichaelElabd, @QuantumArjun, and I are excited to announce Trajectory. We are a research lab and product company building the platform for Continual Learning. Our platform unlocks the signal already sitting in product usage, so companies can continuously post-train large-scale agentic models that outperform the frontier. @trajectorylabs We’ve raised $15M from @Conviction, @BessemerVP, @radicalvcfund, @jeffdean, @drfeifei and more. We’re partnering with some of the best AI-native companies: @ClayRunHQ @Harvey, @DecagonAI, @mercor_ai, @RogoAI to power their agentic systems, some of which we are already in production with. We’ve brought together a world class research team from DeepMind, OpenAI, Apple, Meta Superintelligence, Amazon AGI, Scale AI, and an elite product team from Stripe and Figma. AI will never again start on day one. Every correction, every retry, every edit will make products smarter. This is Continual Learning.
5
4
112
12,445
The hard part of continual learning isn't getting the data, but training on a single rollout per task that's off-policy by the time you train. Trajectory's off-policy SDPO recipe stabilizes training and scales. The technical post is well worth the read. x.com/rronak_/status/2059644…

Replying to @rronak_
We have been exploring new algorithmic frontiers and are excited to share our contributions to Self Distillation Policy Optimization (SDPO) for agentic continual learning, check out our blog post here: trajectory.ai/field-notes/sc…
1
20
2,511
Tinker retweeted
Thinking Machines is impressive. In a couple hours I just fine tuned my own Qwen3.5-397B model this afternoon. Fast usable multimodal is also going to enable very mind-blowing personal AI.
People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/int…
115
201
2,981
428,403
Foresight Learning is a clever data recipe for training prediction: split a sequence of notes randomly into prediction context and outcome label. Train on Tinker and you get a lightweight adapter that beats GPT-5 on calibration and clinical reasoning. Congrats @lightningrodai!
New preprint from @lightningrodai! We trained AI to predict clinical events — ICU transfers, new diagnoses, complications, procedures, ventilation, mortality — directly from raw clinical notes. No labeled data required – Foresight Learning infers outcomes from what happens later in patient records. Using Tinker from @thinkymachines , we trained a lightweight adapter on GPT-OSS-120B, resulting in a specialized predictor that runs on a single GPU. Results: 🎯 ~70% lower calibration error 📈 Brier skill score: ~0% → 27% 🧠 84% win-rate vs the base model in blind reasoning review 🥇 Slightly better Brier than GPT-5, despite being a fraction of the size Hospitals and specialty clinics often treat unique patient populations that out-of-the-box models don't have training data for. This makes it possible to build frontier-quality predictors for highly specific patient groups, with nothing but raw clinical records. Congrats to the team — @indiequant @KSkotheim64001 🙌 Full paper 👇 arxiv.org/abs/2605.12817
2
9
87
15,868
Exa trains Qwen3-4B-Instruct to search using Tinker!
May 13
How does Exa compare to Google for training LLMs to search? In this blog post, we find that LLMs using Exa during reinforcement learning reach higher performance with 70% less training compute. exa.ai/blog/rl-search-outcom…
3
17
313
52,320
Tinker retweeted
People talk, listen, watch, think, and collaborate at the same time, in real time. We've designed an AI that works with people the same way. We share our approach, early results, and a quick look at our model in action. thinkingmachines.ai/blog/int…
464
1,959
15,782
7,746,448
Tinker retweeted
Apr 28
Meet Waldo: Glean’s first agentic search model. Built on @nvidia Nemotron 3 Nano and post-trained for search planning, Waldo figures out how to break down a query, which tools to call, what to read next, and when it has enough evidence to hand off.
5
12
103
98,273
:)
the @tinkerapi tutorials were really well put together thank you @thinkymachines folks this was really helpful for the project I'm working on
1
1
20
4,599
Kimi K2.6 from @Kimi_Moonshot and Qwen3.6-35B-A3B from @Alibaba_Qwen are now available on Tinker. Both models offer improvements in long-horizon agentic reliability over the previous versions, at two distinct points on the size-capability spectrum.
6
6
139
10,186
We’re also adding Qwen3.6-27B, a dense model for thorough fine-tuning alongside the 35B-A3B MoE.
2
5
2,126
Exciting work from @wzenus, supported by Tinker grants!
In Agent RL, models suffer from Template Collapse. They generate vast, diverse outputs (High Entropy) that lose all meaningful connection to the input prompt (Low Mutual Information). In other words, agent learn different ways to say nothing. 🚀 Introducing RAGEN-v2 -- Here's how we define and fix such silent failure modes in Agent RL. 🧵
1
28
15,800
Tinker retweeted
We built a new task to test AI research capabilities! Agents asked to use @tinkerapi from @thinkymachines to train a model on logic games. That involves writing full training pipeline, running experiments across recipes, and submitting the best model.
Replying to @ProximalHQ
FrontierSWE was built with collaborators from industry and academia to ensure that tasks are diverse and reflect real work engineers and researchers encounter. We specifically thank our partners @Modular, @PrimeIntellect and @thoughtfullab for their contributions
1
11
52
7,827
that's us!
Replying to @MatternJustus
Another task tests AI research capabilities: using @tinkerapi from @thinkymachines, agents are asked to post-train an agent to play logic games, which involves writing an entire training pipeline and running experiments with different recipes to finally submit the best model
1
2
46
3,902
Coding agents are racing towards strong performance over long horizons. @ProximalHQ's FrontierSWE throws down a rigorous benchmark, and we're thrilled that Tinker gets to play a part!
Introducing FrontierSWE, an ultra-long horizon coding benchmark. We test agents on some of the hardest technical tasks like optimizing a video rendering library or training a model to predict the quantum properties of molecules. Despite having 20 hours, they rarely succeed
8
74
8,680
Tinker for autoresearch (for golf):
I pointed Claude Code at a research task (build a golf forecasting system) and let it run for 49 hours on Tinker. No human in the loop. It ran 108 experiments. Here's the full trajectory, including the ones that made things worse.
3
159
19,767
Tinker retweeted
Apr 10
so many good tutorials on here would highly recommend checking it out if you haven't yet
We’ve redesigned our docs with easy access to SDK reference, tutorials, support, and our newly updated cookbook---v0.3.0! Whether you’re writing your first training loop in Tinker or debugging async RL, we want to make it easier to find what you need.
3
14
3,394
Tinker retweeted
okay yes nice that’s the type of learning material I love to see will def go through these
Replying to @tinkerapi
First, to get you started, we've created 23 tutorials to walk you from the API basics to advanced training techniques and deploying models into production. tinker-docs.thinkingmachines…
2
25
667
82,334
please note we did not pay brydon to say this (we pay him to do research)
I know it's self serving to say, but man I would've killed for a resource like Tinker and the tutorials, the cookbook, etc back when I was in undergrad. Following @karpathy blogs and training RNNs on a crappy Acer *was* fun, but doing bigger things with less setup is such a boon
5
2
112
13,597
We’ve redesigned our docs with easy access to SDK reference, tutorials, support, and our newly updated cookbook---v0.3.0! Whether you’re writing your first training loop in Tinker or debugging async RL, we want to make it easier to find what you need.
8
23
276
42,307
Two new distillation recipes: Self-distillation (SDFT) lets the model teach itself with top-K forward KL — no separate teacher needed. Multi-teacher off-policy distillation merges knowledge from multiple teachers into one student.
1
20
3,585