0.03%er. Placing Ghosts into Shells.

Joined February 2022
253 Photos and videos
Pinned Tweet
> Laughs in i told you not to externalize your institutional knowledge to model labs that will cannibalize your business model which you de-risked and PMFed for free
When Fable 5 is used for frontier LLM development, it does not notify the user and instead limits the model’s capabilities through methods such as prompt modification, steering vectors, and PEFT. Anthropic estimated that this would affect approximately 0.03% of traffic.
2
90
Tensor Templar retweeted
Talking to other researchers in the "learning from human video" space early this year, a common observation was that it's hard to show transfer from true in-the-wild Internet video, compared to curated research datasets. In most of these datasets, the humans deliberately move like robots, and hands are tracked with clean 3D labels. The usual starting point for Internet video — run a monocular hand pose estimator on YouTube videos, cotrain with robot data — often doesn't work. In our recent work, we study this "YouTube-type" video setting and try to understand what it takes to absorb egocentric Internet videos into a VLA training pipeline. 1/n
2
55
475
8,179
Tensor Templar retweeted
Jun 13
This is going to have an opposite effect that the decels want. It's a huge open source AI accelerant, an accelerant for corporations, and enterprises Now it's a real race. You either make your own AI infrastructure or you don't have a seat at the table
97
204
2,562
92,015
Tensor Templar retweeted
To me this paints Anthropic clearly as anti science, and therefore anti progress and anti safety.
10
13
414
12,649
Tensor Templar retweeted
Flow reversal steering allows "steering" diffusion-based VLAs with high-level actions, for example from VLM reasoning. This also lets us run RL in the diffusion noise space with exploration guided by high-level reasoning: think through a task, then practice it! 👇
5
57
529
56,631
Lmao
Was using Fable 5 to write my world model training code. Anthropic flagged it as frontier AI research. The steering vector kicked in and it started implementing JEPA 🤨
2
60
I feel like we should start having tps / active params / price per output tokens / tokens per task on these benchmarks already
Replying to @claudeai
Fable 5 is state-of-the-art on nearly all tested benchmarks, with exceptional performance in software engineering, knowledge work, scientific research, and vision. The longer and more complex the task, the larger Fable 5’s lead over our other models.
1
578
Great talk by @yukez at #ICRA2026. In the VLA's own words it would probably be "the reports of my death are greatly exaggerated". I asked for intuitions on limits of WAMs vs. VLM/A, given agents use language priors to reason and navigate complex plans and (video) WAMs never learn these: - VL parts would be great to keep, not throw out - Language is a somewhat naive interface between the two, latents are probably better, open research question - @Ken_Goldberg's plenary highly related - backbone quality is key - (relatively) expensive to train WAMs, harder to make fast, but they got to 30 fps with nvpf4 and tricks. I would add: - temporal priors very easy to teach with WAMs, but hard with VLAs - lots of spatial graphs / navigation and memory work was presented in other keynotes all of which used language interfaces / abstract language reasoning - interactions with humans need VL, can't pass a video to the human for every question
1
2
1,213
Tensor Templar retweeted
🚨 BREAKING: EUROPE DISCOVERS AMBITION
At microagi we want to put Europe at the front of physical and embodied AI. That’s why we’ve created the microagi research fellowship. Fellows can build in our lab in Zurich or Munich, work with our team, our datasets and our partners like Unitree, Nvidia, and Google. If you know someone who should be focussing on physical AI without any distractions, tag them, if you’re the first to tag someone who is accepted we’ll send you 10.000 USD
2
4
29
6,831
You throw the dart roughly somewhere and its always hits bullseye, is this claude code for showing off?
3
406
Tensor Templar retweeted
We're launching the microagi Research Fellowship. Fellows get up to $2M in compute, robotics hardware, our evals, and one of the largest physical AI datasets ever assembled. You build in our lab, with our team, alongside partners like Unitree, Nvidia, and Google Cloud. The hard part of AI left is physical. That's the part we're working on. Come build with us. One more thing: know someone who belongs here? Reply with their name. If they get in, we send you 10.000 USD
60
58
523
163,113
The two main takeaways from the #ICRA2026 keynotes RL track, to bring robots it into production, in case you missed it, were: "Is RL even needed? (no)" and "Just RL all the way". I do agree
1
142
This hand is a hidden gem, not quite Sharpa yet but we liked it a lot and one of the few that visualized shear forces nicely. Here is a vid of me playing around with it
So pretty. Xynova's Flex 2 hybrid-driven dexterous hand at #ICRA2026 It's almost like a human hand. I hope that one day it can serve as a prosthetic hand for those in need. x.com/cyberrobooo/status/205…
5
23
2,825
Breaking: VCs discover asking claude for business ideas gives everyone the same business ideas.
Thought experiment: if every company suddenly had infinite free compute, what new products would emerge? My take: with very few exceptions, not much would change. The bottleneck is figuring out what people want, and it’s not so easy to apply compute to solve that.
1
93
AI2 are consistently the most open players in AI for several years now. From RLVR, tulu, molmo, to evals molmospaces, rewardbench, to VLAs molmobot trained in sim, molmoact1/2 - just persistently good research taste and they don't shy away from risky and challenging research.
May 28
MolmoAct 2 artifacts have been downloaded 400K times in under 1 month. Today we're opening up the full code & training data: everything you need to fine-tune or build on our fully open robotics foundation model. 🧵
1
144
The internets: the west is falling behind in humanoid robot hardware BD: hold my fridge...
You can’t lift a fridge with just your hands. Your whole body needs to conform to its shape, and bear the load between your arms and torso. Here, @BostonDynamics' Atlas uses proprioception to manage the whole-body interaction and adapt to a shifting 100 lb load. Enabling this type of high performance manipulation is exactly why we walked away from what was arguably the world’s best implementation of MPC for humanoids, and shifted entirely to RL without looking back. This level of whole-body controls is a fundamental building block of physical intelligence and key to the value proposition of humanoids. More technical details in: Blog: bostondynamics.com/blog/trai… Behind the scenes video: youtu.be/xKK5ze3FukQ
91
Lets regulate AI in the west more so we can collectively avoid the permanent underclass and slide right into the under-underclass. This should pair nicely with the existing energy and population crisis and fit into pre-existing legal deindustrialization frameworks.
JUST IN: Senator Elizabeth Warren urgently calls for AI regulations to prevent a “permanent underclass”
1
49
Very nice to see more robotics startups looking at the human interaction side of deployments
Twolabs is building humanoid robots for caregiving. They’re starting in nursing homes, where caregivers are overworked and elderly residents often need more support than the system can provide. Most robots are useful but not social, or social but not useful. @twolabsai is combining both.
49
People are not ready for the mountains of Claude.md bloat and custom harnesses that can be deleted if this works at scale.
Reward hacking is the hardest problem in RL. We design settings where hacking is predictable, and find patterns between task difficulty and hack frequency. These runs are highly efficient, using <$1 in compute. We’re launching Sprints to allow everyone to join this effort.
76