Kush Hari

Kush Hari

16 Photos and videos

Tweets

Brent Yi retweeted

Kush Hari @KushtimusPrime

Apr 7

Our new work, STITCH 2.0, can perform consecutive running sutures to close a sample wound with the daVinci robot.

0:30

26,514

Tommie Kerssies

Brent Yi retweeted

Tommie Kerssies

@tommiekerssies

Apr 9

World models are heavy. They don't need to be. Each frame is encoded as 1024 spatial tokens. What if it were just 1? In our #CVPR2026 Highlight from Amazon FAR, we compress frames into "delta" tokens for efficient generative world modeling. Paper, code & models below ↓ (1/7)

Outline of DeltaWorld. Unlike large existing generative world models that require many forward passes and represent each frame with many spatial tokens, our small DeltaWorld generates multiple futures in a single forward pass by using a single delta token to encode the difference between consecutive frames.

ALT Outline of DeltaWorld. Unlike large existing generative world models that require many forward passes and represent each frame with many spatial tokens, our small DeltaWorld generates multiple futures in a single forward pass by using a single delta token to encode the difference between consecutive frames.

596

55,529

Neerja Thakkar

Brent Yi retweeted

Neerja Thakkar

@neerjathakkar

Apr 2

What’s the right representation for a world model? 3D, pixels, or something else? Excited to release our new paper “Forecasting Motion in the Wild” where we propose point tracks as tokens for generating complex non-rigid motion and behavior From @GoogleDeepmind @Berkeley_AI @TTIC_Connect

469

80,698

Max Fu

Brent Yi retweeted

Max Fu

@letian_fu

Apr 1

Robotics: coding agents’ next frontier. So how good are they? We introduce CaP-X: an open-source framework and benchmark for coding agents, where they write code for robot perception and control, execute it on sim and real robots, observe the outcomes, and iteratively improve code reliability. From @NVIDIA @Berkeley_AI @CMU_Robotics @StanfordAILab capgym.github.io 🧵

1:31

126

632

168,843

Kie Horiuchi

Brent Yi retweeted

Kie Horiuchi @kiehoriuchi

Mar 27

Excited to share our latest work on motion generation! We tackled multi-agent generation across diverse tasks using Diffusion Forcing. Check out the project page for more! 🚀

Vongani Maluleke @vonekels

Mar 27

When people share a space, their movements become intertwined. Embodied agents need to understand these social dynamics to interact effectively. Introducing MAGNet 🧲, a unified autoregressive diffusion forcing model for multi-agent motion generation that captures these interactions. MAGNet is flexible: predict the future, fill in missing motion, or have people react to each other, all while naturally scaling to N>2 people and generating ultra-long motion sequences.

0:38

1,067

Vongani Maluleke

Brent Yi retweeted

Vongani Maluleke @vonekels

Mar 27

0:38

372

66,251

Jimmy Lee

Brent Yi retweeted

Jimmy Lee

@wwwjim

Feb 26

So basically the most valuable thing to build right now is friendship

193

396

3,211

144,979

Kevin Zakka

Brent Yi retweeted

Kevin Zakka @kevin_zakka

Feb 27

The viser viewer in mjlab just got a huge QOL upgrade! - Real-time factor control: go slower or faster than real-time and viewer paces physics to match - Single step mode: advance one physics step at a time (super useful for debugging!) - Overall faster and smoother

0:39

181

7,495

Kevin Zakka

Brent Yi retweeted

Kevin Zakka @kevin_zakka

Feb 12

New in mjlab from the amazing @ki_ki_ki1: 8 new terrains and a viser-based terrain visualizer 😎

1:15

158

16,777

Grace Luo

Brent Yi retweeted

Grace Luo @graceluo_

Feb 9

We trained diffusion models on a billion LLM activations, and we want you to use them! New preprint: Learning a Generative Meta-Model of LLM Activations Joint work with @feng_jiahai, @trevordarrell, @AlecRad, @JacobSteinhardt. More in thread 🧵

0:07

192

1,436

221,558

Qiayuan Liao

Brent Yi retweeted

Qiayuan Liao @qiayuanliao

Feb 6

One of my favorite robot clips (filmed Oct 2025). You can train any crazy full-body motions like this with our open-source stack without changing any parameters. whole_body_tracking: github.com/HybridRobotics/wh… mjlab: github.com/mujocolab/mjlab/t…

0:07

413

37,085

Brent Yi

Brent Yi @brenthyi

Feb 6

New project! Flow Policy Gradients for Robot Control tldr; a simple online RL recipe for training and fine-tuning flow policies for robots co-led w/ @redstone_hong: hongsukchoi.github.io/fpo-co…

0:19

100

603

73,891

more replies

Brent Yi

Brent Yi @brenthyi

Feb 6

and DexMimicGen: x.com/SteveTod1998/status/18…

Zhenyu Jiang @SteveTod1998

1 Nov 2024

How can we scale up humanoid data acquisition with minimal human effort? Introducing DexMimicGen, a large-scale automated data generation system that synthesizes trajectories from a few human demonstrations for humanoid robots with dexterous hands. (1/n)

0:58

1,087

Brent Yi

Brent Yi @brenthyi

Feb 6

Thanks to everyone who worked on these projects 🙏

670