Chenhao Li

Chenhao Li

218 Photos and videos

Tweets

Pinned Tweet

Chenhao Li @breadli428

May 2

🌎We learn robot control policies from world models with real deployment. 1⃣ Robotic World Model (RWM) corrects errors through online interaction. 🔗sites.google.com/view/roboti… 2⃣ Uncertainty-Aware RWM adds uncertainty penalties during policy optimization. 🔗sites.google.com/view/uncert…

2:40

136

10,265

C. Zhang

Chenhao Li retweeted

C. Zhang @ChongZzZhang

Jun 10

Random comments 1) not having a action bound in SAC is technically bad, 2) but having joint limits as action bounds for position control learning is also technically bad: the policy loses the ability to generate large torques near joint limits. Maybe torque is better for SAC.

Robotic Systems Lab @leggedrobotics

Jun 10

Replying to @leggedrobotics

PPO has been the go-to algorithm for training robots in simulation. SAC is more sample-efficient in theory, but consistently fell short in practice. And while PPO thrives where data is cheap, it hits a hard wall when moving to real-robot learning. 🔥We set out to close that gap.

11,726

Chen Tessler

Chenhao Li retweeted

Chen Tessler

@ChenTessler

Jun 10

Replying to @leggedrobotics

Thanks for this! I want to start looking into off policy methods for humanoids and this is very useful 🙏🏼

1,155

Ethan Clark

Chenhao Li retweeted

Ethan Clark

@ethanmclark1

Jun 11

Super cool to see Tried using SAC for locomotion a few months ago and it kept breaking my head. The thing that got me was that asymmetric actor critic isn't equivalent across PPO and SAC. On policy the privileged critic magnifies the actors gradient. Off policy it sends it in a different direction entirely Most people would just accept SAC doesn't work here. Glad to see Big PPO finally getting antitrusted. Hopefully this sparks a revival of off policy algorithms

Robotic Systems Lab @leggedrobotics

Jun 10

PPO has long dominated robot locomotion training in simulation. SAC, despite its sample efficiency, couldn't keep up. We analyze why: 🔗sabagian.github.io/sac_relea… 🔥Integrated into RSL-RL, our approach requires only minimal changes, making SAC a drop-in alternative out of the box.

0:20

8,922

Chenhao Li

Chenhao Li @breadli428

Jun 10

sabagian.github.io/sac_relea…

3,791

Chenhao Li

Chenhao Li @breadli428

Jun 10

x.com/leggedrobotics/status/…

Robotic Systems Lab @leggedrobotics

Jun 10

0:20

987

Chenhao Li

Chenhao Li @breadli428

Jun 10

Today, we bring SAC to RSL-RL, one of the most widely used RL frameworks in massively parallel robot learning, developed at RSL @leggedrobotics. We try to understand the long-standing performance gap between SAC and PPO, and crystallize important factors sabagian.github.io/sac_relea…

Robotic Systems Lab @leggedrobotics

Jun 10

0:20

137

10,870

Chenhao Li

Chenhao Li @breadli428

Jun 10

After many ablations, we left four important factors that make a difference ✅Right-sizing the action space ✅Treating timeouts as timeouts, not failures ✅Smoother targets via n-step returns ✅Starting exploration where it should Check out our findings arxiv.org/abs/2605.24975

Bridging the Gap: Enabling Soft Actor Critic for High Performance...

Proximal Policy Optimization (PPO) has become the de facto standard for training legged robots, thanks to its robustness and scalability in massively parallel simulation environments like...

arxiv.org

782

Chenhao Li

Chenhao Li @breadli428

Jun 10

This project is led by Gianluca Sabatini, supported by Chenhao Li @breadli428 and Marco Hutter @leggedrobotics. We thank Clemens Schwarke's implementation insights. github.com/leggedrobotics/rs…

GitHub - leggedrobotics/rsl_rl_sac: Bridging the Gap: Enabling Soft Actor-Critic for High Perform...

Bridging the Gap: Enabling Soft Actor-Critic for High Performance Legged Locomotion - leggedrobotics/rsl_rl_sac

github.com

572

Chenhao Li

Chenhao Li @breadli428

Jun 9

Crazy! A new modality!

York Kang

@york1to

Jun 5

Replying to @yacineMTB

0:12

4,375

Chenhao Li

Chenhao Li @breadli428

Jun 8

It’s a very first attempt to leverage pure generated videos to enable learning on physical platforms. If one can close the loop from physics grounding back to video generation, then we have a self-evolving system.

Robots Digest 🤖

@robotsdigest

Jun 8

NIL is building a long-term research agenda around natural intelligence rather than treating AI as a pure scaling problem. The lab sits at the intersection of machine learning, cognitive science, neuroscience, and robotics, asking a different question: what principles make intelligent behavior emerge in biological systems, and how can those principles be engineered into artificial ones?

4:42

2,447

Chenhao Li

Chenhao Li @breadli428

Jun 8

Time to use AI-generated demonstrations!

Jie Wang

@JieWang_ZJUI

Jun 8

I am so sad that I missed it, check out cool works in robot learning from ETH folks!

2,719

Chenhao Li

Chenhao Li @breadli428

Jun 8

🤷What if we want to learn from human data... without human data? In our work NIL (No-data Imitation Learning) @CVPR, we explore a simple but ambitious question: Can robots learn directly from AI-generated videos without any curated demonstration data? 🔗nil.is.tue.mpg.de/

4:42

168

13,669

more replies

Chenhao Li

Chenhao Li @breadli428

Jun 8

🗣️ This work a joint work ETH Zurich @ETH_en, MPI for Intelligent Systems @MPI_IS and @ETH_AI_Center, led by Mert Albaba @brtmertalb, Chenhao Li @breadli428, Markos Diomataris @Markos11571524, Omid Taheri, Andreas Krause @arkrause, and Michael J. Black @Michael_J_Black.

456

Chenhao Li

Chenhao Li @breadli428

Jun 8

If you did not catch us in Denver @CVPR, shoot @brtmertalb or me a message here!

341

Eric Rosen

Chenhao Li retweeted

Eric Rosen @_ericrosen

Jun 2

😍I love the usage of the arms to balance the climb! More contact means making use of all parts of the humanoid!

Chenhao Li @breadli428

Jun 2

⛰️We try to push motion learning beyond what one can do on flat ground. Kudos to Zewei Zhang @ctki49, Kehan Wen @KehanWen170077, and Michael Xu @mxu_cg, who made this a reality. Check out now wholebodylocomotion.github.i…

2:55

1,846

Takahiro Miki

Chenhao Li retweeted

Takahiro Miki @ki_ki_ki1

Jun 2

Cool work done by my students @ctki49 @KehanWen170077 Perceptive motion generator tracker on rough terrain.

Chenhao Li @breadli428

Jun 2

❗️Flat-terrain tracking is solved. Rough terrain breaks everything, because the reference itself has to change. So we built a system that generates references as it goes. 🦿Parkour over boxes, hurdles, stairs - all onboard. 🔗 wholebodylocomotion.github.i… 📄 arxiv.org/abs/2604.17335

2:55

6,525