Siheng Zhao

Siheng Zhao

5 Photos and videos

Tweets

Peter Chen retweeted

Siheng Zhao

@SihengZhao

Jun 8

🪜 What if humanoids could climb ladders and work on them straight out of simulation? Meet LadderMan: a perceptive system for zero-shot sim-to-real ladder climbing and on-ladder manipulation. Watch the humanoid climb, stabilize, and manipulate—all in one system. 🤖👇

0:32

312

103,952

Tommie Kerssies

Peter Chen retweeted

Tommie Kerssies

@tommiekerssies

Apr 9

World models are heavy. They don't need to be. Each frame is encoded as 1024 spatial tokens. What if it were just 1? In our #CVPR2026 Highlight from Amazon FAR, we compress frames into "delta" tokens for efficient generative world modeling. Paper, code & models below ↓ (1/7)

Outline of DeltaWorld. Unlike large existing generative world models that require many forward passes and represent each frame with many spatial tokens, our small DeltaWorld generates multiple futures in a single forward pass by using a single delta token to encode the difference between consecutive frames.

ALT Outline of DeltaWorld. Unlike large existing generative world models that require many forward passes and represent each frame with many spatial tokens, our small DeltaWorld generates multiple futures in a single forward pass by using a single delta token to encode the difference between consecutive frames.

597

55,663

Chen Feng

Peter Chen retweeted

Chen Feng @simbaforrest

Mar 10

🤖We are hiring multiple Summer'26 Research Interns at @amazon FAR to work on open-world navigation and robot foundation models, especially in neural rendering & simulation/predictive world models/reasoning & agency/real-world evaluation/long-term autonomy!

392

38,303

Zhen Wu

Peter Chen retweeted

Zhen Wu @zhenkirito123

Feb 17

Can humanoids perform agile, autonomous, long-horizon parkour—based on what they see in the world? We present 𝗣𝗲𝗿𝗰𝗲𝗽𝘁𝗶𝘃𝗲 𝗛𝘂𝗺𝗮𝗻𝗼𝗶𝗱 𝗣𝗮𝗿𝗸𝗼𝘂𝗿 (𝗣𝗛𝗣): a framework that chains dynamic human skills using onboard depth perception for long-horizon traversal. 1/6

0:30

134

690

142,385

Brent Yi

Peter Chen retweeted

Brent Yi @brenthyi

Feb 6

New project! Flow Policy Gradients for Robot Control tldr; a simple online RL recipe for training and fine-tuning flow policies for robots co-led w/ @redstone_hong: hongsukchoi.github.io/fpo-co…

0:19

100

603

73,936

Carlo Sferrazza

Peter Chen retweeted

Carlo Sferrazza

@carlo_sferrazza

Feb 5

We introduce a framework that enables robust, long-horizon bi-directional locomotion over complex terrains, by effectively leveraging a single policy with dual-depth camera streams, without the need for LiDAR-based elevation maps. Check out @Yuanhang__Zhang's tweet for architecture and implementation details, and the website for full, uncut rollouts.

Yuanhang Zhang @Yuanhang__Zhang

Feb 4

Robust humanoid perceptive locomotion is still underexplored. Especially when different cameras see different terrains, paths get narrow, and payloads disturb balance... Introduce RPL, tackling this with one unified policy: • Challenging terrains (slopes, stairs and stepping stones); • Multiple directions; • Payloads; Trained in sim. Validated long-horizon in the real world. Watch the robot walk it all🦿 Details below👇

2:10

10,081

Yuanhang Zhang

Peter Chen retweeted

Yuanhang Zhang @Yuanhang__Zhang

Feb 4

2:10

275

59,475

Younggyo Seo

Peter Chen retweeted

Younggyo Seo @younggyoseo

2 Dec 2025

Tired of waiting hours for humanoids to learn to walk? Our new technical report shows how to train sim-to-real humanoid locomotion in 15 minutes with FastSAC and FastTD3! The full pipeline is open-source in the newly released Holosoma codebase. Thread 🧵

0:22

182

36,530

Rocky Duan

Peter Chen retweeted

Rocky Duan @rocky_duan

2 Dec 2025

Excited to share this latest work from our team! Holosoma is now our go-to option for humanoid research at FAR, and we will continue to maintain it and add new capabilities in the future. We're also hiring! Research: amazon.jobs/en/jobs/2850576/… Software: amazon.jobs/en/jobs/3043054/…

Carlo Sferrazza

@carlo_sferrazza

1 Dec 2025

Sim-to-real learning for humanoid robots is a full-stack problem. Today, Amazon FAR is releasing a full-stack solution: Holosoma. To accelerate research, we are open-sourcing a complete codebase covering multiple simulation backends, training, retargeting, and real-world inference.

0:34

20,068

Carlo Sferrazza

Peter Chen retweeted

Carlo Sferrazza

@carlo_sferrazza

1 Dec 2025

0:34

132

598

212,900

Pieter Abbeel

Peter Chen retweeted

Pieter Abbeel

@pabbeel

1 Dec 2025

Open-source: complete codebase covering multiple simulation backends, training, retargeting, and real-world inference. Infra built for humanoid, but also readily modified for quadruped (also included). Lots of infra gems/conveniences we rely on consistently. Hopefully equally helpful for others.

Carlo Sferrazza

@carlo_sferrazza

1 Dec 2025

0:34

472

76,335

Yanjie Ze

Peter Chen retweeted

Yanjie Ze

@ZeYanjie

5 Nov 2025

14/n Visuomotor policy learning ...but can also use feet to kick a T-shaped box to the target region, i.e., Humanoid Kick-T.

0:24

10,239

Yanjie Ze

Peter Chen retweeted

Yanjie Ze

@ZeYanjie

5 Nov 2025

Excited to introduce TWIST2, our next-generation humanoid data collection system. TWIST2 is portable (use anywhere, no MoCap), scalable (100 demos in 15 mins), and holistic (unlock major whole-body human skills). Fully open-sourced: yanjieze.com/TWIST2

1:59

112

495

100,259

Atli Kosson

Peter Chen retweeted

Atli Kosson @AtliKosson

23 Oct 2025

Why override µP? Because its core assumptions only hold very early in training! In practice wide models quickly stop being more sensitive to weight updates than smaller models! This is caused by changes in the geometric alignment of updates and layer inputs over training. 🧵6/8

42,067

Peter Chen

Peter Chen

@peterxichen

23 Oct 2025

We investigated what enables muP's LR transfer across model sizes and found some new insights into large model optimization dynamics:

Atli Kosson @AtliKosson

23 Oct 2025

The Maximal Update Parameterization (µP) allows LR transfer from small to large models, saving costly tuning. But why is independent weight decay (IWD) essential for it to work? We find µP stabilizes early training (like an LR warmup), but IWD takes over in the long term! 🧵

2,173

Zhen Wu

Peter Chen retweeted

Zhen Wu @zhenkirito123

10 Oct 2025

I've long wondered if we can make a humanoid robot do a 𝘄𝗮𝗹𝗹𝗳𝗹𝗶𝗽 - and we just made it happen by leveraging 𝗢𝗺𝗻𝗶𝗥𝗲𝘁𝗮𝗿𝗴𝗲𝘁 with BeyondMimic tracking! This came after our original OmniRetarget experiments, with only minor tweaks to RL training: relaxing a termination threshold and removing one reward term. The policy achieved a 𝟱/𝟱 success rate in our real-world experiments, showing the strength of high-quality, interaction-preserving motion retargeting combined with BeyondMimic’s minimal RL tracking. Here is the updated arXiv: arxiv.org/abs/2509.26633 (In Sec. V. A)

0:20

Zhen Wu @zhenkirito123

1 Oct 2025

Humanoid motion tracking performance is greatly determined by retargeting quality! Introducing 𝗢𝗺𝗻𝗶𝗥𝗲𝘁𝗮𝗿𝗴𝗲𝘁🎯, generating high-quality interaction-preserving data from human motions for learning complex humanoid skills with 𝗺𝗶𝗻𝗶𝗺𝗮𝗹 RL: - 5 rewards, - 4 DR terms, - Proprio. ONLY, - NO history/curriculum. Ready for agile, human-like 🤖? (Best with 🎧) 🔗 omniretarget.github.io 🎥 1/9

0:54

175

534

3,877

1,050,991

Angjoo Kanazawa

Peter Chen retweeted

Angjoo Kanazawa @akanazawa

8 Oct 2025

Super exciting news!! Welcome Nikita co and looking forward to working together!

Nikita Karaev

@n_karaev

6 Oct 2025

🚨Big news - our team is joining the Frontier AI and Robotics (FAR) lab at @amazon to keep building the future of robotics. Excited to be joining such an inspiring team and to work with @peterxichen, @pabbeel, @rocky_duan, @akanazawa and many others at FAR.

19,195

Qiayuan Liao

Peter Chen retweeted

Qiayuan Liao @qiayuanliao

1 Oct 2025

SOTA data generation from OmniRetarget SOTA formulation from BeyondMimic = mind-blowing performance

Zhen Wu @zhenkirito123

1 Oct 2025

0:54

5,112

Zhen Wu

Peter Chen retweeted

Zhen Wu @zhenkirito123

1 Oct 2025

Our grand finale: A complex, long-horizon dynamic sequence, all driven by a proprioceptive-only policy (no vision/LIDAR)! In this task, the robot carries a chair to a platform, uses it as a step to climb up, then leaps off and performs a parkour-style roll to absorb the landing. This pushes the boundaries of agile, human-like loco-manipulation! 7/9

0:24

155

31,635

Pieter Abbeel

Peter Chen retweeted

Pieter Abbeel

@pabbeel

1 Oct 2025

Very excited to start sharing some of the work we have been doing at Amazon FAR. In this work we present OmniRetarget, which can generate high-quality interaction-preserving data from human motions for learning complex humanoid skills. High-quality re-targeting really helps the reinforcement learning. Why? The control policy optimization landscape is much nicer for (near)feasible trajectories than for trajectories with artifacts like (e.g.) foot skating or ground penetration.

Zhen Wu @zhenkirito123

1 Oct 2025

0:54

343

93,068