Building general purpose robotics at Amazon FAR. Previously Covariant CEO and Co-Founder, @OpenAI, @UCBerkeley PhD.

Joined December 2017
5 Photos and videos
Peter Chen retweeted
๐Ÿชœ What if humanoids could climb ladders and work on them straight out of simulation? Meet LadderMan: a perceptive system for zero-shot sim-to-real ladder climbing and on-ladder manipulation. Watch the humanoid climb, stabilize, and manipulateโ€”all in one system. ๐Ÿค–๐Ÿ‘‡
17
61
312
103,952
Peter Chen retweeted
World models are heavy. They don't need to be. Each frame is encoded as 1024 spatial tokens. What if it were just 1? In our #CVPR2026 Highlight from Amazon FAR, we compress frames into "delta" tokens for efficient generative world modeling. Paper, code & models below โ†“ (1/7)
12
74
597
55,663
Peter Chen retweeted
๐Ÿค–We are hiring multiple Summer'26 Research Interns at @amazon FAR to work on open-world navigation and robot foundation models, especially in neural rendering & simulation/predictive world models/reasoning & agency/real-world evaluation/long-term autonomy!
13
15
392
38,303
Peter Chen retweeted
Can humanoids perform agile, autonomous, long-horizon parkourโ€”based on what they see in the world? We present ๐—ฃ๐—ฒ๐—ฟ๐—ฐ๐—ฒ๐—ฝ๐˜๐—ถ๐˜ƒ๐—ฒ ๐—›๐˜‚๐—บ๐—ฎ๐—ป๐—ผ๐—ถ๐—ฑ ๐—ฃ๐—ฎ๐—ฟ๐—ธ๐—ผ๐˜‚๐—ฟ (๐—ฃ๐—›๐—ฃ): a framework that chains dynamic human skills using onboard depth perception for long-horizon traversal. 1/6
23
134
690
142,385
Peter Chen retweeted
New project! Flow Policy Gradients for Robot Control tldr; a simple online RL recipe for training and fine-tuning flow policies for robots co-led w/ @redstone_hong: hongsukchoi.github.io/fpo-coโ€ฆ
16
100
603
73,936
Peter Chen retweeted
We introduce a framework that enables robust, long-horizon bi-directional locomotion over complex terrains, by effectively leveraging a single policy with dual-depth camera streams, without the need for LiDAR-based elevation maps. Check out @Yuanhang__Zhang's tweet for architecture and implementation details, and the website for full, uncut rollouts.
Robust humanoid perceptive locomotion is still underexplored. Especially when different cameras see different terrains, paths get narrow, and payloads disturb balance... Introduce RPL, tackling this with one unified policy: โ€ข Challenging terrains (slopes, stairs and stepping stones); โ€ข Multiple directions; โ€ข Payloads; Trained in sim. Validated long-horizon in the real world. Watch the robot walk it all๐Ÿฆฟ Details below๐Ÿ‘‡
1
7
44
10,081
Peter Chen retweeted
Robust humanoid perceptive locomotion is still underexplored. Especially when different cameras see different terrains, paths get narrow, and payloads disturb balance... Introduce RPL, tackling this with one unified policy: โ€ข Challenging terrains (slopes, stairs and stepping stones); โ€ข Multiple directions; โ€ข Payloads; Trained in sim. Validated long-horizon in the real world. Watch the robot walk it all๐Ÿฆฟ Details below๐Ÿ‘‡
5
56
275
59,475
Peter Chen retweeted
Tired of waiting hours for humanoids to learn to walk? Our new technical report shows how to train sim-to-real humanoid locomotion in 15 minutes with FastSAC and FastTD3! The full pipeline is open-source in the newly released Holosoma codebase. Thread ๐Ÿงต
5
40
182
36,530
Peter Chen retweeted
Excited to share this latest work from our team! Holosoma is now our go-to option for humanoid research at FAR, and we will continue to maintain it and add new capabilities in the future. We're also hiring! Research: amazon.jobs/en/jobs/2850576/โ€ฆ Software: amazon.jobs/en/jobs/3043054/โ€ฆ

Sim-to-real learning for humanoid robots is a full-stack problem. Today, Amazon FAR is releasing a full-stack solution: Holosoma. To accelerate research, we are open-sourcing a complete codebase covering multiple simulation backends, training, retargeting, and real-world inference.
11
77
20,068
Peter Chen retweeted
Sim-to-real learning for humanoid robots is a full-stack problem. Today, Amazon FAR is releasing a full-stack solution: Holosoma. To accelerate research, we are open-sourcing a complete codebase covering multiple simulation backends, training, retargeting, and real-world inference.
20
132
598
212,900
Peter Chen retweeted
1 Dec 2025
Open-source: complete codebase covering multiple simulation backends, training, retargeting, and real-world inference. Infra built for humanoid, but also readily modified for quadruped (also included). Lots of infra gems/conveniences we rely on consistently. Hopefully equally helpful for others.
Sim-to-real learning for humanoid robots is a full-stack problem. Today, Amazon FAR is releasing a full-stack solution: Holosoma. To accelerate research, we are open-sourcing a complete codebase covering multiple simulation backends, training, retargeting, and real-world inference.
11
53
472
76,335
Peter Chen retweeted
5 Nov 2025
14/n Visuomotor policy learning ...but can also use feet to kick a T-shaped box to the target region, i.e., Humanoid Kick-T.
1
2
39
10,239
Peter Chen retweeted
5 Nov 2025
Excited to introduce TWIST2, our next-generation humanoid data collection system. TWIST2 is portable (use anywhere, no MoCap), scalable (100 demos in 15 mins), and holistic (unlock major whole-body human skills). Fully open-sourced: yanjieze.com/TWIST2
24
112
495
100,259
Peter Chen retweeted
Why override ยตP? Because its core assumptions only hold very early in training! In practice wide models quickly stop being more sensitive to weight updates than smaller models! This is caused by changes in the geometric alignment of updates and layer inputs over training. ๐Ÿงต6/8
2
10
68
42,067
23 Oct 2025
We investigated what enables muP's LR transfer across model sizes and found some new insights into large model optimization dynamics:
The Maximal Update Parameterization (ยตP) allows LR transfer from small to large models, saving costly tuning. But why is independent weight decay (IWD) essential for it to work? We find ยตP stabilizes early training (like an LR warmup), but IWD takes over in the long term! ๐Ÿงต
2
2
12
2,173
Peter Chen retweeted
10 Oct 2025
I've long wondered if we can make a humanoid robot do a ๐˜„๐—ฎ๐—น๐—น๐—ณ๐—น๐—ถ๐—ฝ - and we just made it happen by leveraging ๐—ข๐—บ๐—ป๐—ถ๐—ฅ๐—ฒ๐˜๐—ฎ๐—ฟ๐—ด๐—ฒ๐˜ with BeyondMimic tracking! This came after our original OmniRetarget experiments, with only minor tweaks to RL training: relaxing a termination threshold and removing one reward term. The policy achieved a ๐Ÿฑ/๐Ÿฑ success rate in our real-world experiments, showing the strength of high-quality, interaction-preserving motion retargeting combined with BeyondMimicโ€™s minimal RL tracking. Here is the updated arXiv: arxiv.org/abs/2509.26633 (In Sec. V. A)
Humanoid motion tracking performance is greatly determined by retargeting quality! Introducing ๐—ข๐—บ๐—ป๐—ถ๐—ฅ๐—ฒ๐˜๐—ฎ๐—ฟ๐—ด๐—ฒ๐˜๐ŸŽฏ, generating high-quality interaction-preserving data from human motions for learning complex humanoid skills with ๐—บ๐—ถ๐—ป๐—ถ๐—บ๐—ฎ๐—น RL: - 5 rewards, - 4 DR terms, - Proprio. ONLY, - NO history/curriculum. Ready for agile, human-like ๐Ÿค–? (Best with ๐ŸŽง) ๐Ÿ”— omniretarget.github.io ๐ŸŽฅ 1/9
175
534
3,877
1,050,991
Peter Chen retweeted
Super exciting news!! Welcome Nikita co and looking forward to working together!
๐ŸšจBig news - our team is joining the Frontier AI and Robotics (FAR) lab at @amazon to keep building the future of robotics. Excited to be joining such an inspiring team and to work with @peterxichen, @pabbeel, @rocky_duan, @akanazawa and many others at FAR.
2
3
63
19,195
Peter Chen retweeted
SOTA data generation from OmniRetarget SOTA formulation from BeyondMimic = mind-blowing performance
Humanoid motion tracking performance is greatly determined by retargeting quality! Introducing ๐—ข๐—บ๐—ป๐—ถ๐—ฅ๐—ฒ๐˜๐—ฎ๐—ฟ๐—ด๐—ฒ๐˜๐ŸŽฏ, generating high-quality interaction-preserving data from human motions for learning complex humanoid skills with ๐—บ๐—ถ๐—ป๐—ถ๐—บ๐—ฎ๐—น RL: - 5 rewards, - 4 DR terms, - Proprio. ONLY, - NO history/curriculum. Ready for agile, human-like ๐Ÿค–? (Best with ๐ŸŽง) ๐Ÿ”— omniretarget.github.io ๐ŸŽฅ 1/9
5
39
5,112
Peter Chen retweeted
Our grand finale: A complex, long-horizon dynamic sequence, all driven by a proprioceptive-only policy (no vision/LIDAR)! In this task, the robot carries a chair to a platform, uses it as a step to climb up, then leaps off and performs a parkour-style roll to absorb the landing. This pushes the boundaries of agile, human-like loco-manipulation! 7/9
5
28
155
31,635
Peter Chen retweeted
1 Oct 2025
Very excited to start sharing some of the work we have been doing at Amazon FAR. In this work we present OmniRetarget, which can generate high-quality interaction-preserving data from human motions for learning complex humanoid skills. High-quality re-targeting really helps the reinforcement learning. Why? The control policy optimization landscape is much nicer for (near)feasible trajectories than for trajectories with artifacts like (e.g.) foot skating or ground penetration.
Humanoid motion tracking performance is greatly determined by retargeting quality! Introducing ๐—ข๐—บ๐—ป๐—ถ๐—ฅ๐—ฒ๐˜๐—ฎ๐—ฟ๐—ด๐—ฒ๐˜๐ŸŽฏ, generating high-quality interaction-preserving data from human motions for learning complex humanoid skills with ๐—บ๐—ถ๐—ป๐—ถ๐—บ๐—ฎ๐—น RL: - 5 rewards, - 4 DR terms, - Proprio. ONLY, - NO history/curriculum. Ready for agile, human-like ๐Ÿค–? (Best with ๐ŸŽง) ๐Ÿ”— omniretarget.github.io ๐ŸŽฅ 1/9
21
42
343
93,068