Haruki Nishimura

Tweets

Pinned Tweet

Haruki Nishimura @imp_aa

Apr 22

A huge shout-out to TRI's VLA team for the public release of VLA Foundry! You can take full control of VLA training with this fully open-sourced codebase, which comes with a nice GUI dashboard with rigorous policy comparison powered by STEP🪜 tri-ml.github.io/step/

Jean Mercat @MercatJean

Apr 22

Releasing VLA Foundry: an open-source framework that unifies LLM, VLM, and VLA training in a single codebase. End-to-end control from language pretraining to action-expert fine-tuning — no more stitching together incompatible repos.

0:34

7,778

Andrea Bajcsy

Haruki Nishimura retweeted

Andrea Bajcsy @andrea_bajcsy

Jun 9

We extended the deadline to *June 22nd*! Submit your coolest and craziest (in-progress or completed) works on generalist robot safety 😎 Workshop co-organized with the great team: @ArpitBahety @kensukenk @imp_aa @RobobertoMM @ianabraha @LihanZha

Arpit Bahety @ArpitBahety

May 19

Excited to announce our #RSS2026 workshop: "Rethinking What It Means to be 'Safe' for Generalist Robots"! 🛡️🤖 Have new work or videos of robot safety failures? Submit by June 12! 👇

1,660

Kensuke Nakamura

Haruki Nishimura retweeted

Kensuke Nakamura @kensukenk

May 20

What does it actually mean for a modern robot to be safe? As generalist robots move across tasks, environments, and users, safety must encompass many dimensions: collisions, semantic constraints, perceptions of safety, privacy, and more. Join our discussion at RSS 2026!

Arpit Bahety @ArpitBahety

May 19

Excited to announce our #RSS2026 workshop: "Rethinking What It Means to be 'Safe' for Generalist Robots"! 🛡️🤖 Have new work or videos of robot safety failures? Submit by June 12! 👇

1,590

Arpit Bahety

Haruki Nishimura retweeted

Arpit Bahety @ArpitBahety

May 19

Excited to announce our #RSS2026 workshop: "Rethinking What It Means to be 'Safe' for Generalist Robots"! 🛡️🤖 Have new work or videos of robot safety failures? Submit by June 12! 👇

5,256

Sergey Zakharov

Haruki Nishimura retweeted

Sergey Zakharov @ZakharovSergeyN

Apr 29

Releasing RecGen: a collaboration between @ToyotaResearch, @toyota_europe, and @UvA_Amsterdam tackling a core 3D vision challenge: reconstructing complete multi-object scenes (parts, poses, textures, even occluded geometry) from just 1 to a few RGB-D views. Trained purely on synthetic data, RecGen achieves SOTA on real-world robotics and 6D pose benchmarks, handling occlusions, symmetry, and complex interactions. A step toward scalable, high-fidelity digital twins for robotics, and better evaluation and training of generalist policies. reconstruction-by-generation…

0:11

220

26,948

Anirudha Majumdar

Haruki Nishimura retweeted

Anirudha Majumdar @Majumdar_Ani

Apr 25

I was thrilled to be back at @MIT for the Robotics Seminar! The talk recording is available now: Rethinking Robot Safety & Alignment in the Era of Generalist Policies youtu.be/pZM8sgLAye0?si=GG7t…

Anirudha Majumdar: Rethinking Robot Safety & Alignment in the Era of...

MIT - Nov. 21, 2025. Speaker: Anirudha Majumdar. Seminar title: Rethi...

youtube.com

9,253

Katherine Liu

Haruki Nishimura retweeted

Katherine Liu @robo_kat

Apr 23

A few interesting rollouts from the Foundry-QwenVLA-2.5B multi-task model on seen tasks in sim – a 🧵. I really like behaviors that involve non-prehensile manipulation, like the little nudges in StoreCerealBoxUnderShelf.

0:22

Jean Mercat @MercatJean

Apr 22

0:34

118

14,825

Sedrick Keh

Haruki Nishimura retweeted

Sedrick Keh @sedrickkeh2

Apr 23

Having control over upstream LLM/VLM training is key to training a good robotics model. We hope VLA Foundry opens the door for researchers and practitioners to answer questions they previously wouldn’t even have thought of asking if upstream pretraining was simply inherited!

Jean Mercat @MercatJean

Apr 22

0:34

3,528

Shun Iwase

Haruki Nishimura retweeted

Shun Iwase @s1wase

Apr 22

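VLA Foundry, the last project I was involved in at TRI, has finally been released! Not only can you easily try out different language models and vision models, you can also easily evaluate multiple tasks in a simulation environment built with Drake Blender. Please give it a try!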

Jean Mercat @MercatJean

Apr 22

0:34

117

15,169

Haruki Nishimura

Haruki Nishimura @imp_aa

Apr 22

This is hugely based on @das_princeton's implementation that came out of the collaboration between TLU tri.global/trustworthy-learn… and @Majumdar_Ani's group at Princeton out of an internship project!

Katherine Liu @robo_kat

Apr 22

This is actually a pretty big deal — we rely on @imp_aa’s implementations to tell when policies are statistically different than each other. If someone presents some quick mean-only results internally without the CLD analysis, you can be sure someone will eventually ask for it.

815

Jean Mercat

Haruki Nishimura retweeted

Jean Mercat @MercatJean

Apr 22

0:34

491

74,717

Haruki Nishimura

Haruki Nishimura @imp_aa

Apr 22

See also: "Statistical Thinking for Robot Policy Evaluation: From Rigorous A/B Testing to Effective Visualization" medium.com/toyotaresearch/st…

Statistical Thinking for Robot Policy Evaluation: From Rigorous A/B Testing to Effective…

By Haruki Nishimura and Masha Itkina

medium.com

250

Anirudha Majumdar

Haruki Nishimura retweeted

Anirudha Majumdar @Majumdar_Ani

Apr 14

Great to see @LeRobotHF using STEP as a tool for statistically rigorous policy comparison! arxiv.org/abs/2503.10966

Is Your Imitation Learning Policy Better than Mine? Policy...

Imitation learning has enabled robots to perform complex, long-horizon tasks in challenging dexterous manipulation settings. As new methods are developed, they must be rigorously evaluated and...

arxiv.org

LeRobot @LeRobotHF

Apr 7

Releasing the Unfolding Robotics blog! Time to unfold robotics: we trained a robot to fold clothes using 8 bimanual setups, 100 hours of demonstrations, and 5k GPU hours. Flashy robot demos are everywhere. But you rarely see the real story: the data, the failures, the engineering. We’re sharing everything: code, data, and details in the blog → huggingface.co/spaces/lerobo…

1:40

6,297

Haruki Nishimura

Haruki Nishimura @imp_aa

Apr 13

Congrats to the @LeRobotHF team on this remarkable contribution to the robotics community by open-sourcing "everything" including code, data, and all the valuable knowledge! Our TLU team at TRI is fortunate to have collaborated on statistical evaluation and analysis.

LeRobot @LeRobotHF

Apr 7

1:40

913

Zubair Irshad

Haruki Nishimura retweeted

Zubair Irshad @mzubairirshad

Apr 2

A really solid step toward scalable, high-quality robot data collection — Raiden, from colleagues at TRI @ZakharovSergeyN (and led by @s1wase) lowering the barrier to entry for bimanual data collection, with support for leader–follower setups and SpaceMouse teleop. Big highlight - it natively supports camera calibration and integrates TRI’s learned stereo depth model out of the box, with strong improvements over vanilla ZED SDK. If you're working on robot learning or data collection pipelines, definitely worth a look👇 tri-ml.github.io/raiden/

0:05

Sergey Zakharov @ZakharovSergeyN

Mar 24

Our 3D Vision team (3DGR) is releasing Raiden — a data collection toolkit for YAM robots. Built for scalable, high-quality data: supports leader–follower SpaceMouse teleop, multi-camera setups, and modern stereo depth (incl. TRI learned stereo). tri-ml.github.io/raiden/

1:02

172

15,371

Jean Mercat

Haruki Nishimura retweeted

Jean Mercat @MercatJean

Mar 24

Baking without premix.

1:35

9,982

Haruki Nishimura

Haruki Nishimura @imp_aa

Mar 11

Are you about to evaluate robot policies for your next paper, comparing your policy with baselines? Take a moment to review this article by @MashaItkina and myself, introducing practical tips on rigorous statistical analysis with easy-to-use Python tools: medium.com/toyotaresearch/st…

Statistical Thinking for Robot Policy Evaluation: From Rigorous A/B Testing to Effective…

By Haruki Nishimura and Masha Itkina

medium.com

2,420

Haruki Nishimura

Haruki Nishimura @imp_aa

Mar 11

We also highlight our open-source, plug-and-play plotting tool in Python, which extends STEP to multi-policy comparisons and concisely visualizes the output of the statistical testing.

150

Haruki Nishimura

Haruki Nishimura @imp_aa

Mar 11

STEP is open-sourced here: tri-ml.github.io/step/ Explore the new plotting tool and tutorial here: lnkd.in/gBReeEdH Working examples of our statistical analysis tool can be found in the recent co-training study here: arxiv.org/abs/2602.01067

105