Thomas Weng

Thomas Weng

25 Photos and videos

Tweets

David Held retweeted

Thomas Weng

@thomas_weng

May 11

My first last-author paper, with @lakshitadodeja and @rai_inst collaborators! When deploying Behavior Cloning (BC) policies, we encounter performance gaps due to compounding error, subtle task shifts, etc. We want to be able to post-train BC policies with a few online interactions to quickly improve performance. Q2RL is what we hoped for in a robot post-training algorithm: - ✅it starts with BC policy rollouts (no need for large, diverse datasets), and - ✅it's sample-efficient enough for real-world online reinforcement learning! See the thread and linked website for more details. Paper and code are released!

Lakshita Dodeja

@lakshitadodeja

May 11

Can BC policies be quickly improved through real world experience? Our new #RSS2026 paper proposes Q2RL, a method that bridges BC and RL for on-robot learning. Q2RL improves BC policies by up to 3.75x with just 1-2 hours of online interaction! So when life gives you BC, make Q-functions! 🍋 Details in thread 🧵

0:09

11,362

Tal Daniel

David Held retweeted

Tal Daniel @TalDaniel8

Apr 22

I’m attending #ICLR2026 in Rio this week to present LPWM! Friday April 24 Poster Session 3 10:30, Oral Session 4B 3:15p. Happy to chat about self-sup object-centric learning and world models. I’ll be on the job market soon and looking for exciting opportunities! #ICLR @iclr_conf

Tal Daniel @TalDaniel8

Mar 6

🚀 #ICLR2026 Oral 💥 How can we design world models that capture object interactions directly from pixels? Introducing Latent Particle World Models-the first end-to-end self-supervised, object-centric world model, trained from videos, supporting action/img/lang conditioning. 1/n

0:15

6,116

Guanya Shi

David Held retweeted

Guanya Shi

@GuanyaShi

Mar 25

I’m so tired of writing rebuttals to this kind of “lack of novelty” review: “This paper trivially combines A, B, and C, so the algorithmic novelty is limited.” Technically, most (if not all) robotics papers are convex combinations of existing ideas. I still deeply appreciate A B C papers—especially when they deliver: - New capabilities: the “trivial combination” unlocks behaviors we simply couldn’t achieve before - Sensible & organic design: A B C is clearly the right composition—not some arbitrary A′ B C′ - Nontrivial interactions: careful analysis of the dynamics, coupling, or failure modes between A, B, C - Rehabilitating old ideas: A was dismissed for years, but paired with modern B/C, it suddenly works—and teaches us why - System-level & "interface" insight: the contribution is not any single piece, but how the pieces talk to each other - Scaling laws or regimes: identifying when/why A B C works (and when it doesn’t) - Engineering clarity: making something actually work robustly in the real world is not “trivial” - New problem formulations: sometimes the real novelty is in the reformulation—only under this view does A B C make sense. Maybe worth keeping these in mind when reviewing the next A B C paper : )

121

983

115,082

David Held

David Held @davheld

Mar 18

Excellent analysis by @Majumdar_Ani, recommended read!

Anirudha Majumdar

@Majumdar_Ani

Mar 17

x.com/i/article/203356458793…

4,473

Tal Daniel

David Held retweeted

Tal Daniel @TalDaniel8

Mar 6

0:15

280

29,081

David Held

David Held @davheld

14 Nov 2025

I am filling out a fellowship application letter of recommendation, and one question is: "Applicant's achievements reflect his or her ability: * - select -Not at All; Well; Moderately Well; Very Well; Extremely Well". If I select "Well" then what does that imply?

3,640

David Held

David Held @davheld

7 Nov 2025

What are the best slides you have seen to explain transformers to someone who has never seen them before?

2,142

Alexis Hao

David Held retweeted

Alexis Hao @hao_alexis

22 Sep 2025

Introducing FMVP: a method that adapts to natural arm motions during robot-assisted dressing. Pre-trained on vision in sim, fine-tuned with limited real-world vision force data, and tested in a 12-user, 264-trial study, FMVP is robust across garments and motions. #CoRL2025

0:25

5,401

David Held

David Held @davheld

19 Sep 2025

Check out our new CoRL paper on how to find shapes that cause failures for robot manipulation policies!

Divyam Goel @divyamgo10

19 Sep 2025

How do we discover a robot's failure modes before deploying it in the real world? Standard benchmarks often don't capture the full picture, leaving policies vulnerable to plausible variations in object shape. Thrilled that our work, "Geometric Red-Teaming for Robotic Manipulation," has been accepted as an oral presentation at #CoRL2025! We introduce a framework to automatically find these geometric blindspots. georedteam.github.io 🧵

0:06

3,534

David Held

David Held @davheld

18 Sep 2025

Check out our CoRL 2025 paper on learning from past interactions; we show, theoretically and experimentally, the benefit of using a generator-verifier paradigm!

Yishu Li @LisaYishu

18 Sep 2025

A closed door looks the same whether it pushes or pulls. Two identical-looking boxes might have different center of mass. How should robots act when a single visual observation isn't enough? Introducing HAVE 🤖, our method that reasons about past interactions online! #CORL2025

2,840

David Held

David Held @davheld

18 Sep 2025

Check out our new CoRL paper on planning in continuous action spaces by searching over sequences of object transformations!

Kallol Saha

@_ksaha

18 Sep 2025

🚨Introducing SPOT: Search over Point Cloud Object Transformations. SPOT is a combined learning-and-planning approach that searches in the space of object transformations. Website: planning-from-point-clouds.g… Paper: arxiv.org/abs/2509.04645 Code: github.com/kallol-saha/SPOT

0:29

3,745

David Held

David Held @davheld

20 Jun 2025

Check out our latest work on opening unseen articulated objects in the real world; this is a single policy that operates zero-shot on 3 robot embodiments! Key insights: 1) Large-scale sim training 2) Hierarchical imitation learning

Yufei Wang @YufeiWang25

20 Jun 2025

Introducing ArticuBot🤖at #RSS2025, in which we learn a single policy for manipulating diverse articulated objects across 3 robot embodiments in different labs, kitchens & lounges, achieved via large-scale simulation and hierarchical imitation learning. articubot.github.io 🧵

0:17

2,536

David Held

David Held @davheld

2 Jun 2025

Does anyone have a copy of the ShapeNetSem dataset? The Sshapenet website is having issues for downloading the dataset. Thanks!!

1,705

David Held

David Held @davheld

21 Mar 2025

A colleague mentioned this to me today and I just checked it: ChatGPT still cannot count the number of r's in strrrawberrrrry: chatgpt.com/share/67dd8bef-2…

1,784

Eugene Vinitsky 🦋

David Held retweeted

Eugene Vinitsky 🦋@EugeneVinitsky

9 Mar 2025

I am Jewish and a faculty. When the current administration cancels grants and attempts to deport students in the name of "protecting Jews from antisemitism" they are not speaking for me or defending me. They are using us as a totem to attack universities and immigration

165

12,517

Mohammad Nomaan Qureshi

David Held retweeted

Mohammad Nomaan Qureshi

@qunomaan

27 Jan 2025

Accepted to ICRA, 2025!🎉

Mohammad Nomaan Qureshi

@qunomaan

9 Oct 2024

1/ 🎉 How can we develop methods to generate synthetic, photorealistic data for training #AI models in robotics? We present SplatSim, a step in this direction. SplatSim is a scalable framework that generates photorealistic data for manipulation tasks using existing simulators as a physics backbone — enabling zero-shot Sim2Real policy transfer for RGB policies! 🌍 Check out our project page: splatsim.github.io and paper on arXiv arxiv.org/abs/2409.10161 With @_sparshgarg_ Francisco Yandun, @davheld, George Kantor, @abhi_silwal @CMU_Robotics

0:34

4,823

Yufei Wang

David Held retweeted

Yufei Wang @YufeiWang25

11 Dec 2024

Checkout our new NeurIPS paper on using differentiable trajectory optimization for deep RL and IL! Join our poster session at Friday 11-2. Ziyu will present this paper (he is also applying for grad schools this cycle)

Weikang Wan @weikang_wan

11 Dec 2024

#NeurIPS2024 Spotlight: Fri 11am - 2pm , West Ballro.m #6307 We present DiffTORI, which utilizes Differentiable Trajectory Optimization as policy representation for deep Reinforcement and Imitation learning, with superior performances on existing RL and IL benchmarks. 🧵 1/

4,458

Xialin He

David Held retweeted

Xialin He @Xialin_He

5 Nov 2024

Excited to share our recent work on CoRL2024, We enable quadruped robots to manipulate (flip &push) objects combined with locomotion. The whole visual policy is trained end2end by RL and controlled by MPC. More details can be found on our website: legged-manipulation.github.i…

10,227

Yishu Li

David Held retweeted

Yishu Li @LisaYishu

3 Nov 2024

[1/N] Ambiguities are common in manipulation: objects can be inherently multimodal (e.g., fully-closed doors🚪), and severe occlusions add uncertainty! This poses challenges to deterministic models. Check out FlowBotHD🤖, a history-aware diffusion handling ambiguities #CoRL2024

8,930