Nicholas Pfaff

Nicholas Pfaff

21 Photos and videos

Tweets

Pinned Tweet

Nicholas Pfaff

@NicholasEPfaff

Feb 11

Meet SceneSmith: An agentic system that generates entire simulation-ready environments from a single text prompt. VLM agents collaborate to build scenes with dozens of objects per room, articulated furniture, and full physics properties. We believe environment generation is no longer the bottleneck for scalable robot training and evaluation in simulation. Website: scenesmith.github.io/ 👇🧵(1/8)

1:04

569

74,377

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Jun 3

SceneSmith is now an ICML 2026 Spotlight (top 2.2%) and will be presented in Korea this summer! Meet SceneSmith: an agentic system that generates simulation-ready indoor environments from a single text prompt. New in the camera-ready: zero-shot rollouts of an externally trained robot policy inside generated SceneSmith scenes. 👇 (1/3)

1:04

161

11,579

more replies

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Jun 3

The camera-ready adds a new page on qualitative robot simulation demonstrations, plus additional baseline-control experiments, and a detailed limitations/failure analysis. Project: scenesmith.github.io/ Paper (updated): arxiv.org/abs/2602.09153 Code: github.com/nepfaff/scenesmit… Joint work with @cohnthomas43, @ZakharovSergeyN, @RickCory21, @RussTedrake (3/3)

SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes

A hierarchical agentic framework that generates simulation-ready indoor environments from natural language prompts.

scenesmith.github.io

646

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Jun 3

And one more of our teleop demos (head and external camera view) in the generated scenes. The project site has more zero-shot videos, more teleop videos, and videos of a mobile iiwa policy being evaluated in our scenes.

0:31

323

Sergey Zakharov

Nicholas Pfaff retweeted

Sergey Zakharov

@ZakharovSergeyN

Apr 29

Releasing RecGen: a collaboration between @ToyotaResearch, @toyota_europe, and @UvA_Amsterdam tackling a core 3D vision challenge: reconstructing complete multi-object scenes (parts, poses, textures, even occluded geometry) from just 1 to a few RGB-D views. Trained purely on synthetic data, RecGen achieves SOTA on real-world robotics and 6D pose benchmarks, handling occlusions, symmetry, and complex interactions. A step toward scalable, high-fidelity digital twins for robotics, and better evaluation and training of generalist policies. reconstruction-by-generation…

0:11

220

26,907

Katherine Liu

Nicholas Pfaff retweeted

Katherine Liu @robo_kat

Apr 23

Also, if you’re wondering how we generated all these cool videos from the Drake sim, check out @NicholasEPfaff’s repo github.com/nepfaff/drake-ble… as a starting point 👀

GitHub - nepfaff/drake-blender-tools: Tools for replaying Drake simulations in Blender

Tools for replaying Drake simulations in Blender. Contribute to nepfaff/drake-blender-tools development by creating an account on GitHub.

github.com

Katherine Liu @robo_kat

Apr 23

A few interesting rollouts from the Foundry-QwenVLA-2.5B multi-task model on seen tasks in sim – a 🧵. I really like behaviors that involve non-prehensile manipulation, like the little nudges in StoreCerealBoxUnderShelf.

0:22

2,767

Jean Mercat

Nicholas Pfaff retweeted

Jean Mercat @MercatJean

Apr 22

Releasing VLA Foundry: an open-source framework that unifies LLM, VLM, and VLA training in a single codebase. End-to-end control from language pretraining to action-expert fine-tuning — no more stitching together incompatible repos.

0:34

491

74,701

jenny huang

Nicholas Pfaff retweeted

jenny huang @JennyHuang99

Mar 4

🧵1/ 🤔New paper: Do LLMs Benefit from Their Own Words? In multi-turn chats, models are typically given their own past responses as context. But do their own words always help… or can they sometimes be a distraction?

172

18,428

Shivaram Kumar

Nicholas Pfaff retweeted

Shivaram Kumar @shirakuex

Feb 12

Replying to @allen_ai

This is awesome work! Curious—any plans to integrate SceneSmith-like agentic scene generation into MolmoSpaces? It feels like a natural combo: MolmoSpaces benchmark SceneSmith prompt-to-sim scenes = infinite evaluation distribution. scenesmith.github.io/

SceneSmith: Agentic Generation of Simulation-Ready Indoor Scenes

A hierarchical agentic framework that generates simulation-ready indoor environments from natural language prompts.

scenesmith.github.io

1,049

Ai2

Nicholas Pfaff retweeted

Ai2

@allen_ai

Feb 11

Introducing MolmoSpaces, a large-scale, fully open platform benchmark for embodied AI research. 🤖 230k indoor scenes, 130k object models, & 42M annotated robotic grasps—all in one ecosystem.

0:59

101

716

97,006

Ilir Aliu

Nicholas Pfaff retweeted

Ilir Aliu

@IlirAliu_

Feb 12

Agentic Generation of Simulation-Ready Indoor Scenes and Robot Test Environments. 📍 Paper AND Code: Instead of hand-building scenes in simulation, you write one prompt. SceneSmith builds the world for you. > Room layout. > Furniture. > Wall and ceiling objects. > Small movable items. Each stage is handled by a team of VLM agents: one proposes, one critiques, one coordinates. The result is not just pretty scenes, but physics-ready environments. Every object: •Metric scale •Collision geometry •Estimated mass, inertia, friction •<2% object collisions •96% stable under gravity And it exports directly to MJX, USD, SDFormat. If you train or evaluate robot policies, environment creation is usually the bottleneck. SceneSmith turns it into an on-demand layer. You can generate dozens of diverse scenes per task and automatically evaluate policies across them, with 99.7% agreement to human labels. That means: •More robust policies •Faster benchmarking •No hand-written success predicates 205 participants preferred SceneSmith scenes 92% of the time for realism and 91% for prompt faithfulness. Environment generation is no longer the slow part of robot research. If you work on sim2real, policy scaling, or automated evaluation, this is worth bookmarking and sharing with your team. 📍GitHub: scenesmith.github.io/ Paper: arxiv.org/abs/2602.09153 Code: github.com/nepfaff/scenesmit… —- Weekly robotics and AI insights. Subscribe free: 22astronauts.com

0:53

3,888

Russ Tedrake

Nicholas Pfaff retweeted

Russ Tedrake @RussTedrake

Feb 11

I've been saying for years that the biggest challenge for simulation in robotics is not actually the physics engine (although you do have to get that right). The real challenge is capturing the *diversity* of the real world. There was no doubt that generative AI had the potential to change that, but it's still amazing to see it take shape. Watching Nick's incredibly fast progress has convinced me that content generation might not actually be a bottleneck anymore. This is a beautiful combination of hardened tools for e.g. low-level mesh processing with the latest tools for generative asset creation, wrapped in a powerful agentic workflow. Please do give it a try and share your feedback.

Nicholas Pfaff

@NicholasEPfaff

Feb 11

1:04

325

38,091

Divya Tiwari

Nicholas Pfaff retweeted

Divya Tiwari

@_divyuh

Feb 11

Amazing, I used to think about such projects during grad school but the technical complexity is super high for such a plug and play level simulator. Seems like an amazing piece of work. Will try for sure later this week.

Nicholas Pfaff

@NicholasEPfaff

Feb 11

Replying to @NicholasEPfaff

SceneSmith exports to any major robotics simulator (MJX, USD, SDFormat). Here is a Rainbow RBY1 being teleoperated in our scenes. Opening cabinets, grasping mugs, navigating rooms. Third-person view (left) robot head camera (right). 🧵(7/8)

0:19

905

Adam Wei

Nicholas Pfaff retweeted

Adam Wei @adamwei_

Feb 11

We can now generate realistic and physically interactable scenes with just a text prompt! I'm super excited to see how this tool will shape the future of sim-based development and evaluation

Nicholas Pfaff

@NicholasEPfaff

Feb 11

1:04

570

Sergey Zakharov

Nicholas Pfaff retweeted

Sergey Zakharov

@ZakharovSergeyN

Feb 11

Super excited to share our latest work on 3D scene generation! SceneSmith turns natural language prompts into richly furnished, simulation-ready indoor environments—enabling robot training and evaluation at scale. Huge kudos to @NicholasEPfaff and team for the tremendous effort!

Nicholas Pfaff

@NicholasEPfaff

Feb 11

1:04

1,004

Thomas Cohn

Nicholas Pfaff retweeted

Thomas Cohn @cohnthomas43

Feb 11

We all know that being able to generate a new world at the touch of a button is nice. But the fact that you can just directly simulate these scenes and everything works is a huge boon for robotics. I can personally vouch for the quality and realism of the resulting simulations.

Nicholas Pfaff

@NicholasEPfaff

Feb 11

1:04

3,898

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Feb 11

1:04

569

74,377

more replies

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Feb 11

0:19

1,954

Nicholas Pfaff

Nicholas Pfaff

@NicholasEPfaff

Feb 11

This is an amazing collaboration with @cohnthomas43, @ZakharovSergeyN, @RickCory21, @RussTedrake 📄 arxiv.org/abs/2602.09153 💻 scenesmith.github.io/ 🔧 github.com/nepfaff/scenesmit… Explore our interactive 3D scenes on our website or download them from Hugging Face! 🧵(8/8)

0:34

1,679