Imagine you told a robot to "find your car keys" in your apartment and it looked around, opened a drawer, and retrieved them for you.
As a step towards that, I adapted TiPToP to run on the RBY1 humanoid in our lab! Here's an example instruction it follows: "Put the green block on the blue plate and the yellow block on the magazine."
TiPToP helps plan over the right arm single torso joint, but it's easy to unlock more joints -- even the base wheels -- for more expressive, real-world tasks.
Humans find objects without thinking twice. One day, robots will too! 🤖
State-of-the-art robot policies often need hundreds of hours of data. What if we needed none?
Introducing TiPToP: a manipulation system that zero-shots open-world tasks from pixels and language using vision foundation models and GPU-parallelized Task and Motion Planning (TAMP).