GEN-1 plays the 🐚 shell game, trained on just 1 hr of robot data. It also generalizes to unseen objects, like
@BerkayAntmen 's car keys.
Physical AI models should be capable of benchmark tasks like this one. It's interesting for the all the reasons
@RhodaAI calls out -- requires visual memory, and the model must track the cups from the very start, at high frame rates.
Interestingly, GEN-1 appears to exhibit a degree of "active perception." It's subtle; the hands can sometimes appear to "follow" the cups, using its own movements to help attend to where it thinks the object should be.
Read more about GEN-1 in our blog post in the comments below ↓