๐๐ซ๐๐๐ฆ๐๐๐ซ๐จ ๐ข๐ฌ #๐ ๐จ๐ง ๐๐จ๐ญ๐ก ๐๐จ๐ฅ๐ฆ๐จ๐๐ฉ๐๐๐๐ฌ ๐๐ง๐ ๐๐จ๐๐จ๐๐ซ๐๐ง๐ ๐
๐ช๐ต๐ฎ๐ ๐บ๐ฎ๐ธ๐ฒ๐ ๐๐ต๐ถ๐ ๐ป๐ผ๐๐ฎ๐ฏ๐น๐ฒ: DreamZero-DROID is trained ๐๐๐๐ ๐ ๐๐๐๐ก๐โ using only the DROID dataset. No pretraining on large-scale robot data, unlike competing VLAs. This demonstrates the strength of video-model backbones for generalist robot policies (VAMs/WAMs).
More broadly, training ๐๐๐๐ฆ on real data and evaluating on (1) transparent, distributed benchmarks like ๐๐จ๐๐จ๐๐ซ๐๐ง๐ or (2) scalable sim-benchmarks like ๐๐จ๐ฅ๐ฆ๐จ๐๐ฉ๐๐๐๐ฌ is an exciting step toward fairer and more reproducible evaluation of generalist policies, one that the community can hillclimb together to measure progress.
Special thanks to the Ai2 MolmoSpaces team (
@notmahi @omarrayyann @YejinKim4 Max Argus) and the RoboArena team (
@pranav_atreya) for helping with the set-up and getting these evaluations! Special shout out to
@youliangtan @NadunRanawakaA @chuning_zhu, who led these efforts from the GEAR side :)
We also release our DreamZero-AgiBot checkpoint & post-training code to enable very efficient few-shot adaptation. Post-train on just ~30 minutes of play data for your specific robot, and see the robot do basic language following and pick-and-place ๐ค(See YAM experiments in our paper for more detail).
We also provide the entire codebase & preprocessed dataset to replicate the DreamZero-DROID checkpoint.
๐
dreamzero0.github.io
๐ป
github.com/dreamzero0/dreamzโฆ
RoboArena:
robo-arena.github.io/leaderbโฆ
MolmoSpaces:
molmospaces.allen.ai/leaderbโฆ