🧠Memory is crucial for robots — to handle occlusions, track progress, stay coherent, etc. Yet, most VLA truncate context.
🤔Why is long-context hard for robot policies? And how can we fix it?
📄Our new paper: Learning Long-Context Diffusion Policies via Past-Token Prediction
Giving history to our robot policies is crucial to solve a variety of daily tasks. However, diffusion policies get worse when adding history. 🤖
In our recent work we learn how adding an auxiliary loss that we name Past-Token Prediction (PTP) together with cached embeddings enables us to reliably add longer history context to our robot policies! 🧠
We also show how PTP enables some test-time scaling techniques for robotics! 🚀