ScenarioControl 🚗🛣️ - Scenario Generation from a single Dashcam Image 📸 or Text Prompt 💬!! Excited to introduce a new vision-language control mechanism for learned driving scenario generation. Given a single dashcam image or a scene prompt or an image, we generate a full scene layout 🧩, temporally consistent rollouts, including map 🗺️, agents 🚗, and ego video🛣️
ScenarioControl enables direct, fine-grained control over layout and traffic while preserving realism. It operates in a vectorized latent space with a new cross-global control mechanism to fuse vision-language inputs with scene structure while preserving realism. Interfaces seamlessly with generative video models!
Project:
light.princeton.edu/Scenario…
Super fun project by Lili Gao,
@Yanbo_Xu_ , William Koch, Samuele Ruffino,
@Luke22R , Behdad Chalaki, Dmitriy Rivkin, Julian Ost,
@rogg1111, Mario Bijelic.