Even the SOTA VideoLLMs see videos in 1 fps, and you CANNOT perceive fine-grained motion 💃 with this frequency 🥲
📣 Presenting Video Parallel Scaling (VPS), an inference-time strategy that lets VideoLLMs see more frames by scaling compute in the parallel-axis 🤩