Interesting comparison between our VideoPoet and other competitive models.
The comparison is incredibly helpful and reinforces my belief that VideoPoet excels in generating larger motions. We know the exact reasons for this and are working on improving single frame quality.
Google VideoPoet, Runway, Pika & Genmo
Google recently announced Video Poet.
Google's VideoPoet is a large language model (LLM) that is capable of a wide variety of video generation tasks, including:
- text-to-video
- image-to-video
- video stylization
- video inpainting and outpainting
- video-to-audio.
I tried some of their text-to-image prompts (from their demo) in Pika, Runway and Genmo. Here are the results:
10 examples
1/10
Two teddy bears holding hands, walking down rainy 5th avenue.