Less than a year ago, we introduced Aleph 1.0, with the thesis that video models can become as general as language models. It was the first model of its kind, moving beyond rigid tasks like text-to-video or image-to-video, to accept combinations of image, video, and text inputs and generalize to tasks unseen during training.
Today, we're releasing Aleph 2.0, with the goal of making video editing models as powerful as possible for real-world use cases. It generates up to 30 seconds of video at 1080p and propagates edits consistently across shots. It preserves the details of the original video extremely well.
Drawing on what we learned from 1.0, we've become more opinionated about what the right interface for a video editing model looks like. You can now preview edits on a single frame before generating the entire video, which makes for a much more interactive and controllable editing experience.
We hope you have as much fun with it as we've had these past weeks.
Aleph 2.0 is here. Now you can edit a single frame in your video and preview the change; Aleph 2.0 then carries that edit across the rest of your video.
Try it now in the new Edit Studio on web at the link below.