Stable Video Diffusion v1.1
Version update: v1.1, Feb 8, 2024
Image-to-Video Generation: SVD 1.1 can generate short video clips from a single input image, producing sequences of 14 or 25 frames at customizable frame rates between 3 and 30 frames per second.
Improved Motion Consistency: The model uses a motion bucket ID of 127, a conditioning value that controls how much motion appears in the output. Holding it at this value keeps motion coherent across the generated frames, giving smoother transitions and more realistic movement.
Enhanced Resolution: SVD 1.1 generates videos at a resolution of 1024×576 pixels, providing clearer and more detailed visual outputs. 
Open-Source Accessibility: The model is available on platforms like Hugging Face, allowing users to access, experiment with, and contribute to its development.
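As a rough illustration of how the figures above fit together, the sketch below collects them in a small settings object and derives the clip length from the frame count and frame rate. The class name `ClipSettings` and its fields are hypothetical and are not part of any Stability AI API; only the numeric limits come from the feature list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ClipSettings:
    """Hypothetical container for the SVD 1.1 figures quoted in the feature list."""

    num_frames: int              # 14 or 25 frames per clip
    fps: int                     # playback rate, 3 to 30 frames per second
    motion_bucket_id: int = 127  # value quoted in the feature list
    width: int = 1024            # output width in pixels
    height: int = 576            # output height in pixels

    def __post_init__(self) -> None:
        # Reject values outside the ranges stated in the feature list.
        if self.num_frames not in (14, 25):
            raise ValueError("num_frames must be 14 or 25")
        if not 3 <= self.fps <= 30:
            raise ValueError("fps must be between 3 and 30")

    @property
    def duration_seconds(self) -> float:
        # Clip length is simply the frame count divided by the frame rate.
        return self.num_frames / self.fps


if __name__ == "__main__":
    shortest = ClipSettings(num_frames=14, fps=30)
    longest = ClipSettings(num_frames=25, fps=3)
    print(f"shortest clip: {shortest.duration_seconds:.2f} s")
    print(f"longest clip:  {longest.duration_seconds:.2f} s")
```

Under these limits, a clip runs from roughly 0.47 seconds (14 frames at 30 fps) to roughly 8.33 seconds (25 frames at 3 fps), which is what "short video clips" means in practice.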
Overview
Stable Video Diffusion is an open-source generative AI model developed by Stability AI. It is the company's first foundation model for video generation and is built on the image model Stable Diffusion.
The model is currently in a research preview stage. The code for Stable Video Diffusion is available on Stability AI's GitHub repository, and the weights required to run the model locally can be found on their Hugging Face page.
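For readers who want to try the weights locally, one possible route is the Hugging Face `diffusers` library rather than the GitHub code mentioned above. The sketch below is a minimal example under several assumptions: the repository id in `ASSUMED_REPO_ID` is not given in this listing and may require accepting a license on Hugging Face, a CUDA GPU is available, and the helper names `build_call_kwargs` and `generate_clip` are hypothetical.

```python
from typing import Any, Dict

# Assumption: repository id for the SVD 1.1 weights; it is not stated in this listing.
ASSUMED_REPO_ID = "stabilityai/stable-video-diffusion-img2vid-xt-1-1"


def build_call_kwargs(num_frames: int = 25, fps: int = 6,
                      motion_bucket_id: int = 127) -> Dict[str, Any]:
    """Collect generation settings; the fps default of 6 is an illustrative choice."""
    return {
        "num_frames": num_frames,
        "fps": fps,
        "motion_bucket_id": motion_bucket_id,
        "width": 1024,
        "height": 576,
    }


def generate_clip(image_path: str, output_path: str = "clip.mp4") -> str:
    """Generate a clip from one image. Needs torch, diffusers, and a CUDA GPU."""
    # Heavy imports are kept inside the function so the module loads without them.
    import torch
    from diffusers import StableVideoDiffusionPipeline
    from diffusers.utils import export_to_video, load_image

    pipe = StableVideoDiffusionPipeline.from_pretrained(
        ASSUMED_REPO_ID, torch_dtype=torch.float16
    )
    pipe.to("cuda")

    # Resize the conditioning image to the 1024x576 output resolution.
    image = load_image(image_path).resize((1024, 576))
    kwargs = build_call_kwargs()
    frames = pipe(image, **kwargs).frames[0]

    # Write the frames to a video file at the same frame rate used for conditioning.
    export_to_video(frames, output_path, fps=kwargs["fps"])
    return output_path
```

Calling `generate_clip("input.png")` would download the weights on first use, so expect a large download and significant GPU memory use.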
The model is adaptable to various video applications and can be fine-tuned on multi-view datasets for multi-view synthesis from a single image. According to external evaluation and user preference studies, Stable Video Diffusion performs competitively with closed models. At this stage, Stable Video Diffusion is intended exclusively for research purposes and not for real-world or commercial applications.
Feedback on safety and quality is valuable for further refining the model before its eventual release. Stable Video Diffusion is part of Stability AI's range of open-source models, which spans modalities such as image, language, audio, 3D, and code.
The company states that it is committed to amplifying human intelligence and regularly updates its models with the latest advancements. To stay updated on Stability AI's progress, users can sign up for the newsletter; those interested in commercial applications can contact the company.
Stability AI also posts updates on social media platforms such as Twitter and Instagram.