LTX by Lightricksv2.3Version updatev2.3Mar 5, 2026Sharper fine details via a rebuilt latent space and updated VAE, improving textures and edge fidelity.
Better prompt understanding from a larger, upgraded text connector, reducing drift on complex prompts.
Stronger image-to-video motion with fewer frozen clips, less “Ken Burns” panning, fewer unexpected cuts, and better consistency.
Cleaner, more reliable audio from dataset filtering plus a new vocoder, with tighter alignment and fewer artifacts.
Native portrait video support up to 1080x1920, trained for vertical instead of cropped landscape.
Overview
LTX-2 is an open-source, next-generation multimodal AI model designed for video with synchronized audio and image creation as a comprehensive solution for creative workflows.
Capable of running on consumer-grade GPUs, it combines high-fidelity visuals, coherent sound, and multi-flow performance modes into a single platform.
LTX-2 has the capability to generate, enhance, and repurpose visuals more efficiently. Unlike most models, it considers sound and visuals in a unified production process for synchronized motion, dialogue, ambience, and music.
The model is designed to work with real production workflows, connecting directly with editing suites, broadcast tools, game engines, and VFX pipelines.
It supports both quick previews and delivery-ready 4K outputs. The model provides creative control through text, image, depth, and reference-video inputs, and offers multi-keyframe conditioning, 3D camera logic, and fine-tuning options.
As an open-source tool, LTX-2 enables researchers, enterprises, and independent creators to customize the model to fit their needs. Its use cases include post-production, pre-production, animation, restoration among others, offering solutions to automate motion tracking, rotoscoping, plate replacement, and other tasks, thereby reducing the time and cost of production while maintaining quality.
The upcoming releases will offer open access to the model's weights and training code. Its flexibility and customization options make it ideal for studios, research teams, and solo developers.
Supported features
Releases
Better prompt understanding from a larger, upgraded text connector, reducing drift on complex prompts.
Stronger image-to-video motion with fewer frozen clips, less “Ken Burns” panning, fewer unexpected cuts, and better consistency.
Cleaner, more reliable audio from dataset filtering plus a new vocoder, with tighter alignment and fewer artifacts.
Native portrait video support up to 1080x1920, trained for vertical instead of cropped landscape.
Other tools by Lightricks
Top alternatives
-
Turn Music & Ideas into Viral Videos In One Click
-
Create AI-generated videos with easeWE USE D-ID AT THE COLORADO VIRTUAL CREATIVE FACTORY...AND LOVE IT.
-
Transform text into captivating videos instantly.MagicLight — v1.2.1850-minute videos — Max length doubles from 30 to up to 50 minutes (plan-dependent), with characters, style, and pacing kept consistent scene to scene. Seedance 2.0 engine — ByteDance's leaderboard-topping model is now a selectable engine for physics-aware motion, cinematic camera work, and high-fidelity image-to-video. 19 AI video models, one workflow — Pick the engine per generation: Sora 2, Veo 3.1, Kling 3.0, Seedance 1.5 Pro and more, across 50+ visual styles. Nano Control — Break scenes into micro-actions with timeline-level animation planning. Voices & lip sync — Improved multi-language lip sync; narration in 30+ languages. Mobile (v1.2.18) — iOS and Android apps add Google Sign-In, image downloads in multiple formats alongside video export, and faster, more stable performance. Plus a bigger character library and clearer credit estimates before generating.
-
Create AI spokesperson videos from text
-
AI Video GenerationThey're dreaming if they think I'd give them my credit card info just for a free trial. Most useless thing ever...
-
Multi-shot video generation from text and image.


