Overview
Wan 2.5 is the next-gen text-to-video system. It delivers sharper detail, longer and more stable shots, stronger physics, and tighter identity/style locking, with precise control over camera and motion. It supports text→video, image→video, and in-place edits, so you can iterate quickly toward production-ready clips.
Description
Wan 2.5 turns directions into coherent, cinematic footage that holds together across frames and scenes. You can start from a prompt, a style frame, or a short reference, and the model keeps faces, hands, materials, and lighting consistent while following your notes for framing, lens moves, pacing, and choreography. Temporal stability is improved, so motion reads clearly without flicker or warping, and typography and small UI elements remain legible under movement. Editing happens inside the same pipeline: you can extend a take, retime action, replace a background, or inpaint/outpaint regions without restarting the shot, so iteration feels like normal post-production. Wan 2.5 also respects reference style frames, letting teams lock a brand look or character identity across multiple shots, and it balances quick previews with high-quality renders for delivery. Used for ads, product demos, explainers, social content, and pre-viz, it pairs controllable cinematography with dependable continuity so clips cut cleanly into real production workflows.
About Alibaba
Chinese e-commerce and cloud leader behind Taobao, Tmall, and Alipay.