Wan 2.6 Video Generator
Wan 2.6 — Alibaba's AI Video Generator
Wan 2.6 is Alibaba's latest video model. Generate up to 15-second 1080p clips from text or an image, with multi-shot storytelling, native audio-visual sync, and reference-to-video that preserves a character's look and voice across new scenes. No install, free credits to start.


Made With Wan 2.6
Stills from Wan 2.6 clips — reference-driven characters, multi-shot sequences, lip-synced dialogue, and cinematic motion across 16:9, 9:16, and 1:1.
Why Wan 2.6
Wan 2.6 adds reference-to-video and multi-shot planning to a strong text- and image-to-video base, with audio synced frame by frame.
Reference-To-Video (R2V)
Upload a character reference that captures both appearance and voice, then prompt new scenes that keep the same look and sound, so one consistent character can appear across fresh shots.
Multi-Shot Storytelling
Wan plans shot transitions automatically and holds characters, environment, and lighting consistent across a sequence — complete narratives, not single fragments.
Native Audio-Visual Sync
Characters speak with accurate mouth shapes and timing, and the visuals stay aligned with the audio track frame by frame.
Up To 15s, 1080p, Three Aspect Ratios
Generate 5–15 second clips up to 1080p in 16:9, 9:16, or 1:1 — sized for film, vertical social, or square placements from one model.
Best Use Cases for Wan 2.6
Where Wan 2.6's reference-to-video and multi-shot planning deliver the most.
Use reference-to-video to keep a real character's look and voice while generating entirely new scenes — strong for creators and personalised content.
Generate lip-synced dialogue clips where mouth shapes and timing match the audio — useful for explainers, ads, and presenters.
Render 9:16 clips up to 15 seconds for reels and shorts, with multi-shot transitions planned automatically.
Tell a short story across several shots while Wan holds characters, lighting, and environment consistent throughout.
Wan 2.6 FAQ
What Wan 2.6 is, what it generates, and how to direct it.
Wan 2.6 is Alibaba's latest AI video model. It generates up to 15-second 1080p clips from text or image inputs, with multi-shot storytelling, native audio-visual sync, and a reference-to-video mode that preserves a character's look and voice.
Generate With Wan 2.6
Open the playground, switch to Wan 2.6, and turn your prompt or reference into a multi-shot cinematic clip with free credits.