Wan 2.5 NSFW

Balanced quality and speed — the everyday workhorse.

Wan 2.5 is the multimodal mid-tier: faster than 2.6 at slightly lower fidelity, and significantly more capable than 2.2. It introduces native audio-visual generation (sound synthesised alongside video) and a unified T2I + I2V + T2V pipeline. A strong choice for rapid iteration and drafts before committing to a 2.6 final render.

Technical Specifications

Output

Max duration	~10 s
Resolution	1080p-class
Frame rate	~24 fps
Audio	Native A/VSound generated with video
Format	MP4

Generation modes

Text-to-video (T2V)	✓
Image-to-video (I2V)	✓
Text-to-image (T2I)	✓Unified pipeline

Quality traits

Identity consistency	Good
Prompt adherence	Good
Motion stability	Improved vs 2.2
Audio sync	Native (experimental)

Access

Free tier	✓
No download / install	✓

Strengths

Fastest among the capable Wan video models — good for iterating on prompts before a final 2.6 run.
Unified T2I + I2V + T2V pipeline in a single interface.
Native audio-visual generation — no separate audio step needed.
More affordable credit cost per generation than 2.6.

Limitations

Slightly lower motion coherence and identity retention than 2.6.
Audio generation is still experimental — sync quality varies.
10-second cap — cannot produce the 15-second clips of 2.6.

Best For

Rapid iteration and draft passes before committing to Wan 2.6.
Scenes where audio atmosphere adds value.
Budget-conscious creators who need good-not-perfect results quickly.

← 全モデルの仕様モデルを使う →