Wan NSFW
Specifications
Stable video model

Wan 2.5 NSFW

Balanced quality and speed: the everyday workhorse.

Wan 2.5 is the multimodal mid-tier model: faster than 2.6 at slightly lower fidelity, and significantly more capable than 2.2. It introduces native audio-visual generation (sound synthesised alongside the video) and a unified T2I + I2V + T2V pipeline. It is a strong choice for rapid iteration and drafts before committing to a final render in 2.6.

Technical Specifications

Output

Max duration: ~10 s
Resolution: 1080p-class
Frame rate: ~24 fps
Audio: Native A/V (sound generated with video)
Format: MP4
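
As a rough sense of scale, the output figures above imply a frame budget per clip. The sketch below only multiplies the approximate duration by the approximate frame rate. Both inputs are the rounded values listed above, and the function name is illustrative rather than part of any Wan tooling.

```python
def approx_frame_count(duration_s: float, fps: float) -> int:
    """Return the approximate number of frames in a clip of the given length."""
    return round(duration_s * fps)


# Rounded figures from the output specification above (~10 s at ~24 fps).
MAX_DURATION_S = 10
FRAME_RATE_FPS = 24

if __name__ == "__main__":
    # A full-length clip at the listed frame rate.
    print(approx_frame_count(MAX_DURATION_S, FRAME_RATE_FPS))  # 240
```

Because both figures are approximate, the result is an estimate and not a guaranteed frame count.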

Generation modes

Text-to-video (T2V)
Image-to-video (I2V)
Text-to-image (T2I), unified pipeline

Quality traits

Identity consistency: Good
Prompt adherence: Good
Motion stability: Improved vs 2.2
Audio sync: Native (experimental)

Access

Free tier
No download / install

Strengths

  • Fastest among the capable Wan video models, which makes it good for iterating on prompts before a final 2.6 run.
  • Unified T2I + I2V + T2V pipeline in a single interface.
  • Native audio-visual generation, so no separate audio step is needed.
  • Lower credit cost per generation than 2.6.

Limitations

  • Slightly lower motion coherence and identity retention than 2.6.
  • Audio generation is still experimental, and sync quality varies.
  • Clips are capped at about 10 seconds, so it cannot produce the 15-second clips that 2.6 supports.

Best For

  • Rapid iteration and draft passes before committing to Wan 2.6.
  • Scenes where audio atmosphere adds value.
  • Budget-conscious creators who need good-not-perfect results quickly.
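
The draft-then-final workflow described here can be written down as a small decision rule. The following is a minimal sketch, assuming only the duration caps quoted in this document (about 10 s for 2.5 and 15 s for 2.6). The `pick_model` function and its `final_render` flag are hypothetical helpers for illustration and do not correspond to any real Wan API.

```python
# Duration caps as quoted in this document; the 2.5 figure is approximate.
MAX_DURATION_S = {"Wan 2.5": 10, "Wan 2.6": 15}


def pick_model(duration_s: float, final_render: bool = False) -> str:
    """Suggest a model tier for a clip (hypothetical helper, not a Wan API).

    Drafts that fit within the 2.5 cap go to Wan 2.5. Final renders and
    clips longer than the 2.5 cap go to Wan 2.6.
    """
    if duration_s <= 0:
        raise ValueError("duration must be positive")
    if duration_s > MAX_DURATION_S["Wan 2.6"]:
        raise ValueError("clip exceeds the longest cap listed in this document")
    if final_render or duration_s > MAX_DURATION_S["Wan 2.5"]:
        return "Wan 2.6"
    return "Wan 2.5"


if __name__ == "__main__":
    print(pick_model(8))                     # draft pass
    print(pick_model(8, final_render=True))  # final pass of the same clip
    print(pick_model(12))                    # too long for the 2.5 cap
```

The rule deliberately ignores credit cost and audio needs; those would be additional inputs in a fuller version.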