Original data · 2026-05-12

19 AI video models. One prompt. Same hardware.

We ran the same cinematic prompt through every AI video model in VIBE and scored the results on speed, quality, motion, and prompt adherence. Here's what we found.

The benchmark prompt

Cinematic drone shot of a dragon made of living fire flying over a frozen Nordic fjord at twilight. Slow tracking camera, golden hour, deep blue water below.

All 19 models ran the same prompt at default settings. Quality, motion, and adherence are subjective scores from 0-10 reviewed by 3 human raters. Generation times averaged over 3 runs.

🏆 Best overall quality

Sora 2 Pro

9.6/10

⚡ Fastest

LTX 2

9s

🎬 Best motion

Kling 3.0

9.4/10

🎯 Best prompt adherence

Sora 2 Pro

9.5/10

Full benchmark results

Sorted by overall quality.

RankModelTimeQualityMotionAdherenceAudio
#1Sora 2 Pro178s9.69.29.5Yes
#2Sora 292s9.28.99.3Yes
#3Google Veo 3.164s9.18.39.0Yes
#4Luma Ray Flash 252s8.78.08.1
#5Kling 3.071s8.69.48.4
#6Veo 3.1 Lite31s8.48.08.6Yes
#7Seedance 2.0124s8.49.17.9
#8Kling o378s8.38.97.6
#9WAN 2.662s8.17.79.2
#10Hailuo22s8.07.88.2
#11Vidu Q347s7.97.87.8
#12Pruna 534s7.77.57.8
#13Seedance Pro Fast28s7.68.77.3
#14Veo 3.1 Fast11s7.57.48.0Yes
#15Grok Imagine38s7.57.37.7
#16WAN 2.224s7.57.28.6
#17PixVerse 5.643s7.47.67.2
#18LTX 29s6.87.07.0
#19Happy Horse31s6.56.86.6

Per-model notes

  • Sora 2 Pro

    Q 9.6 · M 9.2 · A 9.5 · 178s

    Best overall. Cinematic composition. Camera move respected. 4K-ready.

  • Sora 2

    Q 9.2 · M 8.9 · A 9.3 · 92s

    Excellent — only a half-step behind Pro. Best quality-per-second among flagships.

  • Google Veo 3.1

    Q 9.1 · M 8.3 · A 9 · 64s

    Most photoreal output of the lineup. Audio added ambient wind cleanly.

  • Luma Ray Flash 2

    Q 8.7 · M 8 · A 8.1 · 52s

    Best atmosphere in the lineup. Lighting and mood — strongest of all 19.

  • Kling 3.0

    Q 8.6 · M 9.4 · A 8.4 · 71s

    Smoothest motion in the lineup. The dragon's flapping wings won here.

  • Veo 3.1 Lite

    Q 8.4 · M 8 · A 8.6 · 31s

    Strong middle tier. Hard to tell apart from full Veo at social sizes.

  • Seedance 2.0

    Q 8.4 · M 9.1 · A 7.9 · 124s

    Strong motion. Less optimized for atmospheric shots — better for action.

  • Kling o3

    Q 8.3 · M 8.9 · A 7.6 · 78s

    More creative interpretation. Took risks with color — sometimes pays off.

  • WAN 2.6

    Q 8.1 · M 7.7 · A 9.2 · 62s

    Highest prompt adherence score. Got the details right where others drifted.

  • Hailuo

    Q 8 · M 7.8 · A 8.2 · 22s

    Highest reliability score across re-runs. Boring but bankable.

  • Vidu Q3

    Q 7.9 · M 7.8 · A 7.8 · 47s

    Solid all-rounder. No single strength but no major weakness.

  • Pruna 5

    Q 7.7 · M 7.5 · A 7.8 · 34s

    Efficient. Clean output for the compute used.

  • Seedance Pro Fast

    Q 7.6 · M 8.7 · A 7.3 · 28s

    Quick motion-specialist tier.

  • Veo 3.1 Fast

    Q 7.5 · M 7.4 · A 8 · 11s

    Astonishing speed. Loses some atmospheric detail but composition holds.

  • Grok Imagine

    Q 7.5 · M 7.3 · A 7.7 · 38s

    Sharper, more 'online' look — better for memes than for atmospheric shots.

  • WAN 2.2

    Q 7.5 · M 7.2 · A 8.6 · 24s

    WAN's prompt adherence at faster compute.

  • PixVerse 5.6

    Q 7.4 · M 7.6 · A 7.2 · 43s

    Pushed the prompt toward a stylized look. Beautiful, but not photoreal.

  • LTX 2

    Q 6.8 · M 7 · A 7 · 9s

    Fastest in the lineup. Loses fine detail but composition is still readable.

  • Happy Horse

    Q 6.5 · M 6.8 · A 6.6 · 31s

    Not built for cinematic prompts. Shines on character / meme content instead.

Run the same prompt yourself

VIBE includes all 19 models we tested. Try the benchmark prompt — or your own — and see which model wins for your use case.

Download on the App StoreGet it on Google Play