Video Providers
All video generation models, organized by company.
The models below power Video Generation mode. Every model is available on every paid plan, and each generation is billed from your credit balance. The only plan-based limit is output resolution:
- Basic — up to 720p
- Plus — up to 1080p
- Pro / Team — up to 4K (where the model supports it)
For pricing, see the Pricing page.
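The plan limits above combine with each model's own ceiling: the output resolution cannot exceed either one. The following is a minimal sketch of that rule, not the product's actual implementation. The names `RESOLUTION_ORDER`, `PLAN_MAX_RESOLUTION`, and `effective_max_resolution` are hypothetical and exist only for illustration.

```python
# Resolution tiers in ascending order (illustrative; not an official list).
RESOLUTION_ORDER = ["480p", "720p", "1080p", "4K"]

# Plan caps as listed on this page.
PLAN_MAX_RESOLUTION = {
    "Basic": "720p",
    "Plus": "1080p",
    "Pro": "4K",
    "Team": "4K",
}


def effective_max_resolution(plan: str, model_max: str) -> str:
    """Return the lower of the plan cap and the model's own maximum."""
    plan_rank = RESOLUTION_ORDER.index(PLAN_MAX_RESOLUTION[plan])
    model_rank = RESOLUTION_ORDER.index(model_max)
    return RESOLUTION_ORDER[min(plan_rank, model_rank)]
```

For example, a Pro plan paired with a model that tops out at 1080p yields 1080p, while a Basic plan paired with a 4K-capable model yields 720p.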
Google
VEO
Video generation with up to 4K resolution.
| Model | Key Features |
|---|---|
| VEO 3.1 Fast | 720p–4K, 4–8s |
| VEO 3.1 | Higher quality |
Sub-modes: Text to Video, Start Frame, Interpolation, References. Supports 720p, 1080p, and 4K.
Alibaba
HappyHorse
Alibaba's latest video model family, available as its own provider in Video Generation.
| Model | Sub-Mode |
|---|---|
| HappyHorse 1.0 T2V | Text to Video |
| HappyHorse 1.0 I2V | Start Frame |
| HappyHorse 1.0 R2V | References (1–9 image refs) |
| HappyHorse 1.0 Video Edit | Edit |
Unique: 720p/1080p, 3–15s output duration, seeds, optional watermark. I2V uses a single first-frame image and inherits its framing. R2V accepts image references only. Video Edit accepts a 3–60s source video plus up to 5 reference images, and uses at most the first 15s of the source.
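The Video Edit limits can be checked before submitting a job. The sketch below encodes only the limits stated here (source length, reference image count, and the 15s portion that is used). `VideoEditRequest`, `check_video_edit`, and `used_source_seconds` are hypothetical names, not part of any HappyHorse or platform API.

```python
from dataclasses import dataclass
from typing import List

# Only the first 15 seconds of the source are used, per the limits above.
MAX_USED_SOURCE_S = 15


@dataclass
class VideoEditRequest:
    """Hypothetical container for the inputs of a Video Edit job."""
    source_duration_s: float
    reference_image_count: int = 0


def check_video_edit(req: VideoEditRequest) -> List[str]:
    """Return a list of problems; an empty list means the request fits the limits."""
    problems = []
    if not 3 <= req.source_duration_s <= 60:
        problems.append("source video must be between 3 and 60 seconds long")
    if not 0 <= req.reference_image_count <= 5:
        problems.append("at most 5 reference images are accepted")
    return problems


def used_source_seconds(req: VideoEditRequest) -> float:
    """Return how many seconds of the source would actually be used."""
    return min(req.source_duration_s, MAX_USED_SOURCE_S)
```

A 40-second source with two reference images passes the check, but only its first 15 seconds would be used.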
Wan Video
Cost-effective video generation with the widest set of sub-modes, including video editing and continuation.
Wan 2.7 (current)
| Model | Sub-Mode |
|---|---|
| Wan 2.7 T2V | Text to Video |
| Wan 2.7 I2V | Start Frame, Interpolation, Continuation |
| Wan 2.7 R2V | References (up to 5 image or video refs) |
| Wan 2.7 Video Edit | Edit |
Unique: 480p–1080p, 2–15s duration, optional audio, negative prompts, seeds. R2V accepts both image and video references; Video Edit applies instruction-based edits to an existing clip using up to 3 reference images.
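Because Wan 2.7 I2V covers three sub-modes, it can help to read the table in the other direction, from sub-mode to model. The lookup below is a sketch built from the table and the reference limits above. The names `WAN_27_MODEL_BY_SUBMODE`, `WAN_27_MAX_REFERENCES`, and `wan_27_model_for` are hypothetical and do not come from any Wan API.

```python
# Sub-mode to model mapping, taken from the Wan 2.7 table above.
WAN_27_MODEL_BY_SUBMODE = {
    "Text to Video": "Wan 2.7 T2V",
    "Start Frame": "Wan 2.7 I2V",
    "Interpolation": "Wan 2.7 I2V",
    "Continuation": "Wan 2.7 I2V",
    "References": "Wan 2.7 R2V",
    "Edit": "Wan 2.7 Video Edit",
}

# Reference limits stated above: R2V takes up to 5 image or video references,
# Video Edit takes up to 3 reference images.
WAN_27_MAX_REFERENCES = {
    "Wan 2.7 R2V": 5,
    "Wan 2.7 Video Edit": 3,
}


def wan_27_model_for(sub_mode: str) -> str:
    """Return the Wan 2.7 model that serves the given sub-mode."""
    try:
        return WAN_27_MODEL_BY_SUBMODE[sub_mode]
    except KeyError:
        raise ValueError(f"Wan 2.7 has no sub-mode named {sub_mode!r}") from None
```

Calling `wan_27_model_for("Continuation")` returns `"Wan 2.7 I2V"`, and an unknown sub-mode raises `ValueError`.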
Wan 2.6 (legacy)
These earlier models remain available if you prefer them.
| Model | Sub-Mode |
|---|---|
| Wan 2.6 I2V Flash | Start Frame |
| Wan 2.6 I2V | Start Frame |
| Wan 2.2 KF2V Flash | Interpolation |
| Wan 2.6 R2V Flash | References |
| Wan 2.6 R2V | References |
KlingAI
Kling
Premium video generation with motion control.
| Model | Key Features |
|---|---|
| Kling V2.6 | 5–10s duration |
| Kling V3 | Latest generation, 720p/1080p/4K |
| Kling V3 Omni | Omni reference support, 720p/1080p, plus 4K when no reference video is used |
| Kling Video O1 | Newest model |
Unique: Motion control and motion reference input. Kling V3 supports 4K for text-to-video and image-to-video. Kling V3 Omni supports 4K for Omni-Video except reference-video jobs.
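The 4K conditions for Kling can be written as a small decision function. This sketch covers only the rules stated above. The job-type strings and the function name `kling_supports_4k` are hypothetical, and returning `False` for other Kling models is an assumption, since this page does not state 4K support for them.

```python
def kling_supports_4k(model: str, job_type: str, has_reference_video: bool = False) -> bool:
    """Return True when the stated rules allow 4K output for this job.

    job_type uses illustrative labels: "text-to-video", "image-to-video",
    or "omni-video".
    """
    if model == "Kling V3":
        # 4K is stated for text-to-video and image-to-video.
        return job_type in {"text-to-video", "image-to-video"}
    if model == "Kling V3 Omni":
        # 4K is stated for Omni-Video, except jobs that use a reference video.
        return job_type == "omni-video" and not has_reference_video
    # Assumption: no 4K is documented here for the remaining Kling models.
    return False
```

Under these rules, a Kling V3 Omni job drops out of 4K eligibility as soon as a reference video is attached.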
BytePlus
SeedAnce
Video generation with synchronized audio.
| Model | Key Features |
|---|---|
| SeedAnce 1.5 Pro | Audio-inclusive generation |
OmniHuman
Human-focused video generation.
| Model |
|---|
| OmniHuman 1.5 |
LTX
Video generation with up to 4K and audio-driven modes.
| Model | Key Features |
|---|---|
| LTX 2.3 Fast | 1080p–4K |
| LTX 2.3 Pro | Higher quality |
| LTX 2.3 Pro Audio | Audio-driven video |
Unique: 4K video output, audio-driven generation for music videos.
xAI
Grok Video
Budget-friendly video generation.
| Model |
|---|
| Grok Imagine Video |