Providers & Models

Video Providers

All video generation models, organized by company.

Models below power Video Generation mode. Every model is available on every paid plan — billing is per generation from your credit balance. The only plan-based limit is output resolution:

  • Basic — up to 720p
  • Plus — up to 1080p
  • Pro / Team — up to 4K (where the model supports it)

For pricing, see the Pricing page.


Google Google

VEO

Video generation with up to 4K resolution.

ModelKey Features
VEO 3.1 Fast720p–4K, 4–8s
VEO 3.1Higher quality

Sub-modes: Text to Video, Start Frame, Interpolation, References. Supports 720p, 1080p, and 4K.


Alibaba Alibaba

HappyHorse

Alibaba's latest video model family, available as its own provider in Video Generation.

ModelSub-Mode
HappyHorse 1.0 T2VText to Video
HappyHorse 1.0 I2VStart Frame
HappyHorse 1.0 R2VReferences (1-9 image refs)
HappyHorse 1.0 Video EditEdit

Unique: 720p/1080p, 3-15s output duration, seeds, optional watermark. I2V uses a single first-frame image and inherits its framing. R2V accepts image references only. Video Edit accepts a 3-60s source video plus up to 5 reference images and uses up to the first 15s of the source.

Wan Video

Cost-effective video generation with the widest set of sub-modes, including video editing and continuation.

Wan 2.7 (current)

ModelSub-Mode
Wan 2.7 T2VText to Video
Wan 2.7 I2VStart Frame, Interpolation, Continuation
Wan 2.7 R2VReferences (up to 5 image or video refs)
Wan 2.7 Video EditEdit

Unique: 480p–1080p, 2–15s duration, optional audio, negative prompts, seeds. R2V accepts both image and video references; Video Edit applies instruction-based edits to an existing clip using up to 3 reference images.

Wan 2.6 (legacy)

Still available if you prefer them.

ModelSub-Mode
Wan 2.6 I2V FlashStart Frame
Wan 2.6 I2VStart Frame
Wan 2.2 KF2V FlashInterpolation
Wan 2.6 R2V FlashReferences
Wan 2.6 R2VReferences

KlingAI KlingAI

Kling

Premium video generation with motion control.

ModelKey Features
Kling V2.65–10s duration
Kling V3Latest generation, 720p/1080p/4K
Kling V3 OmniOmni reference support, 720p/1080p/4K without reference video
Kling Video O1Newest model

Unique: Motion control and motion reference input. Kling V3 supports 4K for text-to-video and image-to-video. Kling V3 Omni supports 4K for Omni-Video except reference-video jobs.


BytePlus BytePlus

SeedAnce

Video generation with synchronized audio.

ModelKey Features
SeedAnce 1.5 ProAudio-inclusive generation

OmniHuman

Human-focused video generation.

Model
OmniHuman 1.5

LTX LTX

Video generation with up to 4K and audio-driven modes.

ModelKey Features
LTX 2.3 Fast1080p–4K
LTX 2.3 ProHigher quality
LTX 2.3 Pro AudioAudio-driven video

Unique: 4K video output, audio-driven generation for music videos.


xAI xAI

Grok Video

Budget-friendly video generation.

Model
Grok Imagine Video

On this page