Supported Models
agent-media CLI gives you access to 7 AI models from 5 providers through a single command. No separate API keys needed — just pick a model and generate.
Video Models
4 models for AI video generation
Kling 3.0 Pro
Kie.ai (Kuaishou)
Kling 3.0 Pro is a video generation model from Kuaishou (Kie.ai). It produces high-quality video from text prompts and supports image-to-video generation. Known for consistent motion, realistic human faces, and strong prompt adherence.
Veo 3.1
Google DeepMind
Veo 3.1 is Google DeepMind's latest video generation model. It generates high-fidelity video with strong temporal coherence and cinematic quality. Excels at complex scenes, camera movements, and photorealistic output.
Sora 2 Pro
OpenAI
Sora 2 Pro is OpenAI's video generation model. It creates detailed, imaginative video from text descriptions. Known for creative interpretation of prompts, smooth motion, and ability to handle abstract or artistic concepts.
Seedance 1.0 Pro
ByteDance
Seedance 1.0 Pro is ByteDance's video generation model focused on human motion and dance choreography. It excels at generating videos with realistic body movement, facial expressions, and dynamic poses.
Image Models
3 models for AI image generation
Flux 2 Pro
fal.ai (Black Forest Labs)
Flux 2 Pro is the flagship image generation model from Black Forest Labs, served via fal.ai. It produces high-resolution images with exceptional detail, accurate text rendering, and strong prompt adherence. One of the fastest high-quality image generators available.
Flux 2 Flex
fal.ai (Black Forest Labs)
Flux 2 Flex is the fast, cost-effective image generation model from Black Forest Labs. It trades some quality for significantly faster generation speed and lower credit cost. Ideal for rapid prototyping and bulk generation.
Grok Imagine
xAI
Grok Imagine is xAI's image generation model. It produces creative, detailed images with a distinctive artistic style. Known for imaginative interpretations and strong handling of complex, multi-element scenes.
Quick Comparison
| Model | Type | Provider | Resolutions | Durations | Input |
|---|---|---|---|---|---|
| Kling 3.0 Pro | video | Kie.ai (Kuaishou) | 720p, 1080p | 5s, 10s | Text prompt, Image + text prompt |
| Veo 3.1 | video | Google DeepMind | 720p, 1080p | 4s, 8s | Text prompt |
| Sora 2 Pro | video | OpenAI | 720p, 1080p | 4s, 8s, 12s | Text prompt |
| Seedance 1.0 Pro | video | ByteDance | 720p, 1080p | 5s, 10s | Text prompt, Image + text prompt |
| Flux 2 Pro | image | fal.ai (Black Forest Labs) | 1024x1024, 1024x768, 768x1024 | — | Text prompt |
| Flux 2 Flex | image | fal.ai (Black Forest Labs) | 1024x1024, 1024x768, 768x1024 | — | Text prompt |
| Grok Imagine | image | xAI | 1024x1024 | — | Text prompt |