Use Case

Generate Talking Head & UGC Videos with AI — From Your Terminal

Transform any script into a complete UGC video in one command. The pipeline handles scene splitting, AI talking heads, AI B-roll generation, TTS voiceover, crossfade assembly, animated subtitles, background music, and end-screen CTA automatically.

Best Models for This

These models are recommended for generating talking head & UGC videos. Each one brings different strengths to this workflow.

Model | Why
ugc-basic | Full pipeline (AI talking heads + AI B-roll + AI voiceover). Handles scene splitting, crossfade assembly, animated subtitles, background music, and end-screen CTA automatically.

Step-by-Step Workflow

Follow these steps to start generating talking head & UGC videos with the agent-media CLI.

1. Generate from script

Provide your script and the UGC pipeline splits it into scenes, generates TTS voiceover, creates AI talking head clips and AI B-roll, then assembles everything with crossfades, animated subtitles, background music, and an end-screen CTA.

terminal
$ agent-media ugc "Ever wonder why top founders wake up at 5am?" --sync

2. Add a face photo for talking heads

Provide a face photo to drive talking head scenes. The pipeline auto-detects the best TTS voice to match the face and uses AI lip-sync for realistic results.

terminal
$ agent-media ugc "script..." --face-url ./photo.png --sync

3. Customize style and music

Choose a subtitle style (hormozi, minimal, bold, karaoke, or clean), set the background music genre, and adjust the aspect ratio and duration to match your brand.

terminal
$ agent-media ugc "script..." --style hormozi --music chill --sync

4. Multi-scene with custom B-roll images

Supply your own images as B-roll sources. The pipeline animates them using AI video generation and weaves them into the assembled video between talking head scenes.

terminal
$ agent-media ugc "script..." --broll-images "url1,url2" --sync

5. Dub into another language

Generate the full UGC video in a different language. The pipeline translates the script, synthesizes a matching TTS voice, and re-assembles the video with localized subtitles.

terminal
$ agent-media ugc "script..." --dub-language es --sync

6. Create a reusable persona

Save a persona with a specific voice and face for consistent UGC across videos. Reference it with --persona brand in future ugc commands to maintain a unified creator identity.

terminal
$ agent-media persona create --name brand --voice ./sample.mp3 --face ./photo.png
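
The ugc commands in the steps above all share one shape: the script first, then optional flags. As a rough sketch, a small Python helper can assemble that argument list for scripting or batch runs. The helper name build_ugc_command is hypothetical, and combining several of these flags in a single call is an assumption; the steps above only show each flag on its own. The sketch prints the command rather than running it.

```python
import shlex


def build_ugc_command(script, face_url=None, style=None, music=None,
                      broll_images=None, dub_language=None, persona=None,
                      sync=True):
    """Build an argument list for `agent-media ugc` (hypothetical helper).

    Only flags that appear in the steps above are used. Whether the CLI
    accepts them together in one call is an assumption.
    """
    argv = ["agent-media", "ugc", script]
    if face_url:
        argv += ["--face-url", face_url]
    if style:
        argv += ["--style", style]
    if music:
        argv += ["--music", music]
    if broll_images:
        # The B-roll step passes images as one comma-separated value.
        argv += ["--broll-images", ",".join(broll_images)]
    if dub_language:
        argv += ["--dub-language", dub_language]
    if persona:
        argv += ["--persona", persona]
    if sync:
        argv.append("--sync")
    return argv


if __name__ == "__main__":
    cmd = build_ugc_command(
        "Ever wonder why top founders wake up at 5am?",
        style="hormozi",
        music="chill",
    )
    # shlex.join quotes the script so the line can be pasted into a shell.
    print(shlex.join(cmd))
```

Returning a list instead of a single string keeps the script text intact as one argument, so quotes or punctuation in the script do not need manual escaping if the list is later passed to subprocess.run.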

How Much Does It Cost

A 5-second UGC video costs 150 credits, covering AI talking heads, voiceover, animated subtitles, and full assembly. A 10-second video costs 300 credits, and a 15-second video costs 450 credits. The Creator plan ($39/month for 3,900 credits) lets you produce about 13 ten-second videos per month. For higher volume, Pro Plus ($129/month for 12,900 credits) covers about 43 ten-second videos per month.
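
The pricing above works out to 150 credits per 5 seconds of video. The following sketch reproduces that arithmetic so you can estimate other durations and plan sizes. The function names are hypothetical, and rounding a duration up to the next 5-second block is an assumption, since the pricing only lists 5, 10, and 15 seconds.

```python
import math

CREDITS_PER_5_SECONDS = 150  # from the pricing paragraph above


def video_cost(duration_s):
    """Credits for one video of the given length in seconds.

    Assumption: durations are billed in 5-second blocks, rounded up.
    """
    blocks = math.ceil(duration_s / 5)
    return blocks * CREDITS_PER_5_SECONDS


def videos_per_month(plan_credits, duration_s):
    """Whole videos of the given length that a monthly credit budget covers."""
    return plan_credits // video_cost(duration_s)


if __name__ == "__main__":
    print(video_cost(10))               # 300
    print(videos_per_month(3900, 10))   # 13 (Creator plan)
    print(videos_per_month(12900, 10))  # 43 (Pro Plus plan)
```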

Start generating talking head & UGC videos

Install the agent-media CLI and create your first video in under 60 seconds.

npm install -g agent-media-cli