Blog/

How to Create UGC Videos with AI in 60 Seconds

User-generated content is the number one format for social media ads. It converts better than polished studio content because it feels authentic — a real person talking to camera about a product they use. The problem is that producing UGC at scale is expensive and slow. agent-media solves this with a fully automated UGC pipeline that takes you from a script to a finished, ready-to-post video in a single command. No video editors, no actors, no studio time.

What is the UGC pipeline?

The UGC pipeline is agent-media's core product. It automates the entire process of creating a talking-head video with B-roll, voiceover, subtitles, and music. When you run a single command, the pipeline executes seven steps in sequence:

  1. 1Script splitting. Your script is analyzed and broken into individual scenes, each with its own visual direction.
  2. 2TTS voiceover. Each scene gets a natural-sounding text-to-speech voiceover with accurate timing.
  3. 3AI talking heads. A lip-synced talking head is generated for each scene using your face photo or an AI avatar.
  4. 4B-roll generation. AI-generated B-roll clips are created to intercut with talking head footage.
  5. 5Assembly with crossfades. All clips are assembled into a single timeline with smooth transitions between scenes.
  6. 6Animated subtitles. Word-level synced subtitles are overlaid in your chosen style.
  7. 7Music + CTA overlay. Background music is mixed in and a call-to-action overlay is added to the final frame.

The entire pipeline runs end-to-end without any manual intervention. You provide a script and a face — agent-media handles everything else.

Step 1: Write your script

Start with a short script for your UGC video. This could be a product review, a testimonial, a how-to explanation, or an ad. The pipeline works best with scripts between 3 and 5 sentences. Here is an example for a skincare product review:

"I've been using this serum for two weeks and my skin has

never looked better. It absorbs in seconds, no greasy residue.

I used to spend $200 a month on facials — this replaced all of

that for $35. If you're on the fence, just try it."

You do not need to manually break the script into scenes. The pipeline analyzes sentence boundaries and pacing to split it automatically. Each scene gets its own talking head segment and optional B-roll intercut.

Step 2: Choose a face

The UGC pipeline generates lip-synced talking head footage. You have two options for the face:

  • 1.Your own photo. Provide a clear, front-facing headshot with good lighting. The pipeline will generate a realistic talking head that matches the person in the photo, with accurate lip movements synced to the voiceover.
  • 2.AI-generated avatar. If you do not have a face photo or want variety, the pipeline can generate an AI avatar. Skip the --face flag and the pipeline will create one for you.

The talking head output is not a static image with an animated mouth. It is a full video with natural head movements, blinking, and expression changes that match the tone of the voiceover.

Step 3: Run the pipeline

With your script and face photo ready, run the UGC pipeline with a single command. The --style flag sets the subtitle style, and --sync waits for the pipeline to finish before returning.

$ agent-media ugc \

-s "I've been using this serum for two weeks and my skin has never looked better. It absorbs in seconds, no greasy residue. I used to spend $200 a month on facials — this replaced all of that for $35. If you're on the fence, just try it." \

--face photo.jpg \

--style hormozi \

--sync

Here is what you see in your terminal as the pipeline runs:

Splitting script into 4 scenes...

Generating TTS voiceover (11.2s total)...

Generating talking heads (4 scenes)...

Generating B-roll clips...

Assembling timeline with crossfades...

Rendering subtitles (hormozi style)...

Adding background music + CTA overlay...

Completed in 58s | 330 credits

https://ppwvarkmpffljlqxkjux.supabase.co/storage/v1/.../ugc-output.mp4

Under a minute from script to finished video. The output URL is a direct link to your MP4 file, ready to download or share.

Step 4: Choose your subtitle style

Subtitles are a critical part of UGC performance. Most viewers watch social media videos with sound off, so your subtitles need to carry the message on their own. agent-media includes five built-in subtitle styles:

--style hormozi

Alex Hormozi style. Large, bold words with animated color highlights that emphasize key phrases. High contrast, high energy. Best for direct-response ads and bold claims.

my skin has NEVER looked better

--style karaoke

Word-by-word sync where each word lights up as it is spoken. Smooth and precise. Great for music-backed content and storytelling.

I used to spend $200 a month...

--style bold

Clean, professional white text on a subtle dark background. No animations. Best for B2B content, explainers, and polished brand videos.

This replaced all of that for $35.

--style tiktok

Trendy style with colored text, playful positioning, and emoji accents. Matches the native look of popular TikTok creators.

If you're on the fence, just try it

--style minimal

Small, understated text at the bottom of the frame. Stays out of the way. Best when visuals are the focus and subtitles are just an accessibility layer.

It absorbs in seconds, no greasy residue.

You can switch styles by changing the --style flag and re-running the pipeline. The subtitle rendering step is fast, so experimenting with different looks is cheap and quick.

What you get

The output of the UGC pipeline is a single MP4 file that is ready to post. No editing required. Here is what is included in the final video:

  • *Lip-synced talking head footage with natural expressions
  • *AI-generated B-roll intercuts for visual variety
  • *Professional TTS voiceover synced to each scene
  • *Animated subtitles in your chosen style
  • *Background music mixed at the right level
  • *Call-to-action overlay on the final frame

The video is formatted in 9:16 vertical aspect ratio — the standard for TikTok, Instagram Reels, and YouTube Shorts. Download it from the output URL and upload directly to any platform.

Cost breakdown

The UGC pipeline costs 30 credits per second of output video. Here is what that looks like for each supported duration:

DurationCreditsCost
5 seconds150$1.50
10 seconds300$3.00
15 seconds450$4.50

On the Creator plan at $39/month you get 3,900 credits. That is enough for approximately 13 ten-second UGC videos per month, or 26 five-second clips, or 8 fifteen-second videos. For most social media campaigns, that is a full month of content from a single subscription.

Compare that to hiring a UGC creator at $150-500 per video. A single Creator plan subscription replaces thousands of dollars in production costs.

Ready to create your first UGC video?

One script, one command, one finished video. Go from idea to ready-to-post UGC in under 60 seconds.

$ agent-media ugc -s "your script" --face photo.jpg --sync