- Leo Grundström
- Posts
- The right way to generate AI voiceovers
The right way to generate AI voiceovers
Most people don’t know how to generate quality voice-overs for their videos…
And honestly, it's costing them thousands in revenue they don't even know they're missing
to make it simple fo you:
Bad voiceover = people click off in 30 seconds
Good voiceover = they watch the entire video and come back for more
So in this email I'm going to show you exactly how to generate voices that sound 100% human using ElevenLabs
This is the exact process channels making $30K+ per month are using right now
Step 1: Use the right platform
ElevenLabs is by far the most realistic AI voice generator you can find
It's also the cheapest
So it's a no-brainer
Step 2: Find the right voice type
the wrong voice will kill your retention
you need a voice that matches your content's vibe
so for example if you're doing sleep content, you'll need a calm voice
and here’s how you can find it inside of eleven labs
Go to the "Voices" tab first
Then use these exact filters:
Search for "storytelling" as your primary filter
Select "male" (for sleep/history content)
Choose "middle-aged" for authority and trust
Add "narrative" tag for story-telling capability
Set language to English with any accent
You want something very calm, something for storytelling
Someone that is middle-aged would be good
Step 3: Always filter for "highest quality" voices
this is important
if you want to get the highest quality voices and the best voices,
you need to highlight the "highest quality" filter
They cost slightly more per word but they're worth it
They sound a lot better and actually provide a lot better feeling for your audience
because:
Better viewer experience = better retention = more money
Step 4: Use the latest model
Always use the latest model that they have
Right now that's Eleven V3
Before it was Eleven multilingual V2, but use Eleven V3 to get the best results
Step 5: Generate in small chunks (this saves you money)
don’t generate an entire 2-hour script at once
don't even generate 3,000 characters at once
and here’s why:
If the AI pronounces something weird in a long generation, you have to regenerate everything
This burns through your credits quickly and wastes both money and time
so you don’t want to do that
Instead, you can generate in 500 words,
this way you can test each section before moving on
and you can catch and fix errors immediately
Step 6: Generate the voice overs for 99% cheaper (I’m actually not kidding)
I don’t wanna saturate the tool and risk it being removed, so I’ll be sharing it with a few selected people in one of my upcoming YouTube uploads
if you want to find out how to use ElevenLabs at a 99% cheaper rate, stay tuned on my YouTube, it’s coming
that's all I got for you at the moment
this is the exact voice generation process channels making $30,000 per month are using
and this is only about 15% of what it actually takes to launch a $30K/month faceless YouTube channel
you still need to:
know how to find viral topics
write scripts that keep people watching
and how to hire the right team
that's why I put together a full 2.5-hour free course that walks you through the entire process in the meantime
It's the same system students are using to hit their first $10K months
Leo
Reply