How to Add Emotion to Text to Speech: Step-by-Step Guide
Make AI voices sound natural, expressive, and engaging with emotion tags
Quick Answer
To add emotion to text to speech: (1) Use a TTS tool with emotional control — SpeechGeneration AI's Studio+ tier or ElevenLabs v3. (2) Insert emotion tags in brackets like [excited] or [whisper]. (3) Generate and preview. (4) Adjust tags and regenerate until the delivery matches your vision. SG.ai accepts any bracketed emotion — not limited to a fixed list. 10,000 free characters to try.
What You Need Before Starting
Prerequisites for this tutorial
- A TTS tool with emotional control — SG.ai Studio+ recommended
- Your script or text (up to 5,000 characters per generation)
- Understanding of which emotions fit your content type and audience
Important: Emotion tags only work on Studio+ (2×) and Performance (1×) voices in SG.ai. Economy and Studio tiers generate natural speech but ignore emotion tags.
How Emotion Tags Work in Text to Speech
The concept in three sentences
Tags are bracketed words placed inline: [excited] We just hit a million subscribers!
The AI adjusts tone, pitch, pacing, and delivery based on the tag — no manual audio editing required.
Tags affect everything after them until the next tag or end of generation. SG.ai's system accepts any emotion — not limited to a fixed list.
Three Ways to Control TTS Emotion
| Method | Tool | Syntax | Complexity |
|---|---|---|---|
| Bracket tags | SG.ai, ElevenLabs v3 | [excited] Text here | Simple |
| SSML express-as | Azure, Google | <mstts:express-as style="cheerful"> | Complex (XML) |
| Natural language | Hume Octave | "Say this excitedly" | Medium |
This guide focuses on bracket tags — the simplest and most widely supported method.
5 Steps to Add Emotion to Your Voiceover
Follow this workflow for expressive AI audio every time
Write Your Script with Emotion in Mind
- Plan emotional beats before adding tags
- Map script sections: opening hook (excited), explanation (calm), key point (serious), CTA (enthusiastic)
- Write shorter sentences for emotional sections
Choose Your Voice and Tier
- Select Studio+ (2×) or Performance (1×) — only these support emotion tags
- Match voice to content: authoritative for tutorials, warm for storytelling
- Test 2–3 voices with a sample paragraph before committing
Insert Emotion Tags
- Place tags in brackets before the text they affect: [excited], [calm], [serious], [whisper], [laugh], [sad], [angry], [cheerful], [sarcastic], [nervous], [surprised], [thoughtful], [dramatic], [gentle], [urgent]
- Non-verbal cues: [sigh], [pause], [laughs], [gasps]
- Tags stay active until the next tag or end of generation
Generate and Preview
- Generate and listen to the full output before adjusting
- Check: smooth transitions, good pacing, natural breaks
- If something sounds off, adjust one tag at a time and regenerate
Refine and Export
- Move tags earlier or later for different effects
- Combine tags with punctuation: [whisper] Listen carefully... (ellipsis adds pause)
- Export as MP3 for web or WAV for production
- Pro tip: generate the same text with 2–3 emotion combos and compare
Before & After: See the Difference Emotion Tags Make
Four real examples showing the transformation
YouTube Video Intro
Without Tags
Welcome back to the channel. Today we're looking at the top five budget laptops for 2026.
With Tags
[excited] Welcome back to the channel! [calm] Today we're looking at the top five budget laptops for 2026.
Podcast Ad Read
Without Tags
This episode is brought to you by CloudHost. Get 50% off your first three months with code PODCAST50.
With Tags
[friendly] This episode is brought to you by CloudHost. [excited] Get 50 percent off your first three months with code PODCAST50!
E-Learning Module
Without Tags
In this module, you'll learn about data privacy regulations. These laws affect how your company handles customer information.
With Tags
[calm] In this module, you'll learn about data privacy regulations. [serious] These laws affect how your company handles customer information.
Audiobook Scene
Without Tags
She opened the letter. The words blurred as tears filled her eyes. It was over.
With Tags
[gentle] She opened the letter. [sad] The words blurred as tears filled her eyes. [whisper] It was over.
7 Common Mistakes When Using Emotion Tags
Avoid these pitfalls for consistent, expressive output
Using the wrong voice tier
Emotion tags are silently ignored on Economy and Studio tiers.
Fix: Switch to Studio+ (2×) or Performance (1×) to enable tag processing.
Misspelling tags
[exited] won't trigger the [excited] behaviour — the model sees an unknown word.
Fix: Double-check spelling before generating. Copy-paste from the reference table.
Too many tags
A tag before every sentence produces erratic, over-acted delivery.
Fix: Aim for 3–5 tags per 500 characters. Let natural speech carry the middle.
Expecting subtle emotions
[sarcastic] and similarly nuanced emotions are unreliable across voices.
Fix: Use broader emotions and reinforce meaning with script phrasing.
Long sentences after a tag
Emotional effect fades in sentences over 20 words.
Fix: Keep sentences under 20 words after each emotion tag.
No transition between contrasting emotions
Jumping from [excited] to [sad] without a break sounds jarring.
Fix: Insert [pause] between strongly contrasting emotions.
Not previewing before publishing
Tags that look correct on screen can sound wrong in audio.
Fix: Always generate and listen to the full output before exporting.
Emotion Tag Quick Reference
Grouped by category — not exhaustive
| Category | Tags | Best For |
|---|---|---|
| Energy | [excited], [enthusiastic], [urgent], [energetic] | Intros, CTAs, announcements |
| Calm | [calm], [gentle], [soothing], [relaxed] | Tutorials, meditation, stories |
| Serious | [serious], [authoritative], [professional], [stern] | News, training, compliance |
| Emotional | [sad], [melancholy], [nostalgic], [hopeful] | Storytelling, audiobooks |
| Conversational | [friendly], [warm], [casual], [cheerful] | Podcasts, social media |
| Dramatic | [dramatic], [suspenseful], [mysterious], [intense] | Trailers, fiction, horror |
| Non-verbal | [whisper], [sigh], [laugh], [pause], [gasp] | Emphasis, transitions |
Not exhaustive — SG.ai interprets any bracketed emotion. Experiment with [thoughtful], [bittersweet], [mischievous], and more.
Pro Tips for Expressive AI Voiceovers
Techniques that separate good audio from great audio
Test with the same sentence
Generate [excited], [serious], and [whisper] versions of the same line to hear how much a tag transforms delivery.
Use punctuation as amplifiers
Exclamation marks boost energy, ellipses add pauses, and em dashes create tension — all without extra tags.
Layer tags with structure
Place high-energy tags at openings and CTAs; use calm or serious tags through the informational middle.
Generate Economy drafts first
Confirm flow and phrasing at 0.1× cost before spending Studio+ credits on the final version.
Compare across voices
The same [whisper] tag sounds markedly different across voices — test on 2–3 before committing.
Emotional TTS: SpeechGeneration AI vs Alternatives
Side-by-side feature comparison
| Feature | SG.ai | ElevenLabs | Azure (SSML) |
|---|---|---|---|
| Syntax | [tag] brackets | [tag] brackets (v3) | XML express-as |
| Custom emotions | ✓ Any tag | ✓ Any tag | ~15 presets |
| Non-verbal sounds | ✓ [laugh], [sigh] | ✓ [laughs], [gasps] | Limited |
| Learning curve | Minimal | Minimal | Moderate (XML) |
| Emotion on free tier | ✓ | ✓ | N/A (pay-per-use) |
| Entry price | $5/mo | $5/mo | ~$0.016/1K chars |
Honest note: Both SG.ai and ElevenLabs offer similar bracket-tag systems. SG.ai gives 3× more characters per dollar. ElevenLabs has a larger voice library and voice cloning.
Frequently Asked Questions
Everything you need to know about emotion tags
Any bracketed emotion: [excited], [calm], [whisper], [serious], [laugh], and many more. SpeechGeneration AI accepts any bracketed emotion — you are not limited to a fixed list. Experiment with [thoughtful], [bittersweet], [mischievous], etc.
Start Creating Expressive AI Voiceovers
10,000 free characters with Studio+ emotional control. No credit card required.
No credit card required
Related Resources
Continue learning with these guides and features