Skip to main content
This page covers how to update voice settings, manage audio quality, and fix common audio issues.

Quick reference

I need to…Action
Change the agent’s voiceChannels > Voice > Agent Voice → Change
Adjust voice parametersChannels > Voice > Agent Voice → settings gear
Fix a mispronunciationChannels > Voice > Response Control → Pronunciations
Update cached audioAudio Management → Edit → Sync
Change interaction styleChannels > Voice > Audio Management → Interaction style
Enable/disable barge-inChannels > Voice > Audio Management → toggle
Upload custom audioAudio Management → Upload

Changing your agent’s voice

Consider updating when your brand refreshes, customers report clarity issues, you’re expanding to new languages, or newer voice models become available.
  1. Go to Channels > Voice > Agent Voice
  2. Click Change to open the Voice Library
  3. Filter by Language, Region, and Gender
  4. Preview voices with custom text
  5. Click Select to apply
  6. Test in Agent Chat before publishing
For non-English projects, use a multilingual_v2 model to ensure proper language support.
For programmatic voice configuration, see voice classes and Add a voice.

Interaction style and barge-in

Interaction style (response latency)

Control how quickly your agent responds in Channels > Voice > Audio Management:
ModeDelayBest for
Turbo400msUltra-fast, may interrupt more
Swift1200msSimple queries
Balanced1600msMost use cases (default)
Precise2000msComplex queries needing accuracy

Barge-in

Toggle in Channels > Voice > Audio Management. Lets callers interrupt the agent mid-sentence. Enable when: using Turbo mode, callers frequently interrupt, or you want more natural conversations. Disable when: delivering complete information (legal disclaimers), background noise causes false interruptions.

Managing audio quality

Cached audio

The Audio Management tab lets you cache and optimize frequently-used audio for reduced latency and consistent quality.
  • Find utterances in Channels > Voice > Audio Management
  • Click Edit to adjust stability/clarity settings or add IPA pronunciation corrections
  • Click the sync icon to regenerate, then preview
Audio is only cached after the same TTS is generated at least twice within 24 hours. For critical phrases (greetings, transfers), generate them repeatedly or upload manually.

Custom audio uploads

Upload pre-recorded audio (WAV or MP3) for maximum control over greetings, legal disclaimers, or brand-specific moments.

Fixing pronunciations

When the agent mispronounces words:
  1. Go to Channels > Voice > Response ControlPronunciations tab
  2. Click Add pronunciation
  3. Enter the word as it appears in text
  4. Provide the IPA pronunciation (e.g., “PolyAI” → /ˈpɒli eɪ aɪ/)
  5. Test in Agent Chat
You can also use SSML for advanced control:
<break time="500ms"/>
<prosody rate="slow">Speak this slowly</prosody>

Troubleshooting

IssueLikely causeFix
Voice sounds roboticLow-quality TTSSwitch to Cartesia or ElevenLabs
Agent speaks too fastRate set too highAdjust via settings gear in Agent Voice
Agent interrupts frequentlyTurbo mode without barge-inEnable barge-in or switch to Balanced
MispronunciationsTTS doesn’t recognize wordAdd pronunciation in Response Control
High latencySlow TTS providerSwitch to Cartesia or use cached audio
Background noise interruptionsBarge-in too sensitiveDisable barge-in or adjust interaction style

Maintenance routine

  • Monthly: Listen to recent calls and identify voice quality issues
  • As needed: Add pronunciations for new terms
  • After voice changes: Regenerate cached audio
Last modified on March 29, 2026