Cloned voices appear alongside standard voices in the voice selection panel and can be selected from the My Cloned Voices sidebar.
Creating a Cloned Voice
Open the clone form
In your agent editor, go to General → Speaking and click Clone Voice. You can also open it from the My Cloned Voices sidebar in the voice selection panel.
Choose a provider
Select a cloning provider based on your audio sample:
Choose ElevenLabs when you have a longer clean recording and want the model to learn more tone variation. Choose Cartesia when you want a fast clone from a short clean clip. If unsure, create both and compare them.Supported formats: MP3, WAV, OGG, WebM — ElevenLabs also accepts M4A.
| Provider | Best for | Duration | File size |
|---|---|---|---|
| ElevenLabs | Longer samples, more natural variation | Min 5s, recommended 1-2 min, max 180s | Up to 10 MB |
| Cartesia | Short clean clips, fast cloning | Min 3s, recommended 5-10s, max 10s | Up to 5 MB |
Provide audio
Switch between two input modes using the Upload and Record tabs:Upload — Drag and drop an audio file or click to browse. The accepted formats and size limit are shown based on the selected provider.Record — Click the record button to capture audio directly from your microphone. A live waveform visualization and duration counter are displayed while recording. Recording automatically stops when the provider’s maximum duration is reached.The form tracks the total duration of your audio and shows whether it meets the provider’s requirements.
Fill in voice details
- Voice name (required) — A descriptive name shown in the voice selection panel
- Language (required) — The language of the audio sample (English, Spanish, French, German, Italian, Portuguese, Dutch, Japanese, Korean, or Chinese)
- Description (optional) — Internal notes about this voice
- Remove background noise (optional) — Available for ElevenLabs only — cleans up noise in the sample before cloning
Managing Cloned Voices
Cloned voices are shared across your account. All team members with appropriate permissions can view and assign cloned voices to their agents.Viewing Cloned Voices
Your cloned voices appear in the My Cloned Voices sidebar within the voice selection panel. Each voice shows its name, provider, language, and current status badge. You can also view all cloned voices in a table that shows:- Name and Provider
- Language
- Status (with color-coded badges)
- Actions such as selecting or deleting the voice
Deleting a Cloned Voice
- Find the cloned voice in the sidebar or table
- Click the delete button or select Delete from the actions menu
- Confirm the deletion
Using Cloned Voices with Agents
Once a cloned voice shows a Ready status, assign it to any agent:Find your cloned voice
Look in the My Cloned Voices sidebar on the right side of the voice selection panel. Only voices with Ready status can be selected.
Select the voice
Click the cloned voice to assign it. The agent uses this voice for all subsequent conversations. You can further customize the output with voice settings.
Audio Sample Best Practices
Recording quality
Recording quality
- Use a good quality microphone (USB condenser or better)
- Record in a quiet environment with sound dampening
- Maintain a consistent distance from the microphone (6-12 inches)
- Avoid rooms with echo or reverb
- Use 44.1 kHz sample rate or higher
Content variety
Content variety
- Include both short and long sentences
- Cover different tones: questions, statements, explanations
- Read naturally at a conversational pace
- Avoid reading too fast or too slow
- Include pauses between sentences
What to avoid
What to avoid
- Background music or ambient noise
- Multiple speakers in one sample
- Heavy audio processing, compression, or filters
- Whispering or shouting
- Samples shorter than the provider’s minimum duration
- Low-quality phone recordings
Sample Strategy
Use the provider you selected to decide how much audio to collect.For ElevenLabs
- Aim for 1-2 minutes when possible
- Include varied phrasing, not one repeated sentence
- Use the Remove background noise option if the recording is otherwise good
- Prefer one speaker, one microphone, one room
For Cartesia
- Aim for a short, clean 5-10 second clip
- Do not over-record just to add more material
- Remove room noise before recording because the clone will reflect it closely
- Choose a clip with stable volume and no interruptions
Good sample script
Read 4-6 natural sentences in the way you want the agent to sound:- a greeting
- one short factual sentence
- one question
- one longer explanatory sentence
- one closing sentence
Legal Considerations
Before cloning, ensure you have:- Written consent from the voice owner
- Rights to use the voice commercially
- Clear agreement on how the voice will be used
- Documentation of the consent for your records
- Voices without consent
- Voices of public figures without licensing
- Voices for deceptive or impersonation purposes
Troubleshooting
My sample is rejected
My sample is rejected
Check the selected provider’s duration, file size, and file format limits. Most failed uploads are caused by clips that are too short, too long, or too noisy.
The cloned voice sounds noisy
The cloned voice sounds noisy
Re-record with less background noise. For ElevenLabs, try enabling Remove background noise. For Cartesia, start with a cleaner clip rather than a longer one.
The voice does not sound expressive enough
The voice does not sound expressive enough
Use a better source sample, not just a different speed or pitch setting. Add clearer phrasing variety and natural intonation, then create a new clone.
I am not sure which provider to use
I am not sure which provider to use
Start by cloning the same sample with both providers. Then compare them in the voice picker using the same test script.
Next Steps
Select Voice
Browse and compare all available voices
Voice Settings
Fine-tune speed, pitch, and stability
Custom Pronunciations
Correct pronunciation for your cloned voice
Test Your Agent
Test your cloned voice in conversations