What Thinking Sounds Do

Thinking Sounds (Expert Mode) play subtle audio cues, such as keyboard typing, while your agent is processing. They fill the silence during response generation and tool execution (for example, calendar bookings), so callers hear that someone is actively working on their request.

How It Works

Thinking Sounds are triggered during:
  • AI processing: While the agent generates a response
  • Tool execution: While running tools such as calendar bookings, transfers, or API calls
  • Complex reasoning: Multi-step processes that take longer
The sounds stop automatically when the agent begins speaking the response.

Configuration

Navigate to General > Sounds in your agent editor.

Enable Thinking Sounds

Toggle thinking sounds on or off:
  • Enabled: Keyboard typing sounds play during processing
  • Disabled: Silent processing (default)

Set the Delay Threshold

Use the Delay threshold slider to control when thinking sounds begin:
  • 0 ms: Sounds start immediately when the agent begins thinking
  • 750 ms: Default setting, which avoids sounds for very fast responses
  • 3000 ms: Only plays for longer processing pauses
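The threshold behaves like a simple gate: a pause shorter than the threshold passes silently, and a longer pause gets sound for the remainder. The sketch below illustrates that reading of the slider. `audible_ms` is a hypothetical helper written for this example, not a platform function, and it assumes the sound stops exactly when the agent starts speaking.

```python
def audible_ms(processing_ms: int, delay_threshold_ms: int = 750) -> int:
    """Return how long the thinking sound would be heard for one pause.

    Illustrative model only: assumes the sound starts once the pause has
    lasted `delay_threshold_ms` and stops when the agent starts speaking.
    """
    if not 0 <= delay_threshold_ms <= 3000:
        raise ValueError("delay threshold must be between 0 and 3000 ms")
    if processing_ms < 0:
        raise ValueError("processing time cannot be negative")
    return max(0, processing_ms - delay_threshold_ms)


# Compare a fast reply, a typical tool call, and a long lookup
# under the default 750 ms threshold.
for pause in (400, 1200, 4000):
    print(pause, audible_ms(pause))
```

Under this model, raising the threshold trims sound from short pauses first, which is why the 750 ms default keeps quick replies silent.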

Upload Custom Thinking Sounds

When thinking sounds are enabled, you can upload up to five custom MP3, WAV, or OGG files. If you do not upload custom files, the default keyboard typing sound is used. For each uploaded sound, you can adjust:
  • Volume: 25%, 50%, 75%, or 100%
  • Probability: 25%, 50%, 75%, or 100%
If you upload custom sounds, the default sound is disabled and the custom sounds are selected randomly based on their probability.
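How the per-sound probabilities interact is not spelled out here. One plausible reading, sketched below, treats each probability as a relative weight when picking a sound for a pause. `CustomSound` and `pick_sound` are hypothetical names for this illustration and are not part of any platform API.

```python
import random
from dataclasses import dataclass

ALLOWED_LEVELS = (25, 50, 75, 100)  # the steps offered for volume and probability


@dataclass(frozen=True)
class CustomSound:
    name: str
    volume: int       # percent
    probability: int  # percent, used here as a relative weight (assumption)

    def __post_init__(self):
        if self.volume not in ALLOWED_LEVELS or self.probability not in ALLOWED_LEVELS:
            raise ValueError("volume and probability must be 25, 50, 75, or 100")


def pick_sound(sounds, rng=random):
    """Pick one sound for a pause, weighting by each sound's probability."""
    if not 1 <= len(sounds) <= 5:
        raise ValueError("expected between one and five custom sounds")
    weights = [s.probability for s in sounds]
    return rng.choices(sounds, weights=weights, k=1)[0]


sounds = [
    CustomSound("typing.mp3", volume=75, probability=100),
    CustomSound("mouse-click.wav", volume=50, probability=25),
]
rng = random.Random(0)  # seeded so the demo is repeatable
picks = [pick_sound(sounds, rng).name for _ in range(1000)]
print(picks.count("typing.mp3"), picks.count("mouse-click.wav"))
```

With these weights the first sound should be chosen roughly four times as often as the second. Test calls remain the reliable way to confirm how the platform actually mixes them.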

Use Cases

Customer Support

Makes it feel like the agent is actively looking up information in a system

Booking & Scheduling

Provides audio feedback while processing calendar integrations

Sales Calls

Adds professionalism by sounding like the agent is taking notes

Technical Support

Simulates looking up documentation or running diagnostics

Best Practices

  • Use sparingly: Thinking sounds work best for occasional pauses. If your agent pauses to process frequently, consider combining them with Smart Filler.
  • Match your use case: Typing sounds work well for business contexts but may feel out of place for casual or entertainment applications.
  • Test the experience: Make several test calls to ensure thinking sounds enhance the conversation rather than distract from it.
  • Combine with other features: Use alongside Smart Filler for better latency management, pairing verbal acknowledgment with audio feedback.

Combining with Smart Filler

Thinking Sounds and Smart Filler complement each other:
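| Feature         | What it does                            | Best for                       |
|-----------------|-----------------------------------------|--------------------------------|
| Smart Filler    | Verbal acknowledgment (“Let me check…”) | Longer pauses (1-3+ seconds)   |
| Thinking Sounds | Audio feedback (keyboard typing)        | Shorter pauses (0.5-2 seconds) |
| Both together   | Verbal + audio feedback                 | Complex processing             |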
Recommendation: Enable both for the most natural-feeling conversations.

Technical Details

  • Default sound type: Realistic keyboard typing
  • Trigger: Automatic during processing states
  • Delay: Configurable from 0 ms to 3000 ms
  • Custom sounds: Up to 5 per agent, MP3/WAV/OGG, max 10 MB each
  • Duration: Matches actual processing time
  • Blending: Sounds fade naturally when speech begins
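The custom-sound limits above can be checked locally before uploading. The following is a minimal sketch that validates a list of files against the stated limits (five files, MP3/WAV/OGG, 10 MB each). `check_uploads` is a hypothetical helper, the check looks at file extensions only, and 10 MB is assumed to mean 10 * 1024 * 1024 bytes; the platform may count it differently.

```python
from pathlib import Path

MAX_FILES = 5
MAX_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" counted in binary megabytes
ALLOWED_EXTENSIONS = {".mp3", ".wav", ".ogg"}


def check_uploads(files):
    """Return a list of problems for (filename, size_in_bytes) pairs."""
    problems = []
    if len(files) > MAX_FILES:
        problems.append(f"too many files: {len(files)} (limit {MAX_FILES})")
    for name, size in files:
        # Extension check only; the audio data itself is not inspected.
        if Path(name).suffix.lower() not in ALLOWED_EXTENSIONS:
            problems.append(f"{name}: unsupported format")
        if size > MAX_BYTES:
            problems.append(f"{name}: larger than 10 MB")
    return problems


print(check_uploads([("typing.mp3", 2_000_000), ("rain.flac", 12_000_000)]))
```

An empty list means the batch fits the documented limits. Anything else names the file and the limit it exceeds.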

Next Steps

Smart Filler

Add verbal acknowledgments during processing

Ambient Sound

Add background atmosphere

Voice Settings

Fine-tune voice parameters

Test Your Agent

Test thinking sounds in the simulator