Skip to main content
Natural voice experiences depend on timing. If the agent speaks too quickly, customers feel interrupted. If it waits too long, the call feels uncertain or slow. This page explains the main controls that shape timing and turn-taking.
Timing is only one layer of the caller experience. If the conversation still feels wrong after timing changes, step back and review the full AI Pipeline Guide.

The Four Controls

ControlWhat it affectsMain doc
Greeting timingWhen the first message starts and whether callers can interrupt itGreeting Messages
VAD / turn detectionWhen the platform decides the caller has finished speakingVAD & Turn Detection
Inactivity handlingWhat happens when nobody speaks for a whileInactivity Timeout Settings
Ambient soundHow pauses feel between spoken turnsAmbient Sound

1. Greeting Timing

The first few seconds set the tone for the whole call. Use greeting timing to control:
  • how long the agent waits before speaking
  • whether the caller can interrupt the opening line
  • whether inbound and outbound greetings behave differently
  • short initial delay
  • non-interruptible greeting for most inbound use cases
  • outbound-specific override only when proactive calls need a different opening

2. VAD And Turn Detection

VAD decides when the caller has stopped speaking and the agent should respond. This strongly affects:
  • perceived responsiveness
  • interruption behavior
  • whether the agent cuts people off
  • whether the agent waits too long after short pauses

When to tighten it

  • the agent waits too long after obvious answers
  • callers say the assistant feels slow

When to loosen it

  • the agent interrupts mid-thought
  • callers pause naturally before finishing
  • the call includes number sequences or careful explanations

3. Inactivity Handling

Inactivity settings control what happens when the call goes quiet. Use them to define:
  • how long the platform waits
  • whether it should reprompt
  • when it should end the conversation
This matters most for:
  • outbound calls where the recipient does not answer clearly
  • long pauses during support or booking flows
  • calls where the customer may step away temporarily

4. Ambient Sound

Ambient sound does not change turn detection directly, but it changes how pauses feel. Used carefully, it can make pauses feel:
  • intentional
  • less sterile
  • more natural
Used badly, it can make the call feel:
  • noisy
  • distracting
  • less professional
For regulated or clarity-critical use cases, less is usually better.

How These Controls Work Together

Symptoms And Likely Fixes

SymptomMost likely area to review
Agent starts talking too soonGreeting delay or VAD sensitivity
Agent cuts callers offVAD / turn detection
Agent feels slow after short answersVAD / pause timing
Calls feel awkward during silenceInactivity settings
Pauses feel sterile or abruptAmbient sound and overall pacing

How Model And Transcriber Choices Affect Timing

One settings panel does not control all timing. Your experience also depends on:
  • the speech model and voice choice
  • transcriber behavior
  • how much work the agent is doing during the turn
  • whether tools or knowledge retrieval add extra latency
If a conversation feels slow, do not only change one timing slider. Also check:

A Practical Tuning Loop

  1. Start with default timing.
  2. Run a short Web Simulator test.
  3. Run at least one real Phone Test.
  4. Listen for interruptions, long pauses, and awkward silence.
  5. Change one timing variable at a time.
  6. Test again with the same scenario.

Common Mistakes

If you change greeting delay, VAD, inactivity, and ambient sound together, it becomes hard to know what actually improved or broke the experience.
Ambient sound can improve feel, but it does not fix slow tools, slow retrieval, or slow models.
Browser tests are useful, but phone calls expose timing issues more clearly, especially around greeting pace and interruption behavior.

Next Steps

Greeting Messages

Configure the opening experience

VAD & Turn Detection

Fine-tune how the platform detects completed speech

Inactivity Timeout Settings

Decide what happens during silence

Ambient Sound

Adjust the feel of pauses and background atmosphere