TryVox

VoxML Reference

Declarative call control with JSON — TryVox's answer to TwiML.

VoxML is TryVox's declarative call control language: your webhooks return JSON that tells TryVox how to handle a call. When TryVox requests your answer_url, you respond with VoxML instructions, which TryVox executes sequentially.

VoxML Response Envelope

Every VoxML response follows this structure:

{
  "voxml_version": "1.0",
  "instructions": [
    {"verb": "Say", "text": "Hello world"},
    {"verb": "Hangup"}
  ]
}
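
As a rough illustration, the envelope can be built and serialized in Python with only the standard library. The helper name `build_voxml` is hypothetical and not part of any TryVox SDK; only the `voxml_version` and `instructions` fields and the `Say` and `Hangup` verbs come from the structure shown above.

```python
import json


def build_voxml(instructions, version="1.0"):
    """Wrap a list of instruction dicts in the VoxML response envelope.

    Hypothetical helper for illustration; not a TryVox API.
    """
    return {"voxml_version": version, "instructions": list(instructions)}


# Same response as the example above: speak a greeting, then end the call.
response = build_voxml([
    {"verb": "Say", "text": "Hello world"},
    {"verb": "Hangup"},
])

# Serialize to the JSON body a webhook would return.
body = json.dumps(response, indent=2)
print(body)
```

Keeping the envelope in one helper means each webhook only has to decide which instructions to return.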

Available Verbs

VoxML supports 11 verbs for controlling call flow:

Verb          Description
Say           Convert text to speech on the call
Play          Play an audio file on the call
Gather        Collect DTMF digits or speech input from the caller
Dial          Connect the call to another party
Record        Record audio from the caller
Stream        Stream live call audio to a WebSocket endpoint
Conference    Join the caller into a named conference room
Redirect      Fetch new VoxML from a different URL
Pause         Pause execution for a specified duration
Hangup        End the call
Reject        Reject an incoming call without answering
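
One possible use of this list is a pre-flight check that catches misspelled verbs before a response is sent. The sketch below is an assumption-laden example: `VOXML_VERBS` and `unknown_verbs` are hypothetical names, and treating verb names as case-sensitive is a guess based on the capitalization used above.

```python
# The 11 verbs listed in the table above.
VOXML_VERBS = frozenset({
    "Say", "Play", "Gather", "Dial", "Record", "Stream",
    "Conference", "Redirect", "Pause", "Hangup", "Reject",
})


def unknown_verbs(response):
    """Return verb names in a VoxML response that are not in VOXML_VERBS.

    Hypothetical helper; assumes verb names are case-sensitive.
    """
    return [
        instr.get("verb")
        for instr in response.get("instructions", [])
        if instr.get("verb") not in VOXML_VERBS
    ]


good = {"voxml_version": "1.0",
        "instructions": [{"verb": "Say", "text": "Hello world"},
                         {"verb": "Hangup"}]}
bad = {"voxml_version": "1.0",
       "instructions": [{"verb": "Speak", "text": "Hello world"}]}

print(unknown_verbs(good))  # no unknown verbs
print(unknown_verbs(bad))   # flags "Speak"
```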

Execution Model

Instructions are executed sequentially from top to bottom. Some verbs, such as Gather and Dial, can interrupt this flow by making callbacks to your server, which lets you return new VoxML instructions dynamically.
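
The model described above can be pictured with a small simulation. This is a mental model only, not TryVox's implementation: the `simulate` function and the `on_callback` parameter are hypothetical, the sketch assumes that VoxML returned from a callback replaces any remaining instructions, and it assumes that Hangup and Reject stop processing. Verb parameters for Gather and Dial are omitted because they are not listed here.

```python
INTERRUPTING_VERBS = {"Gather", "Dial"}  # named above as verbs that can call back
TERMINAL_VERBS = {"Hangup", "Reject"}    # assumption: these stop processing


def simulate(instructions, on_callback, max_steps=50):
    """Walk instructions in order and return the verbs that were 'executed'.

    on_callback(instruction) stands in for your server: it returns a new
    list of instructions to switch to, or None to continue with the next one.
    """
    executed = []
    queue = list(instructions)
    while queue and len(executed) < max_steps:
        instr = queue.pop(0)
        executed.append(instr["verb"])
        if instr["verb"] in TERMINAL_VERBS:
            break
        if instr["verb"] in INTERRUPTING_VERBS:
            replacement = on_callback(instr)
            if replacement is not None:
                # Assumption: new VoxML replaces whatever remained.
                queue = list(replacement)
    return executed


flow = [
    {"verb": "Say", "text": "Press 1 for sales"},
    {"verb": "Gather"},
    {"verb": "Say", "text": "No input received"},
    {"verb": "Hangup"},
]


def server(instr):
    """Pretend server: answers a Gather callback with a new flow."""
    if instr["verb"] == "Gather":
        return [{"verb": "Dial"}, {"verb": "Hangup"}]
    return None


print(simulate(flow, server))             # ['Say', 'Gather', 'Dial', 'Hangup']
print(simulate(flow, lambda instr: None)) # ['Say', 'Gather', 'Say', 'Hangup']
```

In the first run the Gather callback returns new instructions, so the trailing Say is never reached. In the second run no new instructions arrive and the original list continues to the end.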
