Voice Receptionist — Twilio Conversation Relay × GuideAnts

A phone-based AI receptionist demo. A caller dials a Twilio number; Twilio Conversation Relay handles speech-to-text and text-to-speech and streams the conversation over a WebSocket to this middleware; the middleware forwards the caller's words to a GuideAnts guide and streams the guide's reply back to Twilio, which speaks it to the caller.

Caller ⇄ Twilio number ⇄ Conversation Relay ⇄ this app (/twiml, /ws) ⇄ GuideAnts guide

What's in this project

File	Purpose
`app.py`	FastAPI server: `POST /twiml` returns the TwiML that opens the relay; `WS /ws` is the Conversation Relay message loop.
`guide_client.py`	Streams replies from the GuideAnts guide using the `openai` SDK pointed at GuideAnts' OpenAI-compatible endpoint.
`barge_in.py`	Pure logic for selective barge-in: decides whether a caller's utterance should stop the AI's reply (a stop command or a question) or let it resume (a filler, statement, or noise).
`config.py`	Loads settings from `.env`.
`.env.example`	Template for required configuration — copy to `.env`.

All of the receptionist's actual knowledge/behavior (business hours, services, tone, FAQs, etc.) lives in the guide's instructions inside GuideAnts, not in this code. This app is just the phone/WebSocket bridge.

How the call flow works

Twilio receives a call and POSTs to /twiml.

/twiml returns:

<Response>
  <Connect>
    <ConversationRelay url="wss://<your-host>/ws" welcomeGreeting="..." .../>
  </Connect>
</Response>

Twilio opens a WebSocket to /ws and sends JSON messages:
- setup — call metadata (callSid, from, to)
- prompt — voicePrompt holds the caller's transcribed speech
- interrupt — caller spoke over the AI; Twilio pauses TTS immediately and reports what they'd heard so far via utteranceUntilInterrupt
- dtmf — caller pressed a key
- error — Conversation Relay reported a problem
On each prompt, /ws sends the running chat history to the GuideAnts guide (guide_client.stream_reply) and streams the reply back to Twilio as it's generated:
```
{"type": "text", "token": "Hello", "last": false}
...
{"type": "text", "token": "", "last": true}
```
Twilio starts speaking tokens as they arrive, so the caller doesn't wait for the full reply to be generated.
On interrupt, the app pauses the reply rather than stopping it, and waits for the caller's transcribed words (the next prompt) to decide what to do: a stop command ("stop", "hold on", ...) or a question actually cancels the reply and starts a fresh one for the new words; anything else (a filler like "uh-huh", a statement, background noise) resumes the paused reply right where Twilio left off. If no prompt follows within a couple of seconds (e.g. a cough), the app resumes on its own. See ARCHITECTURE.md for the full state machine.

Setup

See SETUP.md for the complete, step-by-step guide to getting this running on your device — GuideAnts guide creation, installing dependencies, .env, Twilio account/number configuration, tunneling, and placing a call.

Manual verification without a phone call

With the server running and .env pointed at a real published guide, confirm GuideAnts is reachable directly:

curl -N http://localhost:5107/api/published/openai/<pubId>/v1/chat/completions ^
  -H "Authorization: Bearer <key-or-anonymous>" -H "Content-Type: application/json" ^
  -d "{\"model\":\"<alias>\",\"messages\":[{\"role\":\"user\",\"content\":\"hi\"}],\"stream\":true}"

You should see data: chunks containing choices[].delta.content. If this works, /ws will work too.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Voice Receptionist — Twilio Conversation Relay × GuideAnts

What's in this project

How the call flow works

Setup

Manual verification without a phone call

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.env.example		.env.example
.gitignore		.gitignore
ARCHITECTURE.md		ARCHITECTURE.md
README.md		README.md
SETUP.md		SETUP.md
app.py		app.py
barge_in.py		barge_in.py
config.py		config.py
guide_client.py		guide_client.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Voice Receptionist — Twilio Conversation Relay × GuideAnts

What's in this project

How the call flow works

Setup

Manual verification without a phone call

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages