GitHub - derezed88/samaritan-webfe: Person of Interest-themed web UI for agent-mcp with full voice I/O

samaritan-webfe

A Person of Interest-themed web front-end for the llmem-gw AI service. Streams LLM responses word-by-word in the Samaritan UI style with full voice I/O — speak to Samaritan and hear it speak back.
Explore the docs »

Report Bug · Request Feature

Table of Contents

Screenshots
About The Project
- Three Frontends, One Portfolio
- Built With
Getting Started
- Prerequisites
- Installation
Mode: Default
Mode: Cognitive
Voice I/O
Commands
Configuration
Other Frontends
Remote Access via Pinggy
Security
Developer Notes
Contributing
License
Contact
Acknowledgments

Screenshots

`#mode default` — Samaritan Voice UI

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
docs		docs
memory		memory
static		static
.env.example		.env.example
.gitignore		.gitignore
LICENSE.md		LICENSE.md
README.md		README.md
knowledge_graph_ingest.py		knowledge_graph_ingest.py
knowledge_graph_ingest_sources.py		knowledge_graph_ingest_sources.py
requirements.txt		requirements.txt
run.sh		run.sh
samaritan.py		samaritan.py
stop-watchdog-webfe.sh		stop-watchdog-webfe.sh
watchdog-webfe.sh		watchdog-webfe.sh

Frontend	Route	Description	Docs
Samaritan Voice UI	`/`	Person of Interest-themed voice interface (this README)	—
Chat	`/chat`	Claude-style scrolling chat with markdown, LaTeX, memory display	docs/CHAT.md
Chat-GED	`/chat-ged`	GED exam prep tutor with subject isolation, score tracking, Mermaid charts	docs/CHAT-GED.md

Column	Width	Content
Left data stack	220px	Goals, beliefs, prospective memory, plans (auto-scrolling cards)
Center top	flex	Dashboard with live timer table
Center bottom	flex	Chat log with input field
Right	352px	Samaritan-style countdown timer cards

Provider	Button	API Key	Audio Format	Notes
Inworld AI	`I`	`INWORLD_API_KEY`	Streaming NDJSON, base64 WAV chunks	Server proxy at `/api/tts/inworld`. Voice: `Evelyn`. Model: `inworld-tts-1.5-mini`. 44-byte RIFF header stripped per chunk.
xAI Realtime	`X`	`XAI_API_KEY`	Per-turn WebSocket	Ephemeral token minted server-side per response. Voices: Eve, Ara, Rex, Sal, Leo. Sentence-level streaming.

Provider	Model	API	Notes
Deepgram Flux	`flux-general-en`	v2 `/listen`	Default STT. Native turn detection via `TurnInfo` events. EOT threshold: 0.8. Do NOT send `language` or `punctuate` params.
Deepgram Nova-3 (Diarize)	`nova-3`	v1 `/listen`	Used in DI mic mode only. `diarize=true`, labels each speaker as `[Speaker N]: text`.

Mode	Icon	Behavior
Off	mic icon	Mic disabled in full-voice mode
Barge-in	B (red)	Incoming speech during TTS immediately stops playback — best for headphones/AirPods
Speaker	S (amber)	Mic stays on but transcripts are suppressed during TTS — safe for phone speaker use. 1500ms cooldown after audio ends.
Diarize	DI (blue/red pulse)	Deepgram nova-3 + speaker diarization; each turn prefixed with `[Speaker N]:` for multi-person conversations

Command	Mode	Effect
`#mode default`	Any	Switch to the Samaritan voice UI
`#mode cognitive`	Any	Switch to the cognitive monitoring dashboard
`#mode chat`	Default	Redirect to `/chat` (Chat UI)
`#mode chat-ged`	Default	Redirect to `/chat-ged` (GED Study UI)
`#screen_mode dark`	Default	Switch to dark theme (3s crossfade)
`#screen_mode light`	Default	Switch to light theme
`#inworld_voice <name>`	Default	Change Inworld TTS voice at runtime
`#db <name>`	Cognitive	Switch active database across all cognitive sessions

Variable	Required	Description
`SAMARITAN_API_KEY`	Yes	Access password for the web UI. Set to any strong secret string. Must not end with `!` (iOS autofill strips it).
`LLMEM_GW_API_KEY`	No	Bearer token forwarded to llmem-gw. Leave blank if llmem-gw has no key set.
`LLMEM_GW_URL`	No	Base URL of the llmem-gw service. Default: `http://localhost:8767`.
`DEEPGRAM_API_KEY`	For STT + Deepgram TTS	Used server-side for STT WebSocket proxy and TTS streaming. Never sent to browser. console.deepgram.com
`XAI_API_KEY`	For xAI voice	Used server-side to mint ephemeral WebSocket tokens. Never sent to browser. console.x.ai
`INWORLD_API_KEY`	For Inworld voice	Base64-encoded credential from Inworld Portal (Settings > API Keys). Never sent to browser.

Port	Protocol	Purpose
8800	HTTPS	Local network access (self-signed cert)
8801	HTTP	Pinggy tunnel endpoint (pinggy provides TLS)

Layer	File	Role
Browser UI	`static/index.html`	Single-file HTML/CSS/JS — all rendering, SSE parsing, TTS/STT logic
Python proxy	`samaritan.py`	FastAPI server — auth gate, API key management, stream translation

Service	Where coupled	How to swap
llmem-gw (LLM backend)	`samaritan.py` routes + `index.html` SSE parser	See Swapping the LLM Backend below
Deepgram (STT)	`samaritan.py` WebSocket proxy (`/api/stt-proxy`), `index.html` AudioWorklet	Replace proxy + browser WS client
xAI Realtime (TTS)	`samaritan.py` `/api/tts/xai`, `index.html` `ttsProviders.xai`	Implement new provider object + server route
Inworld AI (TTS)	`samaritan.py` `/api/tts/inworld`, `index.html` `ttsProviders.inworld`	Implement new provider object + server route

Event	Payload	Meaning
`tok`	`{"type":"tok","text":"..."}`	One token/word to display
`flush`	`{"type":"flush","text":"..."}`	Intermediate checkpoint (tool call done, more coming); resets TTS buffer
`done`	`{"type":"done"}`	Turn complete — trigger TTS and re-open mic
`error`	`{"type":"error","text":"..."}`	Stream error

Folders and files

Latest commit

History

Repository files navigation

samaritan-webfe

Screenshots

#mode default — Samaritan Voice UI

#mode cognitive — Live Monitoring Dashboard

About The Project

Three Frontends, One Portfolio

Built With

Getting Started

Prerequisites

Installation

#mode default — Samaritan Voice UI

Voice Input (LIVE mode)

Full-Voice Hands-Free Mode

Keyboard Mode

Idle Behaviour

#mode cognitive — Live Monitoring Dashboard

Voice I/O

TTS Providers (Text-to-Speech)

STT Providers (Speech-to-Text)

Mic Modes

Commands

Configuration

Other Frontends

Remote Access via Pinggy

Security

Developer Notes: Adapting This Frontend

Architecture Overview

Service Coupling Map

Swapping the LLM Backend

iOS Safari Notes

Contributing

License

Contact

Acknowledgments

README template

Visual design sources

Libraries & tools

AI assistance

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`#mode default` — Samaritan Voice UI

`#mode cognitive` — Live Monitoring Dashboard

`#mode default` — Samaritan Voice UI

`#mode cognitive` — Live Monitoring Dashboard

Packages