MeetScribe

Meeting transcription with speaker diarization using remote speaches API servers. Offloads VAD, speaker embeddings, and transcription to GPU servers — the client stays lightweight with no ML dependencies.

Features

Remote processing: VAD, speaker embeddings, and transcription via speaches API (OpenAI-compatible)
Web UI: Browser-based interface with step-by-step workflow
User authentication: Login/password auth with team-scoped access
Multi-track processing: Handle video files with multiple audio tracks or individual audio files
Speaker enrollment: Register speakers with voice samples for automatic identification
Speaker diarization: Automatically separate and identify speakers without enrollment
Multi-team support: Separate speaker databases and sessions per team
Parallel transcription: Distribute chunks across multiple servers
Flexible input: Video files, audio files, directories, or glob patterns

Installation

Requirements

Python 3.12+
FFmpeg (for audio extraction and conversion)
One or more speaches API servers

FFmpeg

FFmpeg is required for processing audio and video files.

Windows:

winget install "FFmpeg (Shared)"

macOS:

brew install ffmpeg

Linux (Debian/Ubuntu):

sudo apt install ffmpeg

After installation, restart your terminal to update PATH.

Python package

uv venv
uv pip install -e ".[web]"

The [web] extra installs FastAPI, Uvicorn, Jinja2 and other web dependencies. Omit it for CLI-only usage.

Configuration

MeetScribe uses two configuration files:

File	Purpose
`.env`	Data directory path and environment settings
`data/config.yaml`	Servers, pipeline parameters, web UI settings

Quick start

# 1. Set up environment
cp .env.example .env

# 2. Set up config
cp config.example.yaml data/config.yaml
# Edit data/config.yaml — set your server URL

# 3. Initialize database and create admin user
meetscribe team create default
meetscribe user create admin --team default --admin

`.env`

Controls where MeetScribe stores its data. See .env.example.

Variable	Description	Default
`MEETSCRIBE_DATA_DIR`	Root directory for DB, logs, sessions, samples	Platform-specific (see below)
`MEETSCRIBE_TMP_DIR`	Temp files directory	`DATA_DIR/tmp`
`MEETSCRIBE_MAX_UPLOAD_SIZE`	Max upload size in bytes	`4294967296` (4 GB)

Default data directory without MEETSCRIBE_DATA_DIR:

Windows: %LOCALAPPDATA%/meetscribe
macOS: ~/Library/Application Support/meetscribe
Linux: ~/.local/share/meetscribe

Setting MEETSCRIBE_DATA_DIR=./data keeps everything in the project directory — convenient for development and debugging.

`config.yaml`

Located at MEETSCRIBE_DATA_DIR/config.yaml (by default ./data/config.yaml). All application settings in one file. See config.example.yaml for a fully documented example.

Sections:

servers — List of speaches API servers (URL + name)
vad — Voice Activity Detection: server, timeout, silence/speech thresholds
embeddings — Speaker embeddings: server, model, identification thresholds, AHC clustering parameters
transcription — Speech-to-text: servers, model, language, timeout, segment merging
web — Web UI: host, port, session TTL

Web UI

Start the web interface:

meetscribe web
meetscribe web --host 0.0.0.0 --port 8080

Host and port can also be set in config.yaml under the web section. CLI arguments take priority.

First-time setup

Create an admin user via CLI before using the web UI:

meetscribe user create admin --team default --admin

The admin can then register other users through the web UI at /register.

Workflow

The web UI guides you through a 6-step process:

Upload — Upload video or audio files
Configure — Assign speakers to tracks or enable auto-diarization
Extract — Extract speaker samples via VAD + embeddings
Samples — Review and organize extracted speaker samples
Enroll — Register speakers from samples
Transcribe — Generate transcript with speaker attribution

Access control

Each user belongs to a team
Sessions are visible only to users in the same team
Only admin users can register new users (in their own team)
Authentication uses HttpOnly cookies (works with SSE streaming)

Teams

MeetScribe supports multiple teams, each with its own set of enrolled speakers, voice samples, and sessions. This enables separate speaker databases for different projects, clients, or departments.

All commands accept -t/--team flag to specify the team (defaults to default):

meetscribe -t sales enroll "John Doe" ./samples/john/
meetscribe -t sales transcribe meeting.mp4 -o output.md
meetscribe -t sales list-speakers

Team management

meetscribe team create sales
meetscribe team list
meetscribe team delete sales

Team data is stored in teams/<name>/samples/ under the data directory. Voiceprints are stored in a shared SQLite database (meetscribe.db), scoped per team.

User management

# Create an admin user
meetscribe user create admin --team default --admin

# Create a regular user
meetscribe user create john --team sales

# List all users
meetscribe user list

# Delete a user
meetscribe user delete john

CLI Commands

`meetscribe transcribe`

Transcribe a meeting with speaker diarization:

meetscribe transcribe meeting.mp4 -o output.md --track1 "Host"
meetscribe transcribe path/to/tracks/ -o output.md --track1 "Host"
meetscribe transcribe track1.wav track2.wav -o output.md --track1 "Host"

Tracks without a --trackN assignment are diarized automatically.

Option	Description	Default
`-t, --team`	Team to use for speaker identification	`default`
`-o, --output`	Output file or directory	required
`-l, --language`	Language code (overrides config.yaml)	from config
`--trackN`	Assign speaker name to track N	diarize

`meetscribe enroll`

Register known speakers for automatic identification:

meetscribe enroll "John Doe" ./samples/john/
meetscribe -t my-team enroll "John Doe" ./samples/john/

`meetscribe extract`

Extract audio tracks from a video file:

meetscribe extract meeting.mp4 -o output_dir/

`meetscribe extract-samples`

Extract audio samples from unknown speakers for later enrollment:

meetscribe extract-samples meeting.mp4

`meetscribe list-speakers`

Show enrolled speakers:

meetscribe list-speakers
meetscribe -t my-team list-speakers

`meetscribe web`

Start the web UI server:

meetscribe web
meetscribe web --host 0.0.0.0 --port 8080

`meetscribe info`

Display data directories, configuration, and settings:

meetscribe info

Audio Input

MeetScribe supports multiple input formats:

Video files (.mp4, .mkv, .avi, .mov, .webm): audio tracks are extracted automatically
Audio files (.wav, .mp3, .flac, .ogg, .m4a): used directly as tracks
Directories: all audio files in the directory are used as tracks
Glob patterns: matched audio files are used as tracks

For video files with multiple audio tracks (e.g., track 1 = host, track 2 = guests), use --trackN to assign speaker names.

Development

uv venv
uv pip install -e ".[dev,web]"

uv run pytest
uv run ruff check src/
uv run ruff format src/
uv run mypy src/

Test suite

Unit, functional, and integration tests covering the pipeline, database, config, and web services:

uv run pytest                  # all tests
uv run pytest --cov            # with coverage
uv run pytest tests/test_models.py  # single file

CI

GitHub Actions runs on every push and PR: ruff, mypy, pytest, bandit — on Python 3.12 and 3.13.

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 32 Commits
.github/workflows		.github/workflows
docs		docs
nginx		nginx
src/meetscribe		src/meetscribe
tests		tests
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
config.example.yaml		config.example.yaml
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

MeetScribe

Features

Installation

Requirements

FFmpeg

Python package

Configuration

Quick start

.env

config.yaml

Web UI

First-time setup

Workflow

Access control

Teams

Team management

User management

CLI Commands

meetscribe transcribe

meetscribe enroll

meetscribe extract

meetscribe extract-samples

meetscribe list-speakers

meetscribe web

meetscribe info

Audio Input

Development

Test suite

CI

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

`.env`

`config.yaml`

`meetscribe transcribe`

`meetscribe enroll`

`meetscribe extract`

`meetscribe extract-samples`

`meetscribe list-speakers`

`meetscribe web`

`meetscribe info`

Packages