LiveTranslate

English | 中文

Real-time audio translation for Windows. Captures system audio (WASAPI loopback) and optional microphone input, runs ASR, translates via LLM API, and displays results in a transparent overlay.

Works with any system audio — videos, livestreams, voice chat. No player modifications needed.

Screenshot

Video

Features

Real-time pipeline: System audio → VAD → ASR → LLM translation → overlay
Multiple ASR engines: faster-whisper, SenseVoice, FunASR Nano, Qwen3-ASR (GGUF)
Any OpenAI-compatible API: DeepSeek, Grok, Qwen, GPT, Ollama, vLLM, etc.
Streaming translation display: Real-time character-by-character translation output
Per-model settings: Streaming, structured output (JSON), context history, disable thinking
Microphone mix-in: Optionally mix microphone input with system audio for ASR
Low-latency VAD: 32ms chunks + Silero VAD with adaptive silence detection
Transparent overlay: Always-on-top, click-through, draggable, 14 color themes
CUDA acceleration: GPU-accelerated ASR inference
Auto model management: Setup wizard, ModelScope / HuggingFace dual sources
Built-in benchmark: Compare translation model speed and quality

Changelog

See English Changelog | 中文更新日志

Requirements

OS: Windows 10/11
Python: 3.10+
GPU (recommended): NVIDIA + CUDA 12.6 (Blackwell GPUs like RTX 50xx require CUDA 12.8)
Network: Access to a translation API

Quick Start

git clone https://github.com/TheDeathDragon/LiveTranslate.git
cd LiveTranslate

Double-click install.bat — the installer will:

Detect Python 3.10+ (auto-install via winget if missing)
Create a virtual environment
Auto-detect NVIDIA GPU and let you choose CUDA / CPU PyTorch
Install all dependencies

Then double-click start.bat to launch.

To update, double-click update.bat — it will pull the latest code and update dependencies (auto-installs Git via winget if missing).

Manual install

python -m venv .venv
.venv\Scripts\activate

# PyTorch (choose one)
pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu126  # CUDA
pip install torch torchaudio --index-url https://download.pytorch.org/whl/cu128  # CUDA (RTX 50xx)
pip install torch torchaudio --index-url https://download.pytorch.org/whl/cpu    # CPU only

# Dependencies
pip install -r requirements.txt
pip install funasr --no-deps

# Launch
.venv\Scripts\python.exe main.py

FunASR uses --no-deps because editdistance requires a C++ compiler. editdistance-s in requirements.txt is a pure-Python drop-in replacement.

First Launch

Setup wizard appears — choose download source (ModelScope / HuggingFace) and cache path
Silero VAD + SenseVoice models download automatically (~1GB)
Main UI appears when ready

Translation API

Settings → Translation tab:

Parameter	Example
API Base	`https://api.deepseek.com/v1`
API Key	Your key
Model	`deepseek-chat`
Proxy	`none` / `system` / custom URL

Architecture

Audio (WASAPI 32ms) → VAD (Silero) → ASR → LLM Translation → Overlay
         ↑ optional mic mix-in

main.py                 Entry point & pipeline
├── audio_capture.py    WASAPI loopback + mic mix-in
├── vad_processor.py    Silero VAD
├── asr_engine.py       faster-whisper backend
├── asr_sensevoice.py   SenseVoice backend
├── asr_funasr_nano.py  FunASR Nano backend
├── asr_qwen3.py        Qwen3-ASR backend (ONNX + GGUF)
├── translator.py       OpenAI-compatible client (streaming, JSON schema, context)
├── model_manager.py    Model download & cache
├── subtitle_overlay.py PyQt6 overlay
├── control_panel.py    Settings UI (7 tabs)
├── dialogs.py          Wizard, download & model config dialogs
└── benchmark.py        Translation benchmark

Acknowledgements

CapsWriter-Offline — Qwen3-ASR integration reference
Qwen3-ASR-GGUF — ONNX + GGUF inference engine
llama.cpp — GGUF runtime

Star History

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LiveTranslate

Screenshot

Video

Features

Changelog

Requirements

Quick Start

First Launch

Translation API

Architecture

Acknowledgements

Star History

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
funasr_nano		funasr_nano
i18n		i18n
qwen_asr_gguf		qwen_asr_gguf
screenshot		screenshot
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
asr_engine.py		asr_engine.py
asr_funasr_nano.py		asr_funasr_nano.py
asr_qwen3.py		asr_qwen3.py
asr_sensevoice.py		asr_sensevoice.py
audio_capture.py		audio_capture.py
benchmark.py		benchmark.py
config.yaml		config.yaml
control_panel.py		control_panel.py
dialogs.py		dialogs.py
i18n.py		i18n.py
install.bat		install.bat
install.ps1		install.ps1
log_window.py		log_window.py
main.py		main.py
model_manager.py		model_manager.py
requirements.txt		requirements.txt
start.bat		start.bat
subtitle_overlay.py		subtitle_overlay.py
subtitle_settings.py		subtitle_settings.py
subtitle_window.py		subtitle_window.py
test_audio.py		test_audio.py
translator.py		translator.py
update.bat		update.bat
vad_processor.py		vad_processor.py

Folders and files

Latest commit

History

Repository files navigation

LiveTranslate

Screenshot

Video

Features

Changelog

Requirements

Quick Start

First Launch

Translation API

Architecture

Acknowledgements

Star History

License

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages