Skip to content

5h4h6y46/OpenWebTTS

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

45 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OpenWebTTS: Local Text-to-Speech Web UI

GitHub stars GitHub forks License

OpenWebTTS is the open-source, privacy-first alternative to Speechify and ElevenLabs. Run it locally, use any TTS engine, and read PDFs, Epubs and other documents without subscriptions or tracking.

image

Better than paid alternatives

  • Clean Interface: Straight to the point and no ads, simple by design, powerful if needed.
  • Accessible Design: TTS readers help with many disabilities such as dyslexia and ADHD. We welcome neurodivergence and include accessibility options everywhere possible.
  • Multiple Engine Support: Options for any type of hardware, and even cloud options if wanted.
  • Voice cloning*: With a simple 10 second wav file you can clone any voice to read for you!
  • Import anything: Most document types are supported, and URLs too!
  • Automatically skip headers and footers*: Premium feature no more!
  • Automatic OCR*: If your PDF doesn't have text, we can make some for you.
  • Offline first*: No connection neeeded.
  • Self-hostable: Take control of your data, with no feature locked away.

Features marked with an * are paid on other platforms!

Running

See BUILD.md for detailed instructions. If you know what you're doing: clone the repo, install Python dependencies with a venv and build with npm.

Browser Extension 🔊

OpenWebTTS now includes browser extensions for Chrome and Firefox that let you read any webpage aloud with word-by-word highlighting!

Features:

  • 📖 Read entire webpages or just selected text
  • 🎯 Real-time word-by-word highlighting as text is spoken
  • 🎨 Customizable highlight colors (yellow, green, blue, pink, orange)
  • ⚡ Adjustable reading speed (0.5x to 2.0x)
  • 🔄 Auto-scroll to keep reading position visible
  • 🎭 Support for all OpenWebTTS voice engines

Quick Install:

  1. Start the backend:

    python app.py
  2. Load extension:

    • Chrome: Navigate to chrome://extensions/, enable Developer mode, click "Load unpacked", select browser-extension/ folder
    • Firefox: Navigate to about:debugging#/runtime/this-firefox, click "Load Temporary Add-on", select browser-extension/manifest_firefox.json
  3. Start reading: Click the extension icon, navigate to any webpage, and click "📖 Read Page"!

See browser-extension/README.md for complete installation guide and features.

Using TTS models

Piper

  1. Use the integrated model downloader (recommended)

Or

  1. Download a Piper voice model from the official repository.
  2. Place the files inside models/piper/. For example: models/piper/en_US-lessac-medium.onnx and models/piper/en_US-lessac-medium.onnx.json.

Kokoro

  1. Use the integrated model downloader (recommended)

Or

  1. Download a model from the official repository.
  2. Place the file inside models/kokoro/. For example: models/kokoro/af_heart.pt

Coqui

Coqui downloads itself automatically with Python. Currently we only support XTTS2, with YourTTS coming soon. We don't plan on supporting every Coqui version, as it will be mostly used for voice cloning since other models have since surpassed it in regular TTS.

  1. Place the audio files for voice cloning inside models/coqui/. For example: models/coqui/my-voice.wav.

Chatterbox (WIP)

Chatterbox will use the same audio files for voice cloning as Coqui, so the proccess is the same.

About

Open source Speechify alternative. Read PDFs and EPUBs with local models.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • JavaScript 50.4%
  • Python 31.4%
  • HTML 13.8%
  • CSS 4.1%
  • Other 0.3%