🎧 Vakta 2.0 — Text to Audio Converter

Overview

Vakta 2.0 is a lightweight and powerful Text-to-Audio Converter built with Python.
It can take any PDF, image, or text file and turn it into an audio file — so you can listen to books, notes, or documents hands-free.

The app features a clean graphical interface, automatic text extraction (even from scanned pages), and smooth background processing.

✨ What’s New in Version 2.0

Version 2.0 adds OCR, so scanned book PDFs that have no text layer can now be converted.
🧠 OCR Support: Reads text from scanned PDFs and images using keras_ocr
🎨 Modern GUI: Built with tkinter in a dark theme
⚡ Threaded Processing: Uses Python’s threading and queue to keep the GUI responsive while converting files
🔊 Improved Audio Engine: Uses pyttsx3 for clear and natural text-to-speech output
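As a rough illustration of the OCR step, the sketch below runs keras_ocr on one image and joins the recognised words into a string. It is not taken from Vakta's source: the helper names `words_in_reading_order` and `ocr_image` and the `line_height` bucket size are assumptions made for this example. keras_ocr returns words with bounding boxes in no guaranteed reading order, so the sketch sorts them top to bottom and then left to right.

```python
def words_in_reading_order(predictions, line_height=10):
    """Join (word, box) pairs top-to-bottom, then left-to-right.

    Each box is four (x, y) corner points. Words whose top edges fall in
    the same `line_height`-pixel bucket are treated as one line. This is
    a crude heuristic (an assumption), not a layout analysis.
    """
    def sort_key(item):
        _, box = item
        top = min(point[1] for point in box)
        left = min(point[0] for point in box)
        return (int(top // line_height), left)

    return " ".join(word for word, _ in sorted(predictions, key=sort_key))


def ocr_image(path):
    """Hypothetical helper: recognise the text in a single image file."""
    import keras_ocr  # imported lazily; the models download on first use

    pipeline = keras_ocr.pipeline.Pipeline()
    image = keras_ocr.tools.read(path)
    predictions = pipeline.recognize([image])[0]  # list of (word, box)
    return words_in_reading_order(predictions)
```

A suitable `line_height` depends on the scan resolution, so expect to tune it for real pages.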

🧰 Tech Stack

| Library | Purpose |
| --- | --- |
| pdfplumber | Extracts text from text-based PDFs |
| keras_ocr | Detects and reads text from scanned PDFs or images |
| pyttsx3 | Converts text into an audio file |
| tkinter | Creates the graphical interface |
| threading & queue | Handles background tasks without freezing the GUI |
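To show how these libraries might divide the work, here is a minimal sketch that picks an extraction route from the file extension and flags PDF pages that have no text layer as candidates for OCR. The function names, the suffix sets, and the "empty text means scanned" rule are assumptions for illustration, not Vakta's actual logic.

```python
from pathlib import Path

# Assumed suffix sets; the real app may accept more or fewer types.
TEXT_SUFFIXES = {".txt"}
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def pick_extractor(path: str) -> str:
    """Return the extraction route a file would take, based on its suffix."""
    suffix = Path(path).suffix.lower()
    if suffix == ".pdf":
        return "pdf"
    if suffix in IMAGE_SUFFIXES:
        return "ocr"
    if suffix in TEXT_SUFFIXES:
        return "plain"
    raise ValueError(f"Unsupported file type: {path}")


def needs_ocr(page_text) -> bool:
    """Treat a page with no extractable text as a scanned page."""
    return not (page_text or "").strip()


def extract_pdf_pages(path: str, start: int, end: int) -> list:
    """Return (page_number, text_or_None) for 1-based pages start..end.

    None marks a page that would have to go through OCR instead.
    """
    import pdfplumber  # imported lazily so the helpers above work without it

    results = []
    with pdfplumber.open(path) as pdf:
        for number, page in enumerate(pdf.pages[start - 1:end], start=start):
            text = page.extract_text()
            results.append((number, None if needs_ocr(text) else text))
    return results
```

Checking each page separately lets a PDF that mixes typed and scanned pages use the faster pdfplumber route wherever a text layer exists.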

⚙️ Installation

You can set up Vakta 2.0 easily:

git clone https://github.com/yourusername/Vakta-2.0.git
cd Vakta-2.0
pip install pdfplumber pyttsx3 keras-ocr Pillow numpy

🧩 Note: The first time you run Vakta, keras_ocr may take a few minutes to download its models.
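If you want to confirm the install before launching the GUI, a small check like the one below can list any modules that are not importable. The helper name `missing_modules` is hypothetical. Note that the import names differ from two of the pip package names: Pillow is imported as `PIL` and keras-ocr as `keras_ocr`. tkinter is not installed through pip; it ships with most Python installers, but some Linux distributions package it separately.

```python
import importlib.util

# Import names for the dependencies listed above, plus tkinter.
REQUIRED = ["pdfplumber", "pyttsx3", "keras_ocr", "PIL", "numpy", "tkinter"]


def missing_modules(names=REQUIRED):
    """Return the subset of `names` that cannot be found for import."""
    return [name for name in names if importlib.util.find_spec(name) is None]


# Example: print(missing_modules() or "All dependencies found")
```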

🚀 How It Works

🧵 Threading and Background Tasks

Vakta 2.0 uses the threading module to handle conversions in a separate thread.
This keeps the GUI responsive while large PDFs are read or audio is generated.

When you click “Create AudioBook”, a new thread is started:

threading.Thread(
    target=converter,
    args=(book_entry.get(), start_entry.get(), end_entry.get(), audio_entry.get(), task_queue),
    daemon=True
).start()

The Queue object (task_queue) is used to safely send status updates back to the main thread, which then updates the GUI labels using:

root.after(200, poll_queue)

This pattern keeps the GUI responsive: the window keeps redrawing and showing progress while the actual work runs in the background thread.
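The worker, queue, and polling loop described above can be sketched end to end as follows. This is a simplified stand-in, not Vakta's code: `fake_converter` only reports progress and takes a page count instead of the real arguments, and `drain`, `run_demo`, and the `"DONE"` sentinel are names invented for this example. The key point is that only the main thread touches tkinter widgets; the worker communicates solely through the queue.

```python
import queue
import threading
import time


def fake_converter(total_pages: int, task_queue: queue.Queue) -> None:
    """Stand-in worker: reports progress instead of really converting."""
    for page in range(1, total_pages + 1):
        time.sleep(0.01)  # placeholder for extraction and speech synthesis
        task_queue.put(f"Processing page {page}/{total_pages}...")
    task_queue.put("DONE")  # sentinel telling the GUI to stop polling


def drain(task_queue: queue.Queue) -> list:
    """Collect every message currently waiting, without blocking."""
    messages = []
    while True:
        try:
            messages.append(task_queue.get_nowait())
        except queue.Empty:
            return messages


def run_demo() -> None:
    """Open a small window whose label follows the worker's messages."""
    import tkinter as tk  # imported here so the helpers above work headless

    root = tk.Tk()
    status = tk.Label(root, text="Idle")
    status.pack(padx=20, pady=20)
    task_queue = queue.Queue()

    def poll_queue() -> None:
        finished = False
        for message in drain(task_queue):
            if message == "DONE":
                status.config(text="Conversion finished")
                finished = True
            else:
                status.config(text=message)
        if not finished:
            root.after(200, poll_queue)  # check again in 200 ms

    threading.Thread(
        target=fake_converter, args=(4, task_queue), daemon=True
    ).start()
    root.after(200, poll_queue)
    root.mainloop()
```

Calling `run_demo()` on a machine with a display opens the window. Because `poll_queue` uses `get_nowait`, it never blocks the tkinter event loop, even when the worker is slow.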

🚀 How to Use

Run the app:
```
python main.py
```
Choose a file (PDF, image, or text).
For PDFs, enter start and end page numbers.
Type a name for your audio file.
Click Create AudioBook.
- The conversion runs in a background thread.
- You’ll see live status messages (e.g., Processing page 3/10...).

Your audio (.mp3) file will be saved in the same folder.
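The start and end page fields arrive as plain strings, so they need checking before any pages are read. A minimal sketch of that validation is shown below; the function name `parse_page_range` and the error messages are assumptions, and the real app may handle bad input differently.

```python
def parse_page_range(start_text: str, end_text: str, page_count: int) -> range:
    """Turn 1-based page numbers typed by the user into 0-based indices.

    Raises ValueError when the input is not numeric or out of bounds.
    """
    try:
        start, end = int(start_text), int(end_text)
    except ValueError:
        raise ValueError("Page numbers must be whole numbers") from None
    if not 1 <= start <= end <= page_count:
        raise ValueError(
            f"Pages must satisfy 1 <= start <= end <= {page_count}"
        )
    return range(start - 1, end)  # end is inclusive for the user
```

For example, entering 2 and 4 for a 10-page PDF selects indices 1, 2, and 3, which are the second through fourth pages.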

🖼️ Example Interface

📦 Output Example

Opening PDF...
Processing page 1/4...
Extracted text successfully, converting to audio...
✅ Conversion Finished: saved as output.mp3
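The last step in this log, writing the extracted text to an audio file, could look roughly like the sketch below. The helper names `audio_path` and `text_to_audio` are hypothetical. The pyttsx3 calls shown (`init`, `save_to_file`, `runAndWait`) are part of its public API, but the audio format actually written can depend on the platform's speech driver, so a file named `.mp3` is not guaranteed to contain MP3 data.

```python
from pathlib import Path


def audio_path(name: str, folder: str = ".") -> Path:
    """Build the output path, appending .mp3 when it is missing."""
    name = name.strip()
    if not name:
        raise ValueError("Audio file name is empty")
    if not name.lower().endswith(".mp3"):
        name += ".mp3"
    return Path(folder) / name


def text_to_audio(text: str, name: str) -> Path:
    """Hypothetical helper: speak `text` into a file and return its path."""
    import pyttsx3  # imported lazily so audio_path works without it

    if not text.strip():
        raise ValueError("No text to convert")
    target = audio_path(name)
    engine = pyttsx3.init()
    engine.save_to_file(text, str(target))
    engine.runAndWait()  # blocks until the queued speech has been written
    return target
```

Because `runAndWait` blocks, a call like this belongs in the background thread rather than in a GUI callback.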

💡 Future Ideas

🗣️ Selectable voices (male/female)
🎛️ Adjustable speech speed and pitch
📁 Output folder selection
📚 Batch file conversion

👨‍💻 About

Vakta 2.0 was developed by Karlex to make reading easier and more accessible.
It’s free, open-source, and designed to turn your reading material into an audiobook effortlessly.

Version: 2.0
License: MIT
