This project enables audio transcription using Whisper, keyword extraction with spaCy, and translation to Hebrew using MarianMT models. It supports multiple languages for processing, including English, French, Spanish, German, and Swedish.
- Transcription: Converts audio to text with Whisper.
- Keyword Extraction: Uses spaCy to find key terms.
- Translation: Translates text into Hebrew with MarianMT models.
- YouTube Audio Processing: Downloads and extracts audio from videos.
- File Support: Works with local media files.
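As a rough illustration of how the transcription, keyword extraction, and translation features above might fit together, here is a minimal sketch. It is not the project's actual code: the function names (`spacy_model_for`, `marian_model_for`, `rank_keywords`, `process`) are hypothetical, the noun-frequency keyword heuristic is an assumption, and the MarianMT model names are assumed to follow the `Helsinki-NLP/opus-mt-<source>-he` naming pattern, which should be checked against the Hugging Face Hub before use. The heavy libraries are imported inside `process` so that the helpers can be used without them.

```python
from collections import Counter

# spaCy model names as listed in this README's installation steps.
SPACY_MODELS = {
    "en": "en_core_web_sm",
    "fr": "fr_core_news_sm",
    "es": "es_core_news_sm",
    "de": "de_core_news_sm",
    "sv": "sv_core_news_sm",
}


def spacy_model_for(lang):
    """Return the spaCy model name for a two-letter language code."""
    try:
        return SPACY_MODELS[lang]
    except KeyError:
        raise ValueError(f"unsupported language: {lang!r}") from None


def marian_model_for(lang):
    """Return an assumed MarianMT model name for translating lang to Hebrew."""
    spacy_model_for(lang)  # reuse the supported-language check
    # Assumption: the model follows the Helsinki-NLP naming pattern.
    return f"Helsinki-NLP/opus-mt-{lang}-he"


def rank_keywords(lemmas, top_n=10):
    """Rank candidate terms by frequency (a simple, assumed heuristic)."""
    counts = Counter(lemma.lower() for lemma in lemmas)
    return [word for word, _ in counts.most_common(top_n)]


def process(audio_path, whisper_size="base"):
    """Transcribe audio, extract keywords, and translate the text to Hebrew."""
    # Imported lazily: these packages are large and slow to load.
    import spacy
    import whisper
    from transformers import MarianMTModel, MarianTokenizer

    # 1. Transcription: Whisper also reports the detected language.
    result = whisper.load_model(whisper_size).transcribe(audio_path)
    text, lang = result["text"], result["language"]

    # 2. Keyword extraction: keep nouns and proper nouns, drop stop words.
    nlp = spacy.load(spacy_model_for(lang))
    lemmas = [
        token.lemma_
        for token in nlp(text)
        if token.pos_ in {"NOUN", "PROPN"} and not token.is_stop
    ]

    # 3. Translation: truncation=True cuts long input to the model limit,
    #    so a real pipeline would translate sentence by sentence.
    name = marian_model_for(lang)
    tokenizer = MarianTokenizer.from_pretrained(name)
    model = MarianMTModel.from_pretrained(name)
    batch = tokenizer([text], return_tensors="pt", truncation=True)
    generated = model.generate(**batch)
    hebrew = tokenizer.decode(generated[0], skip_special_tokens=True)

    return {
        "text": text,
        "language": lang,
        "keywords": rank_keywords(lemmas),
        "hebrew": hebrew,
    }
```

Calling `process("lecture.mp3")` (a placeholder file name) would return a dictionary with the transcript, the detected language, the ranked keywords, and the Hebrew translation, provided the detected language is one of the five supported ones.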
Ensure you have Python 3.8+ installed.
- Clone this repository:
git clone https://github.com/hagaryavin/NoteSync
- Install dependencies:
pip install -r requirements.txt
- Install spaCy models:
python -m spacy download en_core_web_sm
python -m spacy download fr_core_news_sm
python -m spacy download es_core_news_sm
python -m spacy download de_core_news_sm
python -m spacy download sv_core_news_sm
To start the application, run the following command in the terminal:
python app.py
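The five spaCy downloads from the installation steps can also be scripted rather than typed one at a time. The sketch below is one possible way to do that; the helper names `download_command` and `download_all` are hypothetical and not part of this repository. It runs the same `python -m spacy download` command for each model, using the current interpreter so that the models land in the active environment.

```python
import subprocess
import sys

# The models listed in the installation steps.
SPACY_MODELS = [
    "en_core_web_sm",
    "fr_core_news_sm",
    "es_core_news_sm",
    "de_core_news_sm",
    "sv_core_news_sm",
]


def download_command(model):
    """Build the 'python -m spacy download <model>' command as an argument list."""
    return [sys.executable, "-m", "spacy", "download", model]


def download_all(models=SPACY_MODELS):
    """Download each model in turn, stopping at the first failure."""
    for model in models:
        # check=True raises CalledProcessError if a download fails.
        subprocess.run(download_command(model), check=True)
```

Calling `download_all()` once after `pip install -r requirements.txt` should leave all five models installed.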