This tool generates audio queries using the ESPnet library and a provided model.
- Python 3.10
- Virtual Environment (venv)
-
Clone the repository:
git clone https://github.com/samiabat/text2speech cd text2speech -
Create and activate a virtual environment:
python3.10 -m venv venv source venv/bin/activate # On Windows, use `venv\Scripts\activate`
-
Install the required dependencies:
pip install -r requirements.txt
Note: Ensure that you are using Python 3.10, as some libraries like pyopenjtalk may not work with Python 3.11.
-
Download the ESPnet model and add the
config.yamlfile inside themodelfolder.
Run the following command to generate audio queries:
python audioQueryGenerator.py