A Multi-speaker, multilingual speech generation tool.
日本語はこちら
- Speech synthesis for 209 speakers (109 English / 100 Japanese)
- Script generation using LLM
- Accent and phoneme editing functions
- Voice conversion by RVC
- Batch voice conversion by RVC
- Python (tested on 3.10.6)
- PyTorch for CUDA
Check if the CUDA version of PyTorch is already installed withpip show torch.
It is OK ifcuis included in the Version as follows.
Version: 2.0.0+cu118If not, install the CUDA version of PyTorch.
pip uninstall torch
pip install torch --index-url https://download.pytorch.org/whl/cu118
- espeak
- Build Tools for Visual Studio (Not necessary if Japanese is not generated)
-
download the zip or
git clone https://github.com/log1stics/voice-generator-webui -
run
setup.batorsetup.bat en
setup.bat enavoids installation of pyopenjtalk used for Japanese generation
Run the webui.py file in PowerShell or at the command prompt
python webui.py
Run the webui.py file at a PowerShell or command prompt
apt-get install espeak # For English Text To Speak
git clone https://github.com/log1stics/voice-generator-webui
chmod +x setup.shsetup.sh
# if you do not use Japanese Text To Speak
# you can avoid install pyopenjtalk
setup.sh enFor example, if you want to handle an RVC trained model named EXAMPLE_MODEL.pth, place it as follows
vc/
└── models/
└── EXAMPLE_MODEL/
├── EXAMPLE_MODEL.pth # file name should be the same as the directory name
├── added.index # not necessary (name is fixed to added.index)
└── total_fea.npy # not necessary (name is fixed to added.index)Here's how to add code to this repo: Contributing
