A simple extension for the text-generation-webui by oobabooga that uses edge_tts for audio output. It also supports post-processing using RVC.
Assuming you already have the text-generation-webui set up (min version 1.6.1):
- Activate the conda environment using
conda activate textgen
- Enter the
text-generation-webui/extensions/
directory and clone this repository
cd text-generation-webui/extensions/
git clone https://github.com/BuffMcBigHuge/text-generation-webui-edge-tts.git edge_tts/
- Install the requirements
pip install -r edge_tts/requirements.txt
- Add
--extensions edge_tts
to your startup script
or
enable it through theSession
tab in the webui - Download the required RVC models and place them in the
extensions/edge_tts/models
folder
curl -L -O https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/hubert_base.pt
curl -L -O https://huggingface.co/lj1995/VoiceConversionWebUI/resolve/main/rmvpe.pt
- Add your
.pth
files toextensions/edge_tts/rvc_models
Edge TTS is a free API provided by Microsoft. An internet connection is required for the TTS to function.
RVC was inspired by rvc-tts-webui