Rendering emma_00 (1 of 1)...
Hello
Traceback (most recent call last):
  File "/home/nom/Projects/tortoise-tts-fast-GUI/./scripts/tortoise_tts.py", line 352, in <module>
    gen = tts.tts_with_preset(
AttributeError: 'TextToSpeech' object has no attribute 'tts_with_preset'
There are two repos: this one and a fork with a working, better UI. Both function much the same, so we'll use the fork.
I found tortoise unreliable on Windows, including voice training, so I only use it on Linux.
Instructions
1) Clone the repo and change into its directory
git clone --depth=1 https://github.com/Acephalia/tortoise-tts-fast-GUI.git && cd tortoise-tts-fast-GUI
2) Install Python 3.10 (if your distribution doesn't package it, find a repository/PPA that does)
sudo apt -y install python3.10 python3.10-dev python3.10-venv
3) Set up a virtual environment (ensures packages and versions only exist within this project)
python3.10 -m venv venv
echo "source venv/bin/activate" > activate
4) Activate the virtual environment (do this every time before you do anything with pip/python packages)
source activate
Info: You may leave the virtual environment by writing 'deactivate'
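If you're not sure whether the environment is actually active, here is a minimal sketch for checking it from Python, assuming the environment was created with python3.10 -m venv as in step 3. The function name in_virtualenv is just an illustration, not something from the repo.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the venv directory while
    # sys.base_prefix still points at the system Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    state = "active" if in_virtualenv() else "NOT active"
    print(f"Virtual environment is {state}: {sys.prefix}")
```

If it reports NOT active, run source activate again before touching pip.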
5) Install tortoise dependencies
Torch (CUDA edition)
pip3 install torch torchvision torchaudio --extra-index-url https://download.pytorch.org/whl/cu117 --no-cache-dir
Install this repo in editable mode (makes the vocoder magically work)
pip3 install -e .
BigVGAN (what tortoise-fast uses to synthesize high-fidelity waveforms)
pip3 install git+https://github.com/152334H/BigVGAN.git
Ensure this package is not installed; we source it locally instead
pip3 uninstall tortoise
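Before moving on, it may be worth confirming that the CUDA build of torch from this step can actually see the GPU. A rough sketch, run inside the activated environment; describe_torch is a made-up helper name, and it only reports what torch says rather than fixing anything.

```python
import importlib.util


def describe_torch() -> str:
    """Summarize whether torch is importable and can see a CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed in this environment"

    import torch  # imported lazily so the script still runs without torch

    if torch.cuda.is_available():
        # torch.version.cuda is the CUDA version the wheel was built against
        return f"torch {torch.__version__}, CUDA available (built for CUDA {torch.version.cuda})"
    return f"torch {torch.__version__}, CUDA NOT available"


if __name__ == "__main__":
    print(describe_torch())
```

If it says CUDA is not available, the usual suspects are a CPU-only wheel or a driver problem, so the pip3 install line with the cu117 index is the first thing to re-check.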
6) Download models and such
echo "Write your text here, the instructions are wrong" | ./scripts/tortoise_tts.py --voice emma --seed 42
This is where it appears to get stuck at "Downloading the main structure of voicefixer". In reality it's downloading roughly 600MB of data at the slowest speeds imaginable, which may take an hour. You can use a download manager like FDM and the method below to download the files manually.
If you cancel your download, delete this folder to try again
rm -rf ~/.cache/voicefixer
Faster method to obtain the files above using a download manager (FDM) or mirror (WIP)
Source: https://zenodo.org/record/5469951/files/model.ckpt-1490000_trimed.pt?download=1
Mirror: https://drive.google.com/file/d/1MetvWA9NULZPq0KjTdj0DFjQu5fIiwia/view
Destination: ~/.cache/voicefixer/synthesis_module/44100/model.ckpt-1490000_trimed.pt
Size: 129.3MB
Source: https://zenodo.org/record/5600188/files/vf.ckpt?download=1
Mirror: https://drive.google.com/file/d/1APezpeB6hjZWK3GG7oJZCgs6OKOSIZV-/view
Destination: ~/.cache/voicefixer/analysis_module/checkpoints/vf.ckpt
Size: 466.6MB
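After a manual download, a quick check that both files landed in the right place can save another hour of waiting. This is only a sketch: the paths and sizes are copied from the list above, check_voicefixer_cache is a hypothetical helper, and the 10% size tolerance is my assumption because it isn't clear whether the listed sizes are decimal or binary megabytes.

```python
from pathlib import Path

# Destinations (relative to ~/.cache/voicefixer) and sizes in MB,
# copied from the list above.
EXPECTED_FILES = {
    "synthesis_module/44100/model.ckpt-1490000_trimed.pt": 129.3,
    "analysis_module/checkpoints/vf.ckpt": 466.6,
}


def check_voicefixer_cache(cache_dir: Path, tolerance: float = 0.10) -> dict[str, str]:
    """Report 'ok', 'missing' or 'size mismatch' for each expected file."""
    report = {}
    for relative_path, expected_mb in EXPECTED_FILES.items():
        target = cache_dir / relative_path
        if not target.is_file():
            report[relative_path] = "missing"
            continue
        # Assumption: compare in decimal megabytes with a loose tolerance.
        actual_mb = target.stat().st_size / 1_000_000
        if abs(actual_mb - expected_mb) > expected_mb * tolerance:
            report[relative_path] = f"size mismatch ({actual_mb:.1f}MB)"
        else:
            report[relative_path] = "ok"
    return report


if __name__ == "__main__":
    cache = Path.home() / ".cache" / "voicefixer"
    for name, status in check_voicefixer_cache(cache).items():
        print(f"{status:>24}  {name}")
```

A size mismatch usually means a cancelled or partial download, in which case delete the file and fetch it again.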
7) Fix the next error of this buggy app (raised while loading the autoregressive model weights)
Edit api.py
Find:
self.autoregressive.load_state_dict(torch.load(ar_path))
Replace with:
self.autoregressive.load_state_dict(torch.load(ar_path), strict=False)
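For context, strict=False tells PyTorch's load_state_dict not to raise when the checkpoint's keys don't exactly match the model's, so mismatched keys are skipped instead of aborting the load. If you'd rather not edit by hand, the find/replace above can be scripted. This is a sketch, not part of the repo: patch_source and patch_file are hypothetical helpers, and you have to pass in wherever api.py lives in your checkout.

```python
from pathlib import Path

# The exact lines from the find/replace above.
OLD = "self.autoregressive.load_state_dict(torch.load(ar_path))"
NEW = "self.autoregressive.load_state_dict(torch.load(ar_path), strict=False)"


def patch_source(source: str) -> str:
    """Apply the step 7 find/replace to the text of api.py."""
    if NEW in source:
        return source  # already patched, nothing to do
    if OLD not in source:
        raise ValueError("expected load_state_dict line not found")
    return source.replace(OLD, NEW)


def patch_file(path: Path) -> None:
    """Rewrite the given file in place with the patched text."""
    path.write_text(patch_source(path.read_text()))
```

Running it twice is harmless, and it fails loudly if the line it expects isn't there (for example because the fork changed).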
8) Fix the next error of this buggy app (AttributeError: 'TextToSpeech' object has no attribute 'tts_with_preset')
WIP
This repo is so broken, I don't think it ever worked.