@Rongjiehuang I got this repo to work, but I had to correct some things. I hope this helps someone else.
Before starting the install, run sudo apt install gfortran libopenblas-base. These packages are required but not specified.
Change environment.yaml to remove the duplicate scipy and numpy entries, and remove the version requirements on scipy and numba (old versions cause conflicts with numpy).
If you already installed CUDA yourself, remove the copy installed by environment.yaml by running pip uninstall nvidia_cublas_cu11 (or whatever version you have).
In modules/GenerSpeech/config/generspeech.yaml, change emotion_encoder_path to checkpoints/Emotion_encoder.pt
Add the GenerSpeech dir root to your sys.path, either by moving GenerSpeech.py to the GenerSpeech dir or by adding these lines at the top of GenerSpeech.py (otherwise Python can't find the imports):
import sys, os
sys.path.insert(0, os.path.join(os.path.dirname(__file__), '..'))
Run mfa thirdparty download
In utils.hparams.py, remove help='location of the data corpus' from lines 29 and 32 because it's misleading. Line 41 needs to include remove=False.
Preprocessing fails at data_gen_utils line 299 if a word is missing from mfa_dict.txt, because the TextGrid will skip the phones of the missing word. Some common words are in fact not in the dictionary, such as "her" (HH_ER1) and "processing" (P_R_AA1_S_EH0_S_IH0_NG). You have to add them to the dictionary yourself. The correct way is to run mfa validate and append the missing words to mfa_dict.txt first (see this script).
Also, you may want to use praatio as the standard TextGrid parser.
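As a rough pre-check for the missing-word problem described above, something like the sketch below can list transcript words that have no dictionary entry before preprocessing is run. It is not the script linked above and it does not replace mfa validate. The function names are hypothetical, and it assumes each line of mfa_dict.txt starts with the word followed by whitespace-separated phones, and that lowercase letters and apostrophes are enough to tokenize the transcripts.

```python
import re


def load_dictionary_words(dict_lines):
    """Collect headwords from dictionary lines (assumed format: 'word PH1 PH2 ...')."""
    words = set()
    for line in dict_lines:
        parts = line.split()
        if parts:  # skip blank lines
            words.add(parts[0].lower())
    return words


def find_missing_words(transcripts, dictionary_words):
    """Return the sorted list of transcript words that have no dictionary entry."""
    missing = set()
    for text in transcripts:
        # Simplified tokenizer (an assumption): runs of letters and apostrophes.
        for word in re.findall(r"[a-z']+", text.lower()):
            if word not in dictionary_words:
                missing.add(word)
    return sorted(missing)


# Toy data for illustration only; in practice, read the lines of mfa_dict.txt
# and your transcript files instead.
example_dict = ["the DH AH0", "cat K AE1 T"]
example_transcripts = ["The cat saw her processing."]
missing_words = find_missing_words(
    example_transcripts, load_dictionary_words(example_dict)
)
print(missing_words)
```

Any word this reports still needs a pronunciation appended to mfa_dict.txt by hand or from the validation output; the sketch only finds the gaps.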
Hi @iamanigeeit, thank you so much for your installation steps, they help a lot! I haven't managed to set it up yet, though, so I may need to bother you with some questions. After I used these two lines to install mfa, I can't run mfa thirdparty download directly; it fails with a "thirdparty command not exist" error. May I know which versions of kaldi, pynini, and mfa you installed?