Hi! Very interested in using SOFA as a replacement for MFA. Had a quick question about training a new model. The readme refers to a
Replies: 1 comment
Hi! In terms of the dataset format for training a new model, it's consistent with the one used by openvpi/DiffSinger. You can utilize openvpi/MakeDiffSinger to create a dataset for your needs.

More specifically, for `full_label` data, the CSV should contain three columns: `name`, `ph_seq`, and `ph_dur`.

- `name` is the filename of the WAV file without the extension.
- `ph_seq` is the phoneme annotation sequence for the WAV file, separated by spaces. Any phonemes listed in `ignored_phonemes` within the `configs/binarize_config.yaml` will be considered as SP.
- `ph_dur` corresponds to the duration sequence for each phoneme in the sequence, measured in seconds, and also separated by spaces.

Here's an example of w…
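To make the three-column layout described above concrete, here is a minimal Python sketch that parses a CSV of that shape and checks that each row has as many durations as phonemes. It is not part of SOFA or DiffSinger: the `check_rows` helper, the sample file name `sample_0001`, and the phoneme and duration values are all made up for illustration.

```python
import csv
import io

# Column names taken from the reply above.
REQUIRED_COLUMNS = ("name", "ph_seq", "ph_dur")

# Hypothetical sample content: the file name, phonemes, and durations are
# invented for illustration and do not come from a real dataset.
SAMPLE_CSV = """name,ph_seq,ph_dur
sample_0001,SP h e l ou SP,0.25 0.08 0.12 0.07 0.30 0.40
"""


def check_rows(csv_text):
    """Parse the CSV text and return its rows, raising ValueError if malformed."""
    reader = csv.DictReader(io.StringIO(csv_text))
    header = reader.fieldnames or []
    missing = [col for col in REQUIRED_COLUMNS if col not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")

    rows = []
    for row in reader:
        phonemes = row["ph_seq"].split()                      # space-separated phonemes
        durations = [float(d) for d in row["ph_dur"].split()]  # seconds, space-separated
        if len(phonemes) != len(durations):
            raise ValueError(
                f"{row['name']}: {len(phonemes)} phonemes but {len(durations)} durations"
            )
        if any(d < 0 for d in durations):
            raise ValueError(f"{row['name']}: negative duration")
        rows.append({"name": row["name"], "ph_seq": phonemes, "ph_dur": durations})
    return rows


rows = check_rows(SAMPLE_CSV)
print(rows[0]["name"], len(rows[0]["ph_seq"]), "phonemes")
```

Running a check like this before training can catch rows where the phoneme and duration counts have drifted apart, which is an easy mistake to make when editing labels by hand.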