VALL-E 1.0: Reproduce the results of the paper, train larger models, and greatly improve the speed of token extraction.
VALL-E 1.0 Release notes
- Reproduced Demo https://lifeiteng.github.io/valle/index.html
- Training commands egs/libritts#demo
- Inference valle/run.sh#L68
- Support two-stage training #59
- Support Mandarin #85 Thanks @zhaomingwork
- Speed up Encodec AudioTokenizer #81 #88 Thanks @Guokr233
- Thanks reviewer @zjwang21