Achieve better transcription in more languages with a smaller model: implement Massively Multilingual Speech (MMS), Meta's open-source model with less than half of Whisper's error rate #491
Labels: enhancement (New feature or request)
Please consider improving your tool's capabilities by implementing Meta's MMS, which supports speech recognition and speech generation for over 1,000 languages and, according to Meta, has a drastically lower word error rate than Whisper:
https://github.com/facebookresearch/fairseq/tree/main/examples/mms
https://ai.facebook.com/blog/multilingual-model-speech-recognition/
It even understands and speaks Igbo, among many other Nigerian and African languages ;-)