Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

menelic · 2023-06-14T18:19:46Z

Please consider improving your tools capabilities by implementing Meta's MMS with speech recognition and generation support for over 1000 languages at a drastically reduced error rate compared to Whisper:

https://github.com/facebookresearch/fairseq/tree/main/examples/mms

https://ai.facebook.com/blog/multilingual-model-speech-recognition/

It even understands and speaks Igbo, among many other Nigerian and African languages ;-)

raivisdejus added the enhancement New feature or request label Aug 10, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

menelic commented Jun 14, 2023 •

edited

Loading

Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

Comments

menelic commented Jun 14, 2023 • edited Loading

menelic commented Jun 14, 2023 •

edited

Loading