Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Achieve better transcription in more languages with smaller model size, Implement Massively Multilingual Speech - Meta's Open Source model with less than half of Whispers error rate #491

Open
menelic opened this issue Jun 14, 2023 · 0 comments
Labels
enhancement New feature or request

Comments

@menelic
Copy link

menelic commented Jun 14, 2023

Please consider improving your tools capabilities by implementing Meta's MMS with speech recognition and generation support for over 1000 languages at a drastically reduced error rate compared to Whisper:

image

https://github.com/facebookresearch/fairseq/tree/main/examples/mms

https://ai.facebook.com/blog/multilingual-model-speech-recognition/

It even understands and speaks Igbo, among many other Nigerian and African languages ;-)

@raivisdejus raivisdejus added the enhancement New feature or request label Aug 10, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants