Multilingual Transcription with Diarization #967
-
I am interested in transcribing audio files with users speaking in English and Spanish. When I tried Whisper, it did a reasonable job of giving all the text. However, it is missing diarization. I am quite excited about the multilingual feature in Nova 2. Request ID: e2795fbd-b60a-4639-a3bc-17cdb91ea1ba Nova2 output Whisper output As you can see, Nova2 misses the very first sentence: Any suggestions on how to improve the accuracy / solve this use case? |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
@anilshanbhag unfortunately there isn't much you can do yourself to improve the model's output in terms of accuracy. This is a new feature of our Nova-2 model, so it's possible you are just experiencing some of the limitations of it. However here are some things to try to see if you can get some incremental gains: To improve the accuracy of your audio transcription with Deepgram, you can consider the following approaches:
Remember to benchmark your transcription results without any enhancements first, then add improvements one by one to notice the effect. Each of these methods can contribute to improving your transcription accuracy, depending on your specific use case and audio characteristics. |
Beta Was this translation helpful? Give feedback.
@anilshanbhag unfortunately there isn't much you can do yourself to improve the model's output in terms of accuracy. This is a new feature of our Nova-2 model, so it's possible you are just experiencing some of the limitations of it.
However here are some things to try to see if you can get some incremental gains:
To improve the accuracy of your audio transcription with Deepgram, you can consider the following approaches:
Use Keyword Boosting: This feature allows you to specify words that Deepgram should pay special attention to during transcription. It's particularly useful for uncommon words, proper nouns, product names, and industry-specific terminology. You can send up to 200 keywor…