Transcripts always misses the first words (at least in french) #546
Replies: 3 comments 8 replies
-
Thanks for asking your question about Deepgram! If you didn't already include it in your post, please be sure to add as much detail as possible so we can assist you efficiently, such as:
|
Beta Was this translation helpful? Give feedback.
-
Happened to me as well! It seems that the support for French language is still very poor :( |
Beta Was this translation helpful? Give feedback.
-
hi @netw0rkf10w and @yafkari Sorry, we missed this question. If you are customer with a support contract, I would recommend submitting a ticket through your customer portal for quicker response. PAYG support is community support, but we usually try to acknowledge new community issues. It seems like this one fell through the cracks. There was a refresh to all the models recently (especially around non-english languages) on Monday I believe. Please give it a try and let me know if you are still running into the same issue. |
Beta Was this translation helpful? Give feedback.
-
Which Deepgram product are you using?
Deepgram SDKs
Details
We have an issue with our service that sends a mp3 file to be transcribe with
await deepgram.listen.prerecorded.transcribeFile(Buffer.from(contents, "base64"), options);
.Everytime we send a file to be transcribed, the first word is missing.
Most of our tests were using french language so I cannot say 100% that this is not the case in eng but from our experience with Deepgram, the first word of the transcription is always missing and it causes issues with our testers that always have to correct their transcriptions by adding the missing first word. In the worst case, it can also make the the next word wrongly interpreted.
I didn't find anyone else experimenting this or complaining about it so I suppose it's related to the french language? I would also be curious to know if this also happens with other languages.
If I have the time I would also like to add a 1s of blank in the audio file to see if it can help detect the first word.
EDIT:
Just found a transcription where the second word start time was "0.08s" so i suppose the first word was too close to the beginning of the audio.
If you are making a request to the Deepgram API, what is the full Deepgram URL you are making a request to?
/v1/listen?punctuate=true&detect_language=true¶graphs=false&model=nova-2-general&diarize=false&smart_format=true&numbers=true
If you are making a request to the Deepgram API and have a request ID, please paste it below:
4213aae1-ae72-499b-bff9-3a66734177f7
If possible, please attach your code or paste it into the text box.
No response
If possible, please attach an example audio file to reproduce the issue.
No response
Beta Was this translation helpful? Give feedback.
All reactions