Transcripts always misses the first words (at least in french) #546

yafkari · 2024-01-20T22:45:13Z

yafkari
Jan 20, 2024

Which Deepgram product are you using?

Deepgram SDKs

Details

We have an issue with our service that sends a mp3 file to be transcribe with await deepgram.listen.prerecorded.transcribeFile(Buffer.from(contents, "base64"), options);.

Everytime we send a file to be transcribed, the first word is missing.

Most of our tests were using french language so I cannot say 100% that this is not the case in eng but from our experience with Deepgram, the first word of the transcription is always missing and it causes issues with our testers that always have to correct their transcriptions by adding the missing first word. In the worst case, it can also make the the next word wrongly interpreted.

I didn't find anyone else experimenting this or complaining about it so I suppose it's related to the french language? I would also be curious to know if this also happens with other languages.

If I have the time I would also like to add a 1s of blank in the audio file to see if it can help detect the first word.

EDIT:

Just found a transcription where the second word start time was "0.08s" so i suppose the first word was too close to the beginning of the audio.

If you are making a request to the Deepgram API, what is the full Deepgram URL you are making a request to?

/v1/listen?punctuate=true&detect_language=true&paragraphs=false&model=nova-2-general&diarize=false&smart_format=true&numbers=true

If you are making a request to the Deepgram API and have a request ID, please paste it below:

4213aae1-ae72-499b-bff9-3a66734177f7

If possible, please attach your code or paste it into the text box.

No response

If possible, please attach an example audio file to reproduce the issue.

No response

team-deepgram · 2024-01-20T22:45:22Z

team-deepgram
Jan 20, 2024
Maintainer

Thanks for asking your question about Deepgram! If you didn't already include it in your post, please be sure to add as much detail as possible so we can assist you efficiently, such as:

The request_id if you have a question about your requests or transcription responses.
The features you used or the full api.deepgram.com URL you sent your request to, including parameters.
Any code snippets you can share.

0 replies

netw0rkf10w · 2024-06-07T17:45:22Z

netw0rkf10w
Jun 7, 2024

Happened to me as well! It seems that the support for French language is still very poor :(

0 replies

davidvonthenen · 2024-06-12T16:51:44Z

davidvonthenen
Jun 12, 2024

hi @netw0rkf10w and @yafkari

Sorry, we missed this question. If you are customer with a support contract, I would recommend submitting a ticket through your customer portal for quicker response. PAYG support is community support, but we usually try to acknowledge new community issues. It seems like this one fell through the cracks.

There was a refresh to all the models recently (especially around non-english languages) on Monday I believe. Please give it a try and let me know if you are still running into the same issue.

8 replies

davidvonthenen Jul 18, 2024

hi @yafkari

No worries about the delay. Trust me... I know that busy feeling. 🤣

Did you try the nova-2 model?

Also, I just realized that detect_language might be a pretty significant hit when first starting vs knowing what languages the calls are in and just setting it. Another metric that could be of interest is trying this out and seeing what the difference in time is.

yafkari Jul 19, 2024
Author

@dvonthenen

The first screenshot was actually the nova-2 model. (The initial request id in my first message is too)

I couldn't test on nova modal since it doesn't support french.

Did some runs (15-20) with language or detect_language, i don't see any significant change. But whatever the time it takes apparently it doesn't get it.

Here are some duration from my tests

model: base
with language parameter
4.67s
1.58s
2.17s
1.3s
0.8s
1.3s

with detect_language
4.5s
4.2s
2.4s
3.3s
1.7s
1.3s
1.3s

model: nova-2
with language parameter
1.5
1
4
2.22
1.4

with detect_language
1.2
1.3
1.5
1.2

I was expecting it would be the same but for the science i did the tests with the rest api while i use the js sdk on the server but as i expected, it's the same.

PS: just realised I'm killing my free credits but it's worth it 😆

davidvonthenen Jul 22, 2024

I will take this back to the team. The piece about the REST call doing the same thing is very valuable.

yafkari Sep 3, 2024
Author

Hey @dvonthenen, just tried with Whisper cloud model from the Deepgram playground page and it actually detected the first word!

yafkari Sep 9, 2024
Author

Me again @dvonthenen , as a workaround I add one second of blank at the beginning of the audio and fix the timings later by myself.

(I also tried to add only 0.1s but it was still not getting the first word, so to be safe i'll add one second and fix the timings later programmatically)

Still a pain, but at least i can continue using nova-2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Deepgram

Transcripts always misses the first words (at least in french) #546

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 3 comments 8 replies

{{title}}

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{title}}

Select a reply

Deepgram

Transcripts always misses the first words (at least in french) #546

yafkari Jan 20, 2024

Which Deepgram product are you using?

Details

If you are making a request to the Deepgram API, what is the full Deepgram URL you are making a request to?

If you are making a request to the Deepgram API and have a request ID, please paste it below:

If possible, please attach your code or paste it into the text box.

If possible, please attach an example audio file to reproduce the issue.

Replies: 3 comments · 8 replies

team-deepgram Jan 20, 2024 Maintainer

netw0rkf10w Jun 7, 2024

davidvonthenen Jun 12, 2024

davidvonthenen Jul 18, 2024

yafkari Jul 19, 2024 Author

davidvonthenen Jul 22, 2024

yafkari Sep 3, 2024 Author

yafkari Sep 9, 2024 Author

yafkari
Jan 20, 2024

Replies: 3 comments 8 replies

team-deepgram
Jan 20, 2024
Maintainer

netw0rkf10w
Jun 7, 2024

davidvonthenen
Jun 12, 2024

yafkari Jul 19, 2024
Author

yafkari Sep 3, 2024
Author

yafkari Sep 9, 2024
Author