Maximum allowed value of endpointing parameter, if any? #177
-
I am using model = enhanced and language = de. Is there a maximum value for endpointing parameter? I only see examples of it being set to 500ms, but when I try to set it to a larger value (e.g., 2000ms), it does not appear to work. Could you advise on what maximum values of endpointing is allowed? I find 500ms to be too short to detect the end of speech, especially for streaming applications. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
The Deepgram Because of this, the longer For times > 1000ms, there is another feature which is useful that I use for my phone apps called To use this feature, try setting Apologies for the long-winded explanation, this is a tricky thing and finding the best solution requires some fine-tuning for sure! I've been using |
Beta Was this translation helpful? Give feedback.
The Deepgram
endpointing
algorithm is audio-based - so if you setendpointing=2000
it will wait to trigger aspeech_final
message if it detects 2000ms of silence in the audio. For very clear audio signals, this is no problem, but I find with even slightly noisy signals, like phone calls, it is rare to have 2000ms of silence - some noise/blip/bump/sound is likely to occur within that time period.Because of this, the longer
endpointing
is set to, the more likely it is that some noise will occur, effectively barringendpointing
/speech_final
from triggering. For this reason,endpointing
works best with shorter times (< 1000ms is my rule of thumb, though it depends on how noisy the audio sour…