Error rate and Decoding heuristics #550
tom-huntington started this conversation in General
-
I'm wondering what the error rates would be if the segments were transcribed in parallel with an appropriate windowing heuristic. You would lose the previous-text conditioning, but that doesn't seem to affect the error rate much.
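To make the question concrete, here is a minimal sketch of one possible windowing heuristic: fixed-length windows with a fixed overlap, so that each window can be transcribed independently. The function name `make_windows` and its parameters are hypothetical and chosen for illustration; this is not how the reference implementation segments audio, and it leaves out stitching the overlapping transcripts back together.

```python
def make_windows(total_samples, window_samples, overlap_samples):
    """Split [0, total_samples) into overlapping (start, end) windows.

    Hypothetical helper: windows are fixed-length and each one starts
    `window_samples - overlap_samples` after the previous one. The last
    window is truncated at the end of the audio.
    """
    if window_samples <= 0:
        raise ValueError("window_samples must be positive")
    if not 0 <= overlap_samples < window_samples:
        raise ValueError("overlap_samples must be in [0, window_samples)")

    step = window_samples - overlap_samples
    windows = []
    start = 0
    while start < total_samples:
        end = min(start + window_samples, total_samples)
        windows.append((start, end))
        if end == total_samples:
            break  # reached the end of the audio
        start += step
    return windows


if __name__ == "__main__":
    # Units are arbitrary here (samples, frames, or seconds all work).
    print(make_windows(70, 30, 5))
```

Because every window is known up front, the windows could be batched or dispatched to separate workers. The cost is that no window sees the text decoded from the one before it.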
-
The original paper gives really good information on the accuracy of the different heuristics:
In total, the different decoding heuristics correctly transcribe about 1 extra word in 100 when the error rate is sufficiently high.
I would have been interested to see the word error rate broken down by function vs. content words.
Greedy decoding is good enough for me, but 1 extra content word in 100 might make it worth implementing additional heuristics for an ONNX/C++ deployment.
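For anyone who wants to measure this on their own data, here is a minimal sketch of word error rate (WER), computed as word-level edit distance divided by the number of reference words. It assumes plain whitespace tokenization with no text normalization, so its numbers will not match published figures that normalize the text first. The function name `word_error_rate` is illustrative. Read loosely, "1 extra word in 100" is a change of about one percentage point in this metric.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length.

    Assumes whitespace tokenization and exact string matching; any
    casing or punctuation normalization must be done by the caller.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] = edit distance between the first i-1 reference words
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

A function vs. content word breakdown would additionally need the alignment itself, not just the distance, so that each error can be attributed to a specific reference word.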