OOV words with small LM #115

davidavdav · 2023-11-06T13:42:11Z

Hello,

We have an application with a very small vocabulary (~100 words). With an almost trivial bigram model (as kenlm seems not to be able to make a unigram model), we see that decoder.decode() produces words that are not in the language model.

Is there some kind of fallback to letter decoding? Is there a way to turn this off?

Thanks!

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OOV words with small LM #115

OOV words with small LM #115

davidavdav commented Nov 6, 2023

OOV words with small LM #115

OOV words with small LM #115

Comments

davidavdav commented Nov 6, 2023