Lingua package misinterprets text containing HTML tags as a different language #253
AKHIL-RCRM
started this conversation in
General
Replies: 1 comment
-
Please remove all HTML tags before passing the text to the language detector. The library is designed to expect natural language text only, so any markup should be removed in a prior step.
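As a rough illustration of that "prior step", here is a minimal sketch that strips markup with Python's standard-library `html.parser` before detection. It assumes the Python binding of Lingua (`lingua-language-detector`) is the one in use; `strip_html` and `detect_language` are hypothetical helper names, not part of the library.

```python
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects text nodes, skipping the contents of <script> and <style>."""

    _SKIPPED = {"script", "style"}

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self._chunks = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self._SKIPPED:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self._SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep only text that is not inside a skipped element.
        if self._skip_depth == 0:
            self._chunks.append(data)

    def text(self):
        # Join the chunks and collapse runs of whitespace.
        return " ".join(" ".join(self._chunks).split())


def strip_html(markup: str) -> str:
    """Hypothetical helper: return only the visible text of an HTML snippet."""
    parser = _TextExtractor()
    parser.feed(markup)
    parser.close()
    return parser.text()


def detect_language(markup: str):
    """Hypothetical helper: strip markup first, then run the detector.

    Assumption: the lingua-language-detector package is installed.
    """
    from lingua import LanguageDetectorBuilder

    detector = LanguageDetectorBuilder.from_all_languages().build()
    return detector.detect_language_of(strip_html(markup))
```

Because text chunks are joined with a space, punctuation that directly follows an inline tag may pick up an extra space; that should not matter much for language detection, but a dedicated HTML-to-text library may be a better fit for messy real-world pages.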
-
I passed in a text that contains some HTML tags and styling; the text inside is written in English. But the package misidentified the language as "Yoruba". No idea why. Is there any fix for this?