Replies: 2 comments 1 reply
-
It would be good to take |
Beta Was this translation helpful? Give feedback.
1 reply
-
#774 and #513 should probably be addressed together, towards making sure all output follows the same canonical forms. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
The Recognizers-Text project offers models per languange with as wide language variant support as possible, without the need to specify fine-grained locales. For example, the English model should support as much of the variations of the English language as possible in a single model (ex: en-US, en-AU, en-GB, en-HK, ...).
However, there may be cases were an entity interpretation - in this case a number format - may be ambiguous between two language variants.
By design, the recognizers consider one variant of a language as the default. For example, for English, the default variant is en-US. If the default behaviour of the model is not as desired, a user can request a variant-specific version, instead of the general model (i.e., request "en-GB" instead of "en-*").
The current design of the recognizers for Numbers can be summarized as:
For example, in English, which supports variants and uses en-US as default variant:
As previously stated, if the behaviour for the ambigous cases is not as desired, a user should be able to specify a specific language variant.
There are two known issues in the current implementation:
Beta Was this translation helpful? Give feedback.
All reactions