Multiple suffix stripping does not work #116

Open
tgross35 opened this issue Jun 13, 2024 · 0 comments

Reported by @gwen-lg in #108 (comment):

In the sentence: Même mourant, vous répondez présent.
the word répondez is flagged as an error, although it is a conjugation of the verb répondre.
projections is not recognized as the plural of projection.

We aren't doing suffix stripping in a way that is compliant with Hunspell. Relevant dict entries:

répondre/tA
projection/S.

And a subset of the relevant parts of the affix file:

FLAG long

SFX S. Y 2
SFX S. 0 0/L'D'Q' [^sxz]
SFX S. 0 s/D'Q' [^sxz]

SFX tA Y 44
SFX tA re re/n'q'd'l'm't's' [dp]re
SFX tA re ant/n'q'd'l'm't's' [dp]re
SFX tA re ons/n'l't' [dp]re
...
SFX tA re ez/n'l'm' [dp]re

REP ^l l'
REP ^d d'
REP ^n n'
REP ^s s'
REP ^j j'
REP ^m m'
REP ^t t'
REP ^c c'
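For reference, here is a minimal sketch of how a single SFX line from the excerpt above is meant to apply in the forward direction (stem to derived form). The names `CondElem`, `SuffixRule`, and `parse_condition` are hypothetical and are not taken from this project's code. The sketch ignores continuation flags and only handles `.`, literal characters, and bracket classes in the condition.

```rust
/// One element of an affix condition such as `[dp]re` or `[^sxz]`.
#[derive(Debug, Clone, PartialEq)]
enum CondElem {
    Any,
    Literal(char),
    Class { chars: Vec<char>, negated: bool },
}

/// Hypothetical in-memory form of one `SFX` line (continuation flags omitted).
#[derive(Debug, Clone)]
struct SuffixRule {
    strip: String,
    add: String,
    condition: Vec<CondElem>,
}

/// Parse a condition string; returns `None` if a `[` is never closed.
fn parse_condition(src: &str) -> Option<Vec<CondElem>> {
    let mut out = Vec::new();
    let mut chars = src.chars().peekable();
    while let Some(c) = chars.next() {
        match c {
            '.' => out.push(CondElem::Any),
            '[' => {
                let negated = chars.peek() == Some(&'^');
                if negated {
                    chars.next();
                }
                let mut set = Vec::new();
                loop {
                    match chars.next()? {
                        ']' => break,
                        ch => set.push(ch),
                    }
                }
                out.push(CondElem::Class { chars: set, negated });
            }
            other => out.push(CondElem::Literal(other)),
        }
    }
    Some(out)
}

impl SuffixRule {
    /// A `0` in the strip or add field means "nothing".
    fn new(strip: &str, add: &str, condition: &str) -> Option<Self> {
        let zero = |s: &str| if s == "0" { String::new() } else { s.to_owned() };
        Some(Self {
            strip: zero(strip),
            add: zero(add),
            condition: parse_condition(condition)?,
        })
    }

    /// Apply the rule to a stem: the condition must match the end of the
    /// stem, then `strip` is removed and `add` is appended.
    fn apply(&self, stem: &str) -> Option<String> {
        let stem_chars: Vec<char> = stem.chars().collect();
        if stem_chars.len() < self.condition.len() {
            return None;
        }
        let tail = &stem_chars[stem_chars.len() - self.condition.len()..];
        let matches = tail.iter().zip(&self.condition).all(|(c, elem)| match elem {
            CondElem::Any => true,
            CondElem::Literal(l) => c == l,
            CondElem::Class { chars, negated } => chars.contains(c) != *negated,
        });
        if !matches {
            return None;
        }
        let base = stem.strip_suffix(self.strip.as_str())?;
        Some(format!("{}{}", base, self.add))
    }
}
```

With this sketch, `SuffixRule::new("re", "ez", "[dp]re")` applied to répondre gives répondez, and `SuffixRule::new("0", "s", "[^sxz]")` applied to projection gives projections. A checker has to run the same rule in reverse, starting from the derived form.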

Note that the suffix rules listed here attach further affix flags to the forms they produce (the part after the /, such as n'l'm'), which is likely the source of the problem. Unfortunately, this will probably need lazy checking to work (#92).
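To illustrate what those extra flags look like once parsed, here is a small sketch that splits a field such as ez/n'l'm' into the text to add and its flags. It assumes FLAG long, where every flag is exactly two characters. The function name `split_add_field` is hypothetical and is not part of this project's API.

```rust
/// Split a field like `ez/n'l'm'` into the part before `/` and its flags.
/// Assumes `FLAG long`: every flag is exactly two characters wide.
/// (Hypothetical helper, for illustration only.)
fn split_add_field(field: &str) -> (&str, Vec<String>) {
    // Everything after the first `/` is the flag list; it may be absent.
    let (add, flags) = field.split_once('/').unwrap_or((field, ""));
    let chars: Vec<char> = flags.chars().collect();
    let flags = chars
        .chunks(2)
        .map(|pair| pair.iter().collect::<String>())
        .collect();
    (add, flags)
}
```

For ez/n'l'm' this yields the text ez plus the flags n', l', and m'. The same split works on the dictionary entries, so répondre/tA yields the single flag tA. Each flag on a derived form names another group of affix rules that may be applied on top of it, which is the extra level a checker has to follow.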
