Issue with written accents in Spanish search #54

Open
acuahutle opened this issue Oct 14, 2018 · 1 comment

@acuahutle

There is an issue when searching text in Spanish. The search treats words like cafe and café as two completely different words. Pages containing café, which is the correct spelling, are found only if the user searches for the term with exactly that spelling. If the user types the word without the written accent, the pages containing the correctly accented spelling are not returned; only pages where the word was misspelled without the accent are returned. Enabling fuzzy search did not help.

A similar issue with Polish is discussed on the tntsearch library page, where it is said to be related to the character set. However, I don't see an option for this in the Grav TNTSearch plugin.

@rhukster
Member

Actually, I can't get it to work even when I manually set the charset: utf8.

I mean I can find accented words correctly, but searching for the non-accented version doesn't find them. I think this is what you are describing, but it's a fundamental limitation in tntsearch. It should have indexing logic to save the string in both accented and non-accented versions.
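
To make the "index both versions" idea concrete, here is a minimal sketch in Python. It is not tntsearch or plugin code: the helper names `fold_accents`, `build_index`, and `search` are hypothetical, and the index is a plain in-memory dictionary used only for illustration. It assumes accents can be removed by Unicode decomposition (NFD) followed by dropping combining marks.

```python
import unicodedata
from collections import defaultdict


def fold_accents(text: str) -> str:
    """Return text with combining marks removed (e.g. 'café' -> 'cafe')."""
    # NFD splits 'é' into 'e' plus a combining acute accent.
    decomposed = unicodedata.normalize("NFD", text)
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return unicodedata.normalize("NFC", stripped)


def build_index(pages: dict[str, str]) -> dict[str, set[str]]:
    """Hypothetical indexer: store each word under both its original
    and its accent-folded form."""
    index: dict[str, set[str]] = defaultdict(set)
    for page_id, body in pages.items():
        for word in body.lower().split():
            index[word].add(page_id)
            index[fold_accents(word)].add(page_id)
    return index


def search(index: dict[str, set[str]], term: str) -> set[str]:
    """Look up the term as typed and in its accent-folded form."""
    term = term.lower()
    return index.get(term, set()) | index.get(fold_accents(term), set())


pages = {"menu": "un café solo", "typo": "un cafe barato"}
index = build_index(pages)
print(search(index, "cafe"))
```

With this approach a query for cafe matches both pages, including the one with the correct spelling café. One caveat of the assumption above: stripping all combining marks also turns ñ into n, which may not be desirable for Spanish, where ñ is a separate letter.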
