Mroonga and Pgroonga search index support for CJK languages #11214
Unanswered
ishitatsuyuki
asked this question in
Ideas
Replies: 2 comments 5 replies
-
Perhaps a good approach would be to include the Mroonga index as a separate table — this way it can go through schema changes independently. |
Beta Was this translation helpful? Give feedback.
0 replies
-
We are dropping MyISAM as the default Any idea whether InnoDB does better with CJK languages? |
Beta Was this translation helpful? Give feedback.
5 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
We're using Craft CMS to build a website in Japanese, and we have a little trouble with the behavior of the search.
English-centric search indexers like MyISAM have trouble tokenizing CJK languages as the word boundaries are not explicitly denoted by space like in English-like languages. As such, it can omit results in unexpected ways or otherwise result in poor search results.
The Mroonga extension is a drop-in replacement to MyISAM that provides CJK language indexing. Using it is easy: Just ALTER TABLE set the engine and it's done. The Pgroonga does the same, but for Postgres.
I would like to see official support for this in Craft, mostly so that the schema doesn't get broken accidentally. It would also be good to see feature detection (checking if extension is loaded) and one-click switching — but I know that this is a niche use case so that isn't 100% necessary.
Beta Was this translation helpful? Give feedback.
All reactions