Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adding N'ko Wordlist #57

Open
NeilSureshPatel opened this issue May 17, 2023 · 2 comments
Open

Adding N'ko Wordlist #57

NeilSureshPatel opened this issue May 17, 2023 · 2 comments
Assignees

Comments

@NeilSureshPatel
Copy link
Contributor

Hi @m4rc1e! I was just about to start trying this out and I see that there isn't an N'ko wordlist. Wondering if one could be created. Fortunately, there is a tool that is being used to collect N'ko words primarily from N'ko news sites. Without an account you can get 1000 of the most frequent words from here. N'ko Wordlist There is also some n-gram data is that is useful too. N'ko N-gram

Would this be doable from this list? It does look like there are some English words that have been pulled in that need to be scrubbed out. Thanks.

@m4rc1e m4rc1e self-assigned this May 18, 2023
@m4rc1e
Copy link
Collaborator

m4rc1e commented May 18, 2023

Hey Neil!

Excellent request. 1000 words may be too few. However, it's better than nothing.

Do you have the urls for any N'ko news sites? we could potentially use this to make our own.

@NeilSureshPatel
Copy link
Contributor Author

The main news site that is still regularly posting content is Kanjamadi. Looking at the site now I see they added a dictionary which may be particularly useful. N'ko Dictionary
There is also content on N'ko Wikipedia
We have been slowly propagating CLDR so there maybe some useful bits there as well.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants