Corpus alignment for Arabic and English #44

Open
theamato opened this issue Jan 2, 2023 · 1 comment

theamato commented Jan 2, 2023

Can Bleualign be used to align Arabic and English corpora? Or will the fact that Arabic is written from right to left affect performance?

rsennrich (Owner) commented

Bleualign will work fine for Arabic-English. Only languages that don't separate words with spaces are a theoretical problem for BLEU with default settings; for those, you can use --bleu_charlevel to perform character-level BLEU.
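To illustrate the distinction, here is a minimal Python sketch of word-level versus character-level n-gram matching. It is not Bleualign's actual code: the helper names (`word_tokens`, `char_tokens`, `ngram_matches`) and the sample sentences are made up for this example. Arabic text is stored in logical order with spaces between words, so whitespace splitting behaves the same as for English, and the right-to-left display direction plays no role. A language written without spaces collapses to a single token per sentence unless it is split into characters.

```python
from collections import Counter


def word_tokens(sentence):
    """Split on whitespace, the usual unit for word-level BLEU."""
    return sentence.split()


def char_tokens(sentence):
    """Treat every non-space character as its own token (character-level)."""
    return [ch for ch in sentence if not ch.isspace()]


def ngram_matches(hyp, ref, n):
    """Count clipped n-gram matches between two token lists."""
    hyp_ngrams = Counter(tuple(hyp[i:i + n]) for i in range(len(hyp) - n + 1))
    ref_ngrams = Counter(tuple(ref[i:i + n]) for i in range(len(ref) - n + 1))
    return sum((hyp_ngrams & ref_ngrams).values())


# Hypothetical sample sentences, chosen only for illustration.
arabic = "ذهب الولد إلى المدرسة"   # four space-separated words
chinese_a = "我喜欢看书"             # no spaces between words
chinese_b = "我喜欢读书"             # differs from chinese_a in one character

# Arabic splits into words just like English does.
arabic_words = word_tokens(arabic)

# Word level: each Chinese sentence is one big token, so nothing matches.
chinese_word_matches = ngram_matches(
    word_tokens(chinese_a), word_tokens(chinese_b), 1
)

# Character level: the shared characters are counted as matches.
chinese_char_matches = ngram_matches(
    char_tokens(chinese_a), char_tokens(chinese_b), 1
)

print(len(arabic_words), chinese_word_matches, chinese_char_matches)
```

With these samples, the Arabic sentence yields four word tokens, while the two Chinese sentences share no word-level unigrams but do share most of their characters. That is the situation character-level BLEU is meant to handle.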
