ci: include newer versions of languagetool in tests #126
I can't reproduce the failing test by running the Docker image locally, so I added some minor code to output the errors. By the way, what do you think about throwing in a |
Hmm - this time |
CodSpeed Performance Report: merging #126 will not alter performance. |
Looks good! Failing checks are not relevant for this PR :-) |
No clue, but there is no reason why it should fail. The one exception is that LT could change and detect new errors that were not covered before, which could break our tests, but not the API itself. LT hasn't changed its API in almost two years, I think. |
I rarely use Docker, but I am open to better / more reproducible ways to self-host a server. The current |
Probably not necessary, I'm just a big fan of

version: "3"
services:
  languagetool:
    image: erikvl87/languagetool:latest
    ports:
      - 8010:8010
    environment:
      - langtool_languageModel=/ngrams
      - Java_Xms=512m
      - Java_Xmx=1g
    volumes:
      - ./data:/ngrams
Nice! Is it possible to opt out of the ngrams? It is a good default, but also takes more memory I guess. |
I won't lie, I don't know what that is, as I'm not too familiar with languagetool itself. I just grabbed the default docker compose file that was given for some languagetool docker image. However, reducing it to this still seems to work and allows all the tests to pass:

version: "3"
services:
  languagetool:
    image: erikvl87/languagetool:latest
    ports:
      - 8010:8010
    environment:
      - Java_Xms=512m
      - Java_Xmx=1g

docker compose up -d
LANGUAGETOOL_HOSTNAME=http://localhost LANGUAGETOOL_PORT=8010 cargo nextest run --all-features
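
For readers wondering how the two environment variables in that command fit together, here is a minimal Rust sketch. It assumes the test suite simply joins `LANGUAGETOOL_HOSTNAME` and `LANGUAGETOOL_PORT` into a base URL; the helper names `base_url` and `base_url_from_env`, and the fallback defaults, are hypothetical and not taken from the crate.

```rust
use std::env;

/// Join a hostname (including its scheme) and a port into a base URL.
/// Hypothetical helper: an empty port means "use the hostname as-is".
fn base_url(hostname: &str, port: &str) -> String {
    // Drop a trailing slash so we never produce "http://localhost/:8010".
    let host = hostname.trim_end_matches('/');
    if port.is_empty() {
        host.to_string()
    } else {
        format!("{host}:{port}")
    }
}

/// Read the two variables used in the command above.
/// The fallbacks are an assumption matching the compose file's port mapping.
fn base_url_from_env() -> String {
    let hostname = env::var("LANGUAGETOOL_HOSTNAME")
        .unwrap_or_else(|_| "http://localhost".to_string());
    let port = env::var("LANGUAGETOOL_PORT").unwrap_or_else(|_| "8010".to_string());
    base_url(&hostname, &port)
}
```

With the values from the command above, `base_url("http://localhost", "8010")` yields `http://localhost:8010`, which is the address the container's `8010:8010` port mapping exposes.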
N-grams are sequences of n grammar tokens (not sure about the word) found in a given language, together with the probability of finding each sequence. The ngram dataset contains many such examples to help LT determine whether a given sequence of words makes sense. To summarize, this is an opt-in LT feature that makes error checking much better, at the cost of storing those large dataset files. |
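
To make the idea concrete, here is a toy Rust sketch of n-grams with n = 2 (bigrams). It only illustrates the general concept of counting word sequences and turning counts into relative frequencies; it is not how LT implements or stores its ngram data, and the function names are made up for this example.

```rust
use std::collections::HashMap;

/// Count how often each pair of adjacent words (a bigram) occurs in `corpus`.
fn bigram_counts(corpus: &str) -> HashMap<(String, String), usize> {
    let words: Vec<String> = corpus
        .split_whitespace()
        .map(|w| w.to_lowercase())
        .collect();
    let mut counts = HashMap::new();
    // `windows(2)` yields every pair of neighbouring words.
    for pair in words.windows(2) {
        *counts
            .entry((pair[0].clone(), pair[1].clone()))
            .or_insert(0) += 1;
    }
    counts
}

/// Relative frequency of one bigram among all bigrams seen in the corpus.
fn bigram_frequency(
    counts: &HashMap<(String, String), usize>,
    first: &str,
    second: &str,
) -> f64 {
    let total: usize = counts.values().sum();
    if total == 0 {
        return 0.0;
    }
    let seen = counts
        .get(&(first.to_string(), second.to_string()))
        .copied()
        .unwrap_or(0);
    seen as f64 / total as f64
}
```

On the corpus `"their house is big and their house is old"`, the bigram `their house` appears in 2 of the 8 bigrams, while `there house` never appears. A checker with access to such frequencies can flag the second as unlikely, and a real dataset is large because it has to cover far more sequences than this.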
Ah ok, thanks for explaining |
Just adding in the newer versions as requested. Hopefully we can also use this branch to figure out why the tests for the latest version were/are failing.