ci: include newer versions of `languagetool` in tests #126

Rolv-Apneseth · 2024-11-16T14:57:10Z

Just adding in the newer versions as requested. Hopefully we can also use this branch for figuring out why the tests for the latest version was/is failing

…ling CI tests

Rolv-Apneseth · 2024-11-16T15:45:23Z

I can't reproduce the failing test by running the docker image locally so I added some minor code for outputting the errors.

By the way, what do you think about throwing in a docker-compose.yml file into the repo for easier self-hosting of the API?

Rolv-Apneseth · 2024-11-16T15:47:54Z

Hmm - this time 6.5 and latest passed. Any clue why?

codspeed-hq · 2024-11-16T15:52:59Z

CodSpeed Performance Report

Merging #126 will not alter performance

_{Comparing Rolv-Apneseth:test-later-languagetool-versions (4838e40) with v3 (a90bd15)}

Summary

✅ 6 untouched benchmarks

jeertmans · 2024-11-16T16:56:29Z

Looks good! Failing checks are not relevant for this PR :-)

jeertmans · 2024-11-16T16:59:05Z

Hmm - this time 6.5 and latest passed. Any clue why?

No clue, but there is no reason why it should fail (the exception is that LT could change detect new errors that were not covered before, which could break our tests, but not the API itself). LT hasn't changed its API in almost two years, I think.

jeertmans · 2024-11-16T17:00:07Z

I can't reproduce the failing test by running the docker image locally so I added some minor code for outputting the errors.

By the way, what do you think about throwing in a docker-compose.yml file into the repo for easier self-hosting of the API?

I rarely use Docker, but I am open to better / more reproducible ways to self-host a server. The current ltrs docker commands are quite simple, and probably not bulletproof.

Rolv-Apneseth · 2024-11-16T17:14:52Z

I rarely use Docker, but I am open to better / more reproducible ways to self-host a server. The current ltrs docker commands are quite simple, and probably not bulletproof.

Probably not be necessary, I'm just a big fan of docker compose due to the declarative style. I can just define everything that's needed from a single yaml. I'll just leave the content here for future reference in case (I'll probably come back to find this):

version: "3"

services:
  languagetool:
      image: erikvl87/languagetool:latest
      ports:
      - 8010:8010
      environment:
      - langtool_languageModel=/ngrams
      - Java_Xms=512m
      - Java_Xmx=1g
      volumes:
      - ./data:/ngrams

jeertmans · 2024-11-16T21:10:15Z

I rarely use Docker, but I am open to better / more reproducible ways to self-host a server. The current ltrs docker commands are quite simple, and probably not bulletproof.

Probably not be necessary, I'm just a big fan of docker compose due to the declarative style. I can just define everything that's needed from a single yaml. I'll just leave the content here for future reference in case (I'll probably come back to find this):
version: "3"

services:
  languagetool:
      image: erikvl87/languagetool:latest
      ports:
      - 8010:8010
      environment:
      - langtool_languageModel=/ngrams
      - Java_Xms=512m
      - Java_Xmx=1g
      volumes:
      - ./data:/ngrams

Nice! Is it possible to opt out of the ngrams? It is a good default, but also takes more memory I guess.

Rolv-Apneseth · 2024-11-16T22:44:55Z

Nice! Is it possible to opt out of the ngrams? It is a good default, but also takes more memory I guess.

I won't lie I don't know what that is as I'm not too familiar with languagetool itself, I just grabbed the default docker compose that was given for some languagetool docker image. However, reducing it to this still seems to work and allows all the tests to pass:

version: "3"

services:
  languagetool:
      image: erikvl87/languagetool:latest
      ports:
      - 8010:8010
      environment:
      - Java_Xms=512m
      - Java_Xmx=1g

docker compose up -d
LANGUAGETOOL_HOSTNAME=http://localhost LANGUAGETOOL_PORT=8010 cargo nextest run --all-features

jeertmans · 2024-11-17T09:36:22Z

N grams are sequences of n grammar tokens (not sure about the work) that you find in a given language and the probability to find it. The ngram dataset contains many such examples in order to help LT determine if a given sequence of words makes sense or not.

To summarize, this is an opt in feature from LT that makes error checking much better, at the cost of storing those large dataset files.

Rolv-Apneseth · 2024-11-17T09:58:55Z

N grams are sequences of n grammar tokens (not sure about the work) that you find in a given language and the probability to find it. The ngram dataset contains many such examples in order to help LT determine if a given sequence of words makes sense or not.

To summarize, this is an opt in feature from LT that makes error checking much better, at the cost of storing those large dataset files.

Ah ok, thanks for explaining

Rolv-Apneseth added 3 commits November 16, 2024 14:50

ci: include newer versions of languagetool in tests

1a7a2d2

tests: debug errors with server testing

a800a4c

ci: --no-capture for testing library code, for easier debugging fai…

4838e40

…ling CI tests

Rolv-Apneseth requested a review from jeertmans November 16, 2024 15:47

jeertmans merged commit ccb494a into jeertmans:v3 Nov 16, 2024
21 of 23 checks passed

jeertmans added the ci Continuous Integration related (GitHub actions, precommit, …) label Nov 16, 2024

Rolv-Apneseth deleted the test-later-languagetool-versions branch November 16, 2024 17:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: include newer versions of `languagetool` in tests #126

ci: include newer versions of `languagetool` in tests #126

Rolv-Apneseth commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

codspeed-hq bot commented Nov 16, 2024

jeertmans commented Nov 16, 2024

jeertmans commented Nov 16, 2024

jeertmans commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

jeertmans commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

jeertmans commented Nov 17, 2024

Rolv-Apneseth commented Nov 17, 2024

ci: include newer versions of languagetool in tests #126

ci: include newer versions of languagetool in tests #126

Conversation

Rolv-Apneseth commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

codspeed-hq bot commented Nov 16, 2024

CodSpeed Performance Report

Merging #126 will not alter performance

Summary

jeertmans commented Nov 16, 2024

jeertmans commented Nov 16, 2024

jeertmans commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

jeertmans commented Nov 16, 2024

Rolv-Apneseth commented Nov 16, 2024

jeertmans commented Nov 17, 2024

Rolv-Apneseth commented Nov 17, 2024

ci: include newer versions of `languagetool` in tests #126

ci: include newer versions of `languagetool` in tests #126