Initial German support #69

ischender · 2023-12-06T13:59:34Z

Adding preliminary support for German (and potentially other Western European languages).
Supported: reference_based, source_based, reference_free.
Missing only the factual_consistency notebooks, since I will need to find good data for that.

kennysong · 2023-12-07T02:36:08Z

Awesome work @ischender! When this is ready for review later, let me know and I'll assign a reviewer.

ischender · 2023-12-27T10:46:55Z

@yosukehigashi I am supposed to assign this to you, but I don't seem to be able to...

kennysong · 2023-12-27T10:49:05Z

Assigned to Yosuke! (Sorry, I assumed Alex would be able to set the reviewer)

yosukehigashi · 2023-12-27T14:48:51Z

Thanks for the massive contribution @ischender! I'll review this ASAP

yosukehigashi

Thanks for the great work!! Just a few minor comments, but overall looks great!

src/langcheck/metrics/de/_detoxify.py

src/langcheck/metrics/de/_tokenizers.py

src/langcheck/metrics/de/reference_based_text_quality.py

src/langcheck/metrics/de/source_based_text_quality.py

tests/metrics/de/test_reference_based_text_quality.py

src/langcheck/metrics/de/reference_free_text_quality.py

yosukehigashi · 2023-12-29T08:34:30Z

docs/notebooks/LangCheck German.ipynb

It'll be great if we could follow the format in the English LangCheck Quickstart notebook for this! I.e.

Cell 1: !pip install langcheck

Cell 2: Define a list of generated outputs (in German), and call fluency

Cell 3: Show the results of running a comparison (fluency > 0.5)

(and so on)

done, that actually made me realize that the Parrot model did not work as well as I previously thought, so now it's wrapped in a translation layer that gives seemingly good results (I used different translations engines not to pullute the tests)

Ah interesting!
Intuitively though, I'd expect something like fluency to be quite language-specific, and it seems hard to accurately assess fluency if you translate it to another language first? (On the other hand, something like factual_consistency feels more language-agnostic).

From a quick browse on huggingface, https://huggingface.co/EIStakovskii/bert-base-german-cased_fluency seems like it might be ok (although it's quite old so likely not state-of-the-art) - what do you think?

from the tests I ran, the fluency seems to match what I would expect, at least in the case of German, when the language structure is similar enough (the basic grammar of English is of a Germanic language, after all).

I will give this bert one a go too...

@yosukehigashi , I tried a quick and dirty implementation using "EIStakovskii/bert-base-german-cased_fluency" in a version of langcheck.metrics.de._fluency_local and ran a quick test...
It seems pretty bad.

While with the the translation we get what we expect:

Please note that the first 2 sentences are pretty much as bad in German as in English...
Meaning:

generated_outputs = [ 'Black cat the', 'The black cat is.', 'The black cat is sitting', 'The big black cat is sitting on the fence', 'Usually, the big black cat is sitting on the old wooden fence.' ]

As you can see, they get mostly the same value:

(interestingly, the translation was done with different engines)

Parrot, that has SOME multilingual properties since it's based on T5 (not mT5 though) does worse than translation (the reason I switched to translations) but better than the bert-based model

Thoughts?

Thanks for doing this analysis! As we said on Discord, let's go with this approach for now since it's performing the best.
Could you create an issue (something like "Consider computing German fluency without translating to English") and add a link to this comment thread, so that this doesn't get buried?

ischender · 2024-01-03T12:48:54Z

Thanks for the great work!! Just a few minor comments, but overall looks great!

thank you, I will be on the suggestions as of next week

Co-authored-by: Yosuke Higashi <[email protected]>

yosukehigashi · 2024-01-10T23:50:27Z

@yosukehigashi I should have implemented all the suggestions/changes, unless something managed to slip by (sorry if to). Can you have a look when you can?

Thanks for the updates! I'll take a look later today

docs/metrics.md

yosukehigashi · 2024-01-11T07:36:06Z

docs/notebooks/LangCheck German.ipynb

Ah interesting!
Intuitively though, I'd expect something like fluency to be quite language-specific, and it seems hard to accurately assess fluency if you translate it to another language first? (On the other hand, something like factual_consistency feels more language-agnostic).

From a quick browse on huggingface, https://huggingface.co/EIStakovskii/bert-base-german-cased_fluency seems like it might be ok (although it's quite old so likely not state-of-the-art) - what do you think?

tests/metrics/de/test_reference_based_text_quality.py

src/langcheck/metrics/de/_detoxify.py

Co-authored-by: Yosuke Higashi <[email protected]>

…upport

yosukehigashi

Just a few questions about the Translate class, but otherwise LGTM!

README_de.md

src/langcheck/metrics/de/_translation.py

Co-authored-by: Yosuke Higashi <[email protected]>

…nto pr/ischender/69

yosukehigashi

Thanks for all of your work on this!!🚀 Looks good to merge once the last two comments are resolved

src/langcheck/metrics/de/_translation.py

yosukehigashi · 2024-01-19T02:04:06Z

docs/notebooks/LangCheck German.ipynb

Thanks for doing this analysis! As we said on Discord, let's go with this approach for now since it's performing the best.
Could you create an issue (something like "Consider computing German fluency without translating to English") and add a link to this comment thread, so that this doesn't get buried?

yosukehigashi · 2024-01-22T02:20:36Z

Looks like all comments have been resolved so I'll merge this now. Thanks @ischender!🇩🇪

ischender added 7 commits December 5, 2023 12:22

first basic implementation of reference free - DE

f5758a6

fizing tokenizers and adding a rougeL example

5f7efac

adding reference based German tests

34a6911

fixing tests, and fixing import bugs in DE

d2aa437

Merge branch 'main' into de-support

20f02d3

adding notes re: case sensitivity

7fdbb58

removing autoformatting

e61da5b

ischender and others added 11 commits December 19, 2023 16:26

Merge branch 'citadel-ai:main' into de-support

a0824e4

adapting source based reference from ZH

07225f8

fixing error in typing in python 3.8

f11bd02

fixing formatting issues

2b5cb31

adding source based tests + context_relevance

d3634cd

first implementation reference free, no tests

5bb01b2

small fixes

a30d2b6

style fixes

abfc265

first round of tests for reference free

0a4d6db

finalizing tests for reference free

08cdb4c

Merge branch 'citadel-ai:main' into de-support

1cfe04b

kennysong requested a review from yosukehigashi December 27, 2023 10:48

yosukehigashi requested changes Dec 29, 2023

View reviewed changes

ischender and others added 5 commits January 4, 2024 14:47

Merge branch 'main' into de-support

4cb5628

moving and unifying Detoxify to metrics from /de and /en

32baf5b

fixing typos and commented out code from the PR

f0795fd

Update src/langcheck/metrics/de/reference_based_text_quality.py

1cbe428

Co-authored-by: Yosuke Higashi <[email protected]>

Update src/langcheck/metrics/de/reference_based_text_quality.py

306bbea

Co-authored-by: Yosuke Higashi <[email protected]>

ischender added 2 commits January 10, 2024 16:50

updated table for metrics

47b4ee0

fixing types problems

8b52857

yosukehigashi added 3 commits January 11, 2024 06:52

fix notebook name

97468b3

use de fluency

14b549c

fix docstring

458ec4d

yosukehigashi reviewed Jan 11, 2024

View reviewed changes

ischender and others added 11 commits January 11, 2024 14:43

Update docs/metrics.md

bdf638c

Co-authored-by: Yosuke Higashi <[email protected]>

German documentation + screenshots

e154723

adding German (ドイツ語???) to the Japanese documentation

285752a

Merge branch 'de-support' of github.com:ischender/langcheck into de-s…

b8a1b8e

…upport

adding a translation function to wrap up longer texts

a419d2b

translating factual consistency data

09adbc9

fixing translation wrapper

443ca7b

fixing translation wrapper

0dca8ba

moving translation wrapper to a file, adding tests

4eb0676

removing debugging print statements

bf5cd21

format json documents

7a3fbff

yosukehigashi reviewed Jan 18, 2024

View reviewed changes

ischender and others added 6 commits January 18, 2024 09:38

changes from PR comments

d2a7acf

Update src/langcheck/metrics/de/_translation.py

cacf3bf

Co-authored-by: Yosuke Higashi <[email protected]>

fixing flake8 formatting issue

2e79a93

Merge branch 'de-support' of https://github.com/ischender/langcheck i…

8ea3a4f

…nto pr/ischender/69

small corrections to make pyright happy

2c66b5a

changing block size as per PR suggestion

82b947f

yosukehigashi approved these changes Jan 19, 2024

View reviewed changes

updating comment to describe min block size

e21a2d7

ischender mentioned this pull request Jan 21, 2024

Consider computing German fluency without translating to English #78

Open

yosukehigashi merged commit 6eba3bb into citadel-ai:main Jan 22, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial German support #69

Initial German support #69

ischender commented Dec 6, 2023 •

edited

Loading

kennysong commented Dec 7, 2023

ischender commented Dec 27, 2023

kennysong commented Dec 27, 2023

yosukehigashi commented Dec 27, 2023

yosukehigashi left a comment

yosukehigashi Dec 29, 2023

ischender Jan 10, 2024

yosukehigashi Jan 11, 2024

ischender Jan 11, 2024

ischender Jan 17, 2024

yosukehigashi Jan 19, 2024

ischender commented Jan 3, 2024

yosukehigashi commented Jan 10, 2024

yosukehigashi Jan 11, 2024

yosukehigashi left a comment

yosukehigashi left a comment

yosukehigashi Jan 19, 2024

yosukehigashi commented Jan 22, 2024

Initial German support #69

Initial German support #69

Conversation

ischender commented Dec 6, 2023 • edited Loading

kennysong commented Dec 7, 2023

ischender commented Dec 27, 2023

kennysong commented Dec 27, 2023

yosukehigashi commented Dec 27, 2023

yosukehigashi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ischender commented Jan 3, 2024

yosukehigashi commented Jan 10, 2024

Choose a reason for hiding this comment

yosukehigashi left a comment

Choose a reason for hiding this comment

yosukehigashi left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

yosukehigashi commented Jan 22, 2024

ischender commented Dec 6, 2023 •

edited

Loading