DCAT-AP-CH Language vs. Opendata.swiss vs. I14Y vs. International #223

Open
fabian-santi opened this issue Jun 3, 2024 · 2 comments
Labels: discussion needed, Open

@fabian-santi

Notes on the differing treatment of language codes:

1 opendata.swiss defines the language as an ISO 639-1 two-letter code
See https://handbook.opendata.swiss/de/content/glossar/bibliothek/dcat-ap-ch.html#dcat-dataset-language

2 DCAT-AP-CH defines the language as a [VOCAB-EU-LANGUAGE] URI based on a three-letter code
See https://www.dcat-ap.ch/releases/2.0/dcat-ap-ch.html#dataset-language

3 DCAT (w3c)
https://www.w3.org/TR/vocab-dcat-3/#Property:resource_language
Resources defined by the Library of Congress (ISO 639-1, ISO 639-2) SHOULD be used.
If an ISO 639-1 (two-letter) code is defined for the language, then its corresponding IRI SHOULD be used; if no ISO 639-1 code is defined, then the IRI corresponding to the ISO 639-2 (three-letter) code SHOULD be used.
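
The W3C rule quoted above can be sketched in a few lines of Python. This is only an illustration: the function name `dcat_language_iri` is hypothetical, and the IRI pattern under `http://id.loc.gov/vocabulary/` is an assumption about how the Library of Congress publishes its ISO 639 vocabularies and should be checked before use.

```python
from typing import Optional

# Assumption: base IRI of the Library of Congress ISO 639 vocabularies.
LOC_BASE = "http://id.loc.gov/vocabulary"


def dcat_language_iri(two_letter: Optional[str], three_letter: str) -> str:
    """Pick the IRI following the DCAT 3 recommendation.

    Prefer the ISO 639-1 (two-letter) IRI when such a code exists,
    otherwise fall back to the ISO 639-2 (three-letter) IRI.
    """
    if two_letter:
        return f"{LOC_BASE}/iso639-1/{two_letter.lower()}"
    return f"{LOC_BASE}/iso639-2/{three_letter.lower()}"


# German has a two-letter code, so the ISO 639-1 IRI is chosen.
german = dcat_language_iri("de", "ger")
# Klingon has no two-letter code, so the ISO 639-2 IRI is chosen.
klingon = dcat_language_iri(None, "tlh")
```

Note that this rule yields two-letter IRIs for the common Swiss languages, which is a different result from the EU vocabulary URIs that DCAT-AP-CH prescribes.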

4 DCAT-AP v3
https://www.dublincore.org/specifications/dublin-core/dcmi-terms/#language
ISO 639-2 or ISO 639-3

5 I14Y currently uses the two-letter code

Special languages
• Special languages such as Klingon (tlh) are represented only by a three-letter code and are permitted, for example, in the EU vocabulary
http://publications.europa.eu/resource/authority/language/TLH

Several resource properties are affected
• Catalog.language
• Dataset.language
• Distribution.language
• …

Which language code is correct is not easy to answer across the various levels of standardisation.
The three-letter code is certainly the most universal. Adopting it would mean replacing the two-letter code on opendata.swiss and I14Y.
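
To make the proposed replacement concrete, here is a minimal sketch of converting two-letter codes to EU vocabulary URIs. The lookup table covers only a handful of languages chosen for illustration, and the names `TWO_LETTER_TO_EU` and `to_eu_language_uri` are hypothetical. The URI prefix is the one from the Klingon example above.

```python
EU_LANGUAGE_BASE = "http://publications.europa.eu/resource/authority/language/"

# Illustrative excerpt only; a real migration would need the full code table.
TWO_LETTER_TO_EU = {
    "de": "DEU",
    "fr": "FRA",
    "it": "ITA",
    "en": "ENG",
    "rm": "ROH",
}


def to_eu_language_uri(code: str) -> str:
    """Convert an ISO 639-1 two-letter code to an EU vocabulary URI."""
    key = code.strip().lower()
    if key not in TWO_LETTER_TO_EU:
        # Fail loudly instead of guessing a three-letter code.
        raise KeyError(f"no mapping known for language code {code!r}")
    return EU_LANGUAGE_BASE + TWO_LETTER_TO_EU[key]


uris = [to_eu_language_uri(c) for c in ("de", "fr", "it", "rm")]
```

Raising on unknown codes is a deliberate choice in this sketch: a two-letter code cannot be turned into a three-letter code by string manipulation, so every mapping has to come from a lookup.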

mispichtig added the Open and discussion needed labels on Jul 15, 2024
@mispichtig
Collaborator

mispichtig commented Aug 12, 2024

Regarding point 4 (DCAT-AP v3): DCAT-AP v3.0.0 also uses the EU Vocabularies Languages Named Authority List for the property dct:language, see: https://semiceu.github.io/DCAT-AP/releases/3.0.0/#controlled-vocabularies-to-be-used. So I would propose staying with the EU vocabulary and replacing the ISO 639-1 two-letter codes in I14Y and opendata.swiss with it.
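
Before replacing anything, it may help to audit which form existing language values take. The following is a rough sketch, not an official validator: the function name and category labels are hypothetical, and the pattern assumes that EU vocabulary language URIs end in exactly three uppercase letters, which may not hold for every entry in the authority list.

```python
import re

# Assumption: EU language URIs end in exactly three uppercase letters.
EU_URI = re.compile(
    r"http://publications\.europa\.eu/resource/authority/language/[A-Z]{3}"
)


def classify_language_value(value: str) -> str:
    """Return a coarse category for a dct:language value."""
    v = value.strip()
    if EU_URI.fullmatch(v):
        return "eu-uri"
    if re.fullmatch(r"[a-z]{2}", v):
        return "two-letter"
    if re.fullmatch(r"[a-z]{3}", v):
        return "three-letter"
    return "other"


samples = [
    "de",
    "tlh",
    "http://publications.europa.eu/resource/authority/language/DEU",
]
categories = [classify_language_value(s) for s in samples]
```

The check is purely syntactic: it does not confirm that a code or URI actually exists in the respective code list.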

@taniahumar

Side note for geodata: geocat.ch / GM03 use ISO 639-2 alpha-3 (three-letter) codes (https://www.loc.gov/standards/iso639-2/php/code_list.php).
