Request for review | GEMET --> SWEET concept match candidates #272
brandonnodnarb
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
request
The SWEET community are seeking reviews of candidate matches between SWEET and General Multilingual Environmental Thesaurus (GEMET). This review is ultimately dual purpose as it will facilitate adding existing GEMET definitions to SWEET concepts (per #211) as well as facilitate the process of mapping between the two vocabularies.
background
The SequenceMatcher class from the Python package difflib was used to generate a similarity score between a pairwise evaluation of each SWEET concept label (
rdfs:label
) against every GEMET concept label (skos:prefLabel
). Only results with an arbitrarily determined similarity score of 0.90 or better were returned and included in the spreadsheets.An initial cursory scan for false positive matches (e.g. adsorption != absorption) has been completed. These predominantly false positive results are in a second tab in each spreadsheet titled 'CONCEPT SCHEME'_removed.
reviewing
For those willing to review concepts, please use the comments field to capture anything specific to a record. If you have reviewed a concept pairing and:
please then add your name (or ORCID) to one of the reviewer cells for that concept. If the concept is determined to be a match you are done. If the concept is determined to be NOT a match, please cut and paste the entire row to the spreadsheet's '_removed' tab.
One aspect of the hub concept will be highlighting the range of precision in definitions. As such, please consider the broadest plausible match scenario for all concepts during this initial review.
When two reviewers have completed their assessment and agreed it a plausible match (or NOT), that concept row can be considered completed. If a potential match has conflicting reviews, or needs discussion for any reason, please use this thread or the sweetontology ESIP slack channel for discussion. Any concept needing further deliberation will be added as an agenda item for the STC monthly meeting.
There are currently two reviewer columns. Please do add more reviewer columns if needed; using two as a minimum, but more are welcome.
results for review
The following table has total number of candidate SWEET matches for GEMET and initial observations (comments). Concept schemes are linked to their respective google spreadsheet which will be used to facilitate the review.
'matches'
Beta Was this translation helpful? Give feedback.
All reactions