Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need to eliminate substring matches #58

Open
cmungall opened this issue Feb 19, 2021 · 0 comments
Open

Need to eliminate substring matches #58

cmungall opened this issue Feb 19, 2021 · 0 comments

Comments

@cmungall
Copy link

cmungall commented Feb 19, 2021

With this admittedly silly query I get spurious matches with MEDIUM confidence

curl -L -s 'http://www.ebi.ac.uk/spot/zooma/v2/api/services/annotate?propertyValue=range+600-1K' | jq '.[] | .confidence, .semanticTags, .annotatedProperty.propertyValue'
"MEDIUM"
[
  "http://purl.obolibrary.org/obo/NCIT_C38013"
]
"Range"
"MEDIUM"
[
  "http://purl.obolibrary.org/obo/NCIT_C134240"
]
"Analytical Procedure Range"

the first matches a substring. The second is a match with one string in common.

Why are these MEDIUM and not LOW? Is there a way to eliminate substring or overlapping string matches?

@henrietteharmse henrietteharmse added this to the Resilient searches milestone Mar 3, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants