-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mondo-base.obo
: MONDO IDs do not match between $xref
and $id
#6873
Comments
Hey, I think this may have gotten buried amongst the other Issues, but wanted to check in and see if anyone has looked into this yet. @twhetzel Thanks for your help! -Brian |
Hi Brian - thanks for the ping on this issue. I've been looking into this and have a few questions and comments. What is the source of the Mondo ontology file used? I've looked at the number of terms without labels or definitions in the latest release of Mondo and do not get the same count as mentioned in the initial post. In the "xref_ids.csv" file, the Mondo IDs vary in length from 13, 14, and 15 characters long while they should be 13 characters long (MONDO: and then a 7 digit number). Amongst the IDs that are 13 characters long, there are at least a few obsolete terms in the file, e.g. MONDO:8000034. Deprecated terms in MONDO have the annotation owl:deprecated with value true [1] and of the Mondo terms with xrefs in the latest release, 3,210 of these are obsoleted. For other valid IDs in the "xref_ids.csv" file, e.g. MONDO:0000005, what are the values you get for $xref and $id? |
@matentzn have you taken over this ticket since you created a meeting with Brian? |
I Never saw this, and I didn't realise @bschilder was interested in Mondo at all! Yep I can discuss it with him when I meet him. |
Oh, I thought your ping to him here (neurogenomics/RareDiseasePrioritisation#33 (comment)) was related to Mondo as well |
I guess now it it but originally no, it was only referring to uPheno! |
We determined this is not a Mondo related problem, but related to the r toolkit! |
Indeed, it seems something strange is going on within For example, the Mondo IDs within While I'm still trying to sort out the exact reason for this issue, I think |
Thanks both for the update. Please see other issues with the IDs as mentioned earlier #6873 (comment) |
Yeah I confirmed that with Brian in the meeting, many ids don't even exist. He will look into it! |
@twhetzel here's a reprex you can quickly run in R: > if(!require("ontologyIndex")) install.packages("ontologyIndex")
> mondo <- ontologyIndex::get_OBO("https://github.com/monarch-initiative/mondo/releases/download/v2023-09-12/mondo-base.obo", extract_tags = "everything")
> mondo$xref["MONDO:0000005"]
$`MONDO:0000005`
[1] "OMIMPS:203655"
> mondo$id["MONDO:0000005"]
MONDO:0000005
"MONDO:0000005" |
Tagging who i think may be the author of |
@bschilder that looks correct for |
Sure thing, I've already started working on alternative methods for converting IDs across ontologies using some of Monarch's resources. I'll keep your team posted on how things progress: |
I'm using the "mondo-base.obo" object read into R as an
ontologyIndex
object.In working with this object, I've noticed some unexpected features. Namely, the MONDO IDs in the
$xref
slot do not seem to match up with those listed in the$id
slot. I'd expect that all MONDO IDS in any of the slots should at least be present in the$id
slot.Mondo term (ID Label)
xref_ids.csv
Bug/Typo/Error description
Your nano-attribution (ORCID)
https://orcid.org/0000-0001-5949-2191
Thanks in advance for your help.
The text was updated successfully, but these errors were encountered: