Skip to content

EBISPOT/EFO-UKB-mappings

Repository files navigation

Mapping UK Biobank to the Experimental Factor Ontology (EFO)

UK Biobank is a massive datasource for population health data used extensively in research. Some of the clinical data is mapped to ICD-10, but coverage is incomplete, as are mappings from existing ICD-10 codes to public ontologies. Here, we describe a pipeline to map 1,552 UK Biobank traits to public ontology terms via the Experimental Factor Ontolo- gy. Our approach uses ontology mapping services and man- ual curation to provide almost complete coverage (97%), thus increasing interoperability of UK Biobank with public datasets. Both mappings and services described are freely available for use and the mapping pipeline represents a typi- cal curation workflow that can be adopted for other domains.

This repository contains:

  • The mapping masterfile TSV containing all up to date mappings of UK Biobank and ICD10 to EFO mappings.
  • The Zooma dataset CSV, containing all mappings in the Zooma format. The mappings are available in Zooma.
  • The ISMB ECCB 2019 paper outlining our mapping process and results.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •