Skip to content

Latest commit

 

History

History
33 lines (30 loc) · 4.14 KB

README.md

File metadata and controls

33 lines (30 loc) · 4.14 KB

Biomedical Data Commons (BMDC) Schema MCF Files

This directory contains the MCF nodes for all defined domain specific schemas in Biomedical Data Commons. These files are kept in-sync with the Google repository via Copybara. Changes inside Google are immediately copied here. Approved GitHub pull requests are sent to the Google respository, where it is tested; if approved, the PR will merge into both the Google and GitHub repository.

Overview

  • GeneticVariant_GenVarSource_enums.mcf contains GenVarSourceEnum classes generated by script format_dbSNP_GenVarSource_enum_schema.py.
  • GeneticVariant_alt_id_database_properties.mcf contains GeneticVariant properties generated by script format_dbSNP_alt_ID_database_property_schema.py.
  • [biomedical_stat_vars.mcf] contains StatisticalVariable schema specific to Biomedical Data Commons.
  • [biological_taxonomy.mcf] contains schema for the following classes: BiologicalEntity, Taxon, Species.
  • [biological_taxonomy_enum] contains schema for enumerations which populate Taxon properties in biological_taxonomy.mcf.
  • chemical_compound.mcf contains schema for classes: ActiveIngredientAmount, AnatomicalTherapeuticChemicalCode, Antibody, BiomedicalEntity, ChemicalCompound, ChemicalCompoundAssociation, ChemicalCompoundDiseaseTreatment, ChemicalCompoundDiseaseContraindication, ChemicalCompoundGeneAssociation, ChemicalCompoundGeneticVariantAssociation, ChemicalCompoundProteinInteraction, Drug, DrugStrength, FDAApplication, HumanProteinOccurrence, Protein, ProteinProteinInteraction, and USAdoptedNameStem.
  • chemical_compound_enum.mcf contains schema of enummerations, which populate properties in chemical_compound.mcf.
  • disease.mcf contains schema for classes: Disease, DiseaseAssociation, DiseaseDiseaseAssociation, DiseaseGeneAssociation, DiseaseSymptomAssociation, DiseaseGeneticVariantAssociation, MeSHConcept, MeSHDescriptor, MeSHQualifier, MeSHRecordType, MeSHSupplementaryConceptRecord, and MeSHTerm.
  • disease_enum.mcf schema of enummerations, which populate properties in disease.mcf.
  • encode.mcf contains schema for ENCODE data.
  • genome_annotation.mcf contains schema for classes: Allele, BasePairs, Chromosome, Gene, GeneGeneAssociation, GeneGeneticVariantAssociation, GeneticAssociation, GeneticVariant, GeneticVariantGeneAssociation, GeneticVariantGeneticVariantAssociation, GenomeAnnotation, GenomeAssembly, GenomeAssemblyUnit, GenomicCoordinates, NonCodingRNA, Nucleotide, and RNATranscript.
  • genome_annotation_enum.mcf contains schema of enummerations, which populate properties in genome_annotation.mcf.
  • human_cell_type_enum.mcf contains HumanCellTypeEnum classes generated by script parse_protein_atlas.py.
  • human_tissue_enum.mcf contains HumanTissueEnum classes generated by script parse_protein_atlas.py.
  • interaction_type_enum.mcf contains classes of InteractionTypeEnum that is automatically generated by parse_ebi.py and populates the interactionType property.
  • pharmGKB_id_properties.mcf contains Gene and ChemicalCompound alternative identifier properties automatically generated from pharmGKB data using script drug_gene_relations/config.py from pharmGKB data. This was then manually modified to remove existing properties and curate property domains. -virus_taxonomic_ranking_enum.mcf contains enumerations generated by create_virus_taxonomic_ranking_enums.py as part of the ICTV Metadata Resource import.