-
Notifications
You must be signed in to change notification settings - Fork 4
Changelog
All notable changes to the database loaded using this project will be documented in this file.
The format is based on Keep a Changelog
- Data from 1000 Genomes GRCh38 and hg19 datasets (about 2,5k samples each)
- Updated GDC datasets to 2019-10
- Updated ENCODE datasets to 2020-01
- In the Biosample table we added the alt_biosample_source_id varchar field, which contains alternative identifiers for biological samples, derived from the source
- In the Donor table we added the alt_donor_source_id varchar field, which contains alternative identifiers for donors/patients, derived from the source
- In the Item table we added the alt_item_source_id varchar field, which contains alternative identifiers for items, derived from the source
-
Field mapped from TCGA source (GRCh38) to alt_biosample_source_id: biospecimen__bio__bcr_sample_barcode
-
Field mapped from TCGA source (hg19) to alt_biosample_source_id: biospecimen_sample__bcr_sample_barcode
-
Field mapped from TCGA source (GRCh38) to alt_donor_source_id: biospecimen__shared__bcr_patient_barcode
-
Field mapped from TCGA source (hg19) to alt_donor_source_id: clinical_patient__bcr_patient_barcode
-
Field mapped from TCGA source (GRCh38) to alt_item_source_id: gdc__aliquots__submitter_id
-
Field mapped from TCGA source (hg19) to alt_item_source_id: biospecimen_aliquot__bcr_aliquot_barcode
-
Field mapped from ENCODE source to antibody: antibody__dbxrefs (only first entry of the array, corresponding to id from https://antibodyregistry.org) or antibody__accession if the first is not available (this corresponds to the ENCODE DCC curated accession starting with "ENCAB")
-
Corrected Roadmap Epigenomics DNase-seq data target field. It was "DNase". Now set to NULL.
This corresponds to the first released version of the database
Usage
Supporting Tools