Skip to content
This repository has been archived by the owner on Jul 24, 2023. It is now read-only.

Latest commit

 

History

History
101 lines (58 loc) · 6.27 KB

update-existing-metadata.md

File metadata and controls

101 lines (58 loc) · 6.27 KB
layout title parent nav_order
default
Update Existing Metadata
Step 2 - Request a Metadata Template
2

Update Existing Metadata

Pre-requisites

Step-by-Step

Now you'd like to update your metadata in order to:

  • correct mistake(s)
  • provide further/change metadata to comply with a new iteration of the DCC data model affecting your datasets' metadata
  • provide metadata for files that have been added to your dataset

In this how-to, we'll be using an example clinical dataset named Clinical Family History located in a Synapse Project called HTAN HCA immune cells census. This is a dataset that's been annotated previously.

Before you start:

  • Please make sure that you get your existing metadata first and update it with any additional changes.
  • You can reuse existing templates as long as your dataset hasn't changed (i.e. files haven't been deleted or added). You could validate and submit without regenerating your templates every time.
  • If files were deleted, please ensure that the templates don't contain the records associated with these files.

To update your metadata:

  1. Access the Data Curator app

    • If you are prompted to login to Synapse, please use your Synapse account (or associated Google account).
  2. In the app, go to the "Select Your Dataset" section in the left-hand menu. From that page, select your project from the dropdown.

    • The project name corresponds to the bucket name (here HTAN HCA immune cells census).

    htan-app-update-existing-metadata-clinical

  3. Next, select your dataset, which corresponds to the folder name in your bucket (here Clinical_FamilyHistory).

    htan-app-update-existing-metadata-clinical

  4. Then, select the metadata template you would like to use (here Clinical Tier 1 - FamilyHistory). If you don't see the correct template for your dataset, you can select the "Minimal Metadata" template and contact your DCC liaison.

    htan-app-update-metadata-clinical-dataset-selection

  5. Once you have selected your dataset and metadata template, navigate to the "Get Metadata Template" section in the left-hand menu. Select the "Click to Generate Google Sheets Template" button.

    • This will generate a link to a Google spreadsheet containing an empty template for you to complete with metadata, for each of the files in your dataset. This can take awhile depending on how many files are in your folder, so please be patient!

    htan-app-generate-link-clinical

  6. Clicking on the generated template link will open up the template in Google Sheets.

    htan-app-generated-link-clinical

  7. All previously validated metadata is available. htan-app-previous-metadata

  8. Add a new row (in this case, a participant), but it can be other new information. clinical-metadata-htan-app-add-new-row

    • Note: you can also save the spreadsheet as a CSV file and use a method of your choice to fill it out. The metadata CSV will be validated by the Data Curator app before submission regardless of the method used to fill out the template.
  9. Once you've filled in the template, you can save your spreadsheet as a CSV (File -> Download -> Comma-separated Values...) clinical-metadata-htan-app-download

  10. Next, go gack to the Data Curator App and navigate to the "Submit & Validate Metadata" step in the left-hand sidebar. Click on the "Browse" button to upload your saved CSV. clinical-metadata-htan-app-upload

  11. Check the preview of your file to make sure everything looks correct. clinical-metadata-htan-app-preview

  12. Validate your CSV by clicking the "Validate Metadata" button. clinical-metadata-htan-app-validate

  13. Once validated, you can submit. clinical-metadata-htan-app-validated

  14. Click on the "Submit to Synapse" button. clinical-metadata-htan-app-submit-to-synapse

  15. Success!

    clinical-metadata-htan-app-success

  16. Check your metadata on Synapse. A link to where your metadata file lives is generated by the Data Curator App upon successful submission of your metadata.

  17. See your metadata in a table. You can also see your metadata in a table by navigating to the Tables tab of your project. There would be a table with your dataset name which you can query and view. clinical-metadata-htan-app-table

Please contact your DCC liaison if you cannot resolve a metadata error or have questions regarding metadata updates and submission.