layout | title | parent | nav_order |
---|---|---|---|
default |
Update Existing Metadata |
Step 2 - Request a Metadata Template |
2 |
- You are a certified user on Synapse
- You have transferred your dataset to the DCC
Now you'd like to update your metadata in order to:
- correct mistake(s)
- provide further/change metadata to comply with a new iteration of the DCC data model affecting your datasets' metadata
- provide metadata for files that have been added to your dataset
In this how-to, we'll be using an example clinical
dataset named Clinical Family History
located in a Synapse Project called HTAN HCA immune cells census
. This is a dataset that's been annotated previously.
Before you start:
- Please make sure that you get your existing metadata first and update it with any additional changes.
- You can reuse existing templates as long as your dataset hasn't changed (i.e. files haven't been deleted or added). You could validate and submit without regenerating your templates every time.
- If files were deleted, please ensure that the templates don't contain the records associated with these files.
To update your metadata:
-
Access the Data Curator app
- If you are prompted to login to Synapse, please use your Synapse account (or associated Google account).
-
In the app, go to the "Select Your Dataset" section in the left-hand menu. From that page, select your project from the dropdown.
- The project name corresponds to the bucket name (here
HTAN HCA immune cells census
).
- The project name corresponds to the bucket name (here
-
Next, select your dataset, which corresponds to the folder name in your bucket (here
Clinical_FamilyHistory
). -
Then, select the metadata template you would like to use (here
Clinical Tier 1 - FamilyHistory
). If you don't see the correct template for your dataset, you can select the "Minimal Metadata" template and contact your DCC liaison. -
Once you have selected your dataset and metadata template, navigate to the "Get Metadata Template" section in the left-hand menu. Select the "Click to Generate Google Sheets Template" button.
- This will generate a link to a Google spreadsheet containing an empty template for you to complete with metadata, for each of the files in your dataset. This can take awhile depending on how many files are in your folder, so please be patient!
-
Clicking on the generated template link will open up the template in Google Sheets.
-
Add a new row (in this case, a participant), but it can be other new information.
- Note: you can also save the spreadsheet as a CSV file and use a method of your choice to fill it out. The metadata CSV will be validated by the Data Curator app before submission regardless of the method used to fill out the template.
-
Once you've filled in the template, you can save your spreadsheet as a CSV (File -> Download -> Comma-separated Values...)
-
Next, go gack to the Data Curator App and navigate to the "Submit & Validate Metadata" step in the left-hand sidebar. Click on the "Browse" button to upload your saved CSV.
-
Check the preview of your file to make sure everything looks correct.
-
Validate your CSV by clicking the "Validate Metadata" button.
-
Success!
-
Check your metadata on Synapse. A link to where your metadata file lives is generated by the Data Curator App upon successful submission of your metadata.
-
See your metadata in a table. You can also see your metadata in a table by navigating to the
Tables
tab of your project. There would be a table with your dataset name which you can query and view.
Please contact your DCC liaison if you cannot resolve a metadata error or have questions regarding metadata updates and submission.