Skip to content

CSV file upload format

henrietteharmse edited this page Aug 17, 2022 · 4 revisions

The TSV file upload for Zooma consists of the following columns STUDY, BIOENTITY, PROPERTY_TYPE, PROPERTY_VALUE, SEMANTIC_TAG, ANNOTATOR, ANNOTATOR_DATE, with the fields defined as follows:

  • STUDY : An identifier to a study, publication or some other experimental dataset that this annotation occurs in. This field is required, but it can be empty.

  • BIOENTITY : An identifier to the individual data item that this annotation occurs in. Serves to group annotations together, which helps Zooma identify context-specific mappings. This is not strictly required, but if missing annotations are assumed to be "universal" and Zooma may be less confident when predicting future annotations.

  • PROPERTY_TYPE : the type of the PROPERTY (e.g. "disease"). Required, but can be empty.

  • PROPERTY_VALUE : is the word you would search for in zooma (e.g. "Thrombophilia due to activated protein C resistance"). Required, cannot be empty

  • SEMANTIC_TAG : is the id to which zooma would annotate the entered property value (e.g. "http://purl.obolibrary.og/obo/HP_0004936"). Required, but can be empty. Multiple ids can be split by | (pipe).

  • ANNOTATOR : the the name of the person who says that this PROPERTY_VALUE can be associated to this SEMANTIC_TAG (the person who created the csv file for example). Optional

  • ANNOTATOR_DATE : the date at which the annotation was done (e.g. "2013-11-22 12:00:00"). Optional

Clone this wiki locally