This repository implements a file converterfrom OSC-EM JSON to PDBx/mmCIF. The JSON schema is defined in OSCEM and the PDBx/mmCIF format in (PDBML Schema v50 ).
- converter executable
- with
--json
specify path to json file that contains metadata - with
--dic
specify path to the PDBx/mmCIF dictionary file - with
--conversions
specify path to conversions table. This table includes correspondance in names between OSC-EM and PDBx - with
--level
specify the json element name that contains metadata entries. For SciCat that is usually "scientificMetadata" - with
--append
specify if the metadata should be added to existing mmCIF to later deposit it in PDB - with
--mmCIFfile
specify the path to existing mmCIF file. Throws an error if --append is false and --mmCIFfile is not specified - with
--output
specify the file to write the newly created mmCIF with metadata entries
Converter explicitly parser through the PDBx definitions to extract as much data as possible. This allows for
- administrative categories sorted within mmCIF ( such as author information, grant, etc)
- em-related categories are sorted randomly, as there is no definitive sorting in PDB team as well
- file ends with information on atoms
- units in OSCEM definition are comared to PDBx ( converter for units will be implemented)
- numeric values are checked to be within a range allowed by PDBs
- values are checked to be within a list of attributes allowed by PDBx. This is additionally enhanced to match via regular expressions or ceratin logic.