This script is a helper that packages/publishes a GFF file, created by the MetaPUF pipeline, as an RO-Crate.
The schema of the RO-Crate is one that is supported by MGnify: it contextualises the GFF file with enough metadata that it can be displayed in the MGnify Assembly Analsysis Contig Viewer (an IGV.js instance) alongside other annotation tracks.
Python3 and pip.
This util is not part of the MetaPUF Snakemake pipeline itself.
cd utils/package_as_rocrate/
pip install -r requirements.txt
Call the package_metapuf_as_crates
script with the path to a GFF, and the PRIDE PXD accession of the dataset that generated the GFF.
E.g.:
python package_metapuf_as_crates.py examples/*.gff PXD005780
Optionally, an output directory may be specified other than the default ./crates
:
python package_metapuf_as_crates.py examples/*.gff PXD005780 --output_dir my-crates-folder
The resulting crates are .zip compressed archives, containing the GFF file alongside provenance metadata (the datasets and workflow information). This means the crate .zip files can be helpful for distributing results in a FAIR manner.
MGnify's website also supports these crates: e.g. for the example dataset, browse to Analysis page MGYA00579915 for ERZ1669337 and load the crate zip into the Contig browser's "Offline RO-Crate" feature.