All test samples and genome data are shared on our public DNAnexus project. You don't have to download any data for testing our pipeline on DNAnexus platform.
There are two methods to run our pipeline on DNAnexus.
- Building your own DX workflow from
atac.wdl
with dxWDL (CLI) - Using a pre-built DX workflow on our public DX project (Web UI)
This document describes instruction for the item 2).
-
Sign up for a DNAnexus account.
-
Create a new DX project by clicking on "+New Project" on the top left.
-
Move to one of the following workflow directories according to the platform you have chosen for your project (AWS or Azure). These DX workflows are pre-built with all parameters defined.
-
Copy it to your project by right-clicking on the DX workflow
atac
and choose "Copy". -
Choose your project and create a folder for the test run by clicking on the "Folder+" icon.
-
Click on "Copy into this folder" on the bottom left.
-
Move to the target folder and click on the DX workflow
atac
. -
Specify an output directory by clicking "Workflow Actions" on the top right. Click on "Set output folder" and choose an output folder.
-
Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.
-
It will take about an hour. You will be able to find all outputs on your output folder. Final QC report (
qc.html
)/JSON (qc.json
) will be found on it. -
See full specification for input JSON file.
-
DNAnexus allows only one copy of a workflow per project. The example workflow in the previous section is pre-built for the subsampled test sample ENCSR356KRQ with all parameters defined already.
-
Copy one of the following workflows according to the platform you have chosen for your project (AWS or Azure).
- AWS general without pre-defined reference genome.
- AWS hg38 with pre-defined hg38 reference genome.
- AWS hg19 with pre-defined hg19 reference genome.
- AWS mm10 with pre-defined mm10 reference genome.
- AWS mm9 with pre-defined mm9 reference genome.
- Azure general without pre-defined reference genome.
- Azure hg38 with pre-defined hg38 reference genome.
- Azure hg19 with pre-defined hg19 reference genome.
- Azure mm10 with pre-defined mm10 reference genome.
- Azure mm9 with pre-defined mm9 reference genome.
-
Click on the DX workflow
atac
. -
Specify your input files (FASTQs, BAMs, TAG-ALIGNs, ...) on the top left. For example, click on the item "fastqs_rep1_R1" and choose your R1 FASTQ file for replicate 1. See details here for other input types.
-
Choose a reference genome. See details here.
-
Click on "Run as Analysis..." and you will be automatically redirected to the "Monitor" tab.