Subgenomic RNAs Profile ANalysis (SPAN): Profiling of Noncanonical Subgenomic RNAs in SARS-CoV-2 Variants.
- SgRNA Profile ANalysis (SPAN) is a pipeline for profiling subgenomic RNAs (sgRNAs). It is accompanied by SPANviewer for data visualization. The pipeline is described in the manuscript "Profiling of Noncanonical Subgenomic RNAs in SARS-CoV-2 Variants", which is currently under submission.
- The pipeline first maps the reads to the SARS-CoV-2 reference genome and generates junction coordinates from the spliced alignments. A Python script then processes the junction coordinates to identify subgenomic RNAs, and the results are visualized with R Shiny. All scripts can be run independently.
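The junction-processing step can be illustrated with a minimal sketch. It is not the SPAN script itself: the `LEADER_END` and `MIN_READS` thresholds, the `(start, end, reads)` tuple layout, and the two class labels are assumptions made for illustration, and the criteria the pipeline actually uses to call canonical and noncanonical sgRNAs may differ.

```python
from collections import Counter

# Assumed values for illustration only; they are not taken from the SPAN scripts.
LEADER_END = 100  # hypothetical: 5' junction sites at or before this position count as leader-derived
MIN_READS = 2     # hypothetical minimum read support for a junction


def classify_junction(start, end, leader_end=LEADER_END):
    """Label a junction by whether its 5' site lies in the leader region."""
    if start >= end:
        raise ValueError("junction start must be before its end")
    if start <= leader_end:
        return "leader-dependent"
    return "leader-independent"


def summarize(junctions, min_reads=MIN_READS):
    """Sum read counts per class, skipping junctions with too little support."""
    totals = Counter()
    for start, end, reads in junctions:
        if reads >= min_reads:
            totals[classify_junction(start, end)] += reads
    return dict(totals)


# Made-up junctions as (5' site, 3' site, supporting reads).
example = [(65, 28255, 120), (69, 26237, 40), (27385, 27644, 5), (1000, 2000, 1)]
print(summarize(example))
```

Keeping the classification in a small pure function makes it easy to swap in a different rule, for example one that also checks the 3' site against annotated gene starts.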
- Linux or macOS
- R (version 4.1)
- Python (version 3.8.10)
Environment setup:
conda env create -f ./SPAN.yml -n SPAN
conda activate SPAN
- Before using SPANviewer, install the required R packages:
Rscript install.R
SPANviewer provides three pages:
- Venn Diagram page
- Junction Site Sashimi Plot page
- Boxplot page
- Activate conda environment:
conda activate SPAN
- Place the FASTQ files in the ~/SPAN/{variant_name} folder (e.g., retrieve SRA reads with the SRA Toolkit's fasterq-dump):
- Generate sample.txt:
python ./scripts/generate_sample_list.py
- Run the SPAN pipeline with Snakemake (-c1 uses one core):
snakemake -c1
- Data visualization:
Rscript ./R/SPANviewer.R
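
For the sample-list step above, the following is a rough sketch of how sample names could be collected from a variant folder. It is not the contents of `generate_sample_list.py`: the function name `list_samples`, the accepted file extensions, and the handling of `_1`/`_2` paired-end suffixes are assumptions, and the real format of sample.txt may differ.

```python
import tempfile
from pathlib import Path

# Hypothetical list of FASTQ extensions; longer suffixes come first so they match first.
FASTQ_SUFFIXES = (".fastq.gz", ".fq.gz", ".fastq", ".fq")


def list_samples(variant_dir):
    """Return sorted, de-duplicated sample names for the FASTQ files in one folder."""
    names = set()
    for path in Path(variant_dir).iterdir():
        for suffix in FASTQ_SUFFIXES:
            if path.name.endswith(suffix):
                stem = path.name[: -len(suffix)]
                # Paired-end files such as SRR000001_1.fastq and SRR000001_2.fastq
                # are assumed to belong to one sample.
                if stem.endswith(("_1", "_2")):
                    stem = stem[:-2]
                names.add(stem)
                break
    return sorted(names)


if __name__ == "__main__":
    # Demonstrate on a temporary folder with made-up file names.
    with tempfile.TemporaryDirectory() as tmp:
        for name in ("SRR000001_1.fastq", "SRR000001_2.fastq", "SRR000002.fastq.gz", "notes.txt"):
            (Path(tmp) / name).touch()
        print(list_samples(tmp))
```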
- Yu-Nong Gong
- Chia-Te Wang