longread-UMI-pipeline

A collection of scripts for processing longread UMI data. Tested on Linux 3.10.0

Citation:
Karst, S. M., Ziels, R. M., Kirkegaard, R. H., & Albertsen, M. (2019). Enabling high-accuracy long-read amplicon sequences using unique molecular identifiers and Nanopore sequencing. bioRxiv, 645903. https://www.biorxiv.org/content/10.1101/645903v2

Installing the Easy Way -- via Conda

Requirements/Dependencies

Conda or Miniconda3 installed
usearch version 10 or higher

Clone from github

Go to desired installation directory, open a terminal and run:
git clone https://github.com/ziels/longread-UMI-pipeline
Go to scripts directory:
cd longread-UMI-pipeline/scripts
Modify dependencies.sh with path to usearch
Change line export USEARCH=usearch_path to give your exact file path to the usearch executable file (instead of usearch_path).

Install conda environment

conda env create -f environment.yaml

Check that Conda env is installed
conda info —-envs

Make sure you get something like:

# conda environments:
#
longread-UMI <path to conda envs>/longread-UMI

Note the installation path of the longread-UMI environment (for next steps)

Activate conda environment
conda activate longread-UMI
Or, depending on your conda version: source activate longread-UMI

Editing porechop adapters with custom primers

Find path of conda environments from command
conda info —-envs
Check the porechop path works:
ls < path to conda environments >/longread-UMI/lib/python3.6/site-packages/porechop

Make sure you see an adapters.py returned from the above command.

Back-up and replace adapters.py
mv <path_to_conda_environments>/longread-UMI/lib/python3.6/site-packages/porechop/adapters.py <path_to_conda_environments>/longread-UMI/lib/python3.6/site-packages/porechop/adapters_old.py

(From within longread-UMI-pipeline/scripts directory):
cp ./adapters.py <path_to_conda_environments>/longread-UMI/lib/python3.6/site-packages/porechop/adapters.py

Test longread-UMI-pipeline on test data:

Go to /path/to/longread-UMI-pipeline/test_data
Open a terminal in the directory and run
../longread_UMI_pipeline.sh -d test_reads.fq -s 10 -c 30 -t 1

Installing the Hard Way -- Manual installation

Requirements/Dependencies

Clone from github

Go to desired installation directory, open a terminal and run:
git clone https://github.com/SorenKarst/longread-UMI-pipeline

Make bash scripts executable

Go to longread-UMI-pipeline directory, open a terminal and run:
find . -name "*.sh" -exec chmod +x {} \;

Create symlink to run longread-UMI-pipeline and longread-UMI-mockanalysis from terminal

Create symlink in ~/bin by opening a terminal and run:
mkdir -p ~/bin
ln -s /path/to/longread-UMI-pipeline/longread_UMI_pipeline.sh ~/bin/longread-UMI-pipeline
ln -s /path/to/longread-UMI-pipeline/longread_UMI_mockanalysis.sh ~/bin/longread-UMI-mockanalysis

Change paths to dependencies

Open /path/to/longread-UMI-pipeline/scripts/dependencies.sh in a texteditor.
Change all paths under "Program paths" to reflect installation paths on your system.
If unsure of the paths try to type which <function> in the terminal. I.e. which racon.
Install any missing dependencies.

Customize porechop adaptors.py to be able to detect custom primers

We recommend to make a seperate installation of porechop to use with the longread-UMI-pipeline.
Go to path/to/porechop/porechop/
Backup current adapters.py.
Replace current adapters.py with path/to/longread-UMI-pipeline/scripts/adapters.py.

Test scripts

Open a terminal anywhere and run:
longread-UMI-pipeline -h or /path/to/longread-UMI-pipeline -h
Test longread-UMI-pipeline on test data:
Go to /path/to/longread-UMI-pipeline/test_data
Open a terminal in the directory and run longread-UMI-pipeline -d test_reads.fq -s 10 -c 30 -t 1

Run pipeline on Zymo mock data

Create a working directory, open a terminal, download the Zymo mock fastq data and decompress:
wget ftp://ftp.sra.ebi.ac.uk/vol1/fastq/ERR333/003/ERR3336963/ERR3336963_1.fastq.gz; gunzip -c ERR3336963_1.fastq.gz > reads.fq
Open a terminal in the directory and run:
longread-UMI-pipeline -d reads.fq -s 1000000 -c 30 -t <Number-of-threads>
Open a terminal in the directory and run:
longread-UMI-mockanalysis <Number-of-threads>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

longread-UMI-pipeline

Installing the Easy Way -- via Conda

Requirements/Dependencies

Clone from github

Install conda environment

Editing porechop adapters with custom primers

Test longread-UMI-pipeline on test data:

Installing the Hard Way -- Manual installation

Requirements/Dependencies

Clone from github

Make bash scripts executable

Create symlink to run longread-UMI-pipeline and longread-UMI-mockanalysis from terminal

Change paths to dependencies

Customize porechop adaptors.py to be able to detect custom primers

Test scripts

Run pipeline on Zymo mock data

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 74 Commits
scripts		scripts
test_data		test_data
README.md		README.md
adapters.py		adapters.py
longread_UMI_mockanalysis.sh		longread_UMI_mockanalysis.sh
longread_UMI_pipeline.sh		longread_UMI_pipeline.sh

ziels/longread-UMI-pipeline

Folders and files

Latest commit

History

Repository files navigation

longread-UMI-pipeline

Installing the Easy Way -- via Conda

Requirements/Dependencies

Clone from github

Install conda environment

Editing porechop adapters with custom primers

Test longread-UMI-pipeline on test data:

Installing the Hard Way -- Manual installation

Requirements/Dependencies

Clone from github

Make bash scripts executable

Create symlink to run longread-UMI-pipeline and longread-UMI-mockanalysis from terminal

Change paths to dependencies

Customize porechop adaptors.py to be able to detect custom primers

Test scripts

Run pipeline on Zymo mock data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages