AIRR Seq pipelines

A collection of Nextflow workflows and processes for the analysis of AIRR-Seq data from the Adaptive immunoSEQ platform and the 10X VDJ platform. The processes mirror the utilities in LymphoSeq2. The repository also contains workflows for publication-specific analyses. The workflows are designed to run on Slurm clusters using the modules framework. Future updates will aim to make these pipelines more compatible with cloud services such as AWS or Azure. This repository contains the following workflows:

Serial Analysis of the T-Cell Receptor β-Chain Repertoire in People Living With HIV Reveals Incomplete Recovery After Long-Term Antiretroviral Therapy

General requirements

All Nextflow workflows in this repository require at least Nextflow version 20.10.0 and use Nextflow DSL2 syntax. For the time being, they are configured for Slurm clusters and rely on the modules package manager framework. The following modules are required:

Anaconda3
Nextflow
Singularity
R/4.1.0
Gliph2
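The minimum Nextflow version stated above can be checked before launching a workflow. The following is a minimal sketch, not part of this repository: the helper names version_ge and check_nextflow_version are hypothetical, and the comparison assumes a sort that supports the -V (version sort) option, as GNU coreutils does.

```shell
# Minimum Nextflow version stated in this README.
REQUIRED_NEXTFLOW="20.10.0"

# Hypothetical helper: succeed when version $1 is greater than or equal to version $2.
# Relies on "sort -V" (version sort), which is assumed to be available.
version_ge() {
    [ "$(printf '%s\n%s\n' "$2" "$1" | sort -V | head -n 1)" = "$2" ]
}

# Hypothetical helper: report whether the given Nextflow version is recent enough.
check_nextflow_version() {
    if version_ge "$1" "$REQUIRED_NEXTFLOW"; then
        echo "ok: Nextflow $1"
    else
        echo "too old: Nextflow $1 (need >= $REQUIRED_NEXTFLOW)"
        return 1
    fi
}

# Example usage after loading the Nextflow module (not executed here):
#   check_nextflow_version "$(nextflow -v | grep -oE '[0-9]+\.[0-9]+\.[0-9]+' | head -n 1)"
```

The exact module names and versions to load depend on how your cluster's module system is set up, so they are left out of the sketch.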

The following R packages are required:

Serial Analysis of the T-Cell Receptor β-Chain Repertoire in People Living With HIV Reveals Incomplete Recovery After Long-Term Antiretroviral Therapy

The workflow for the analysis of T-cell receptor β-chain (TRB) repertoires is defined in main.nf under the entry point cfarExperimentOne. A template parameter file, cfar_template.yaml, is provided. The required metadata file is provided in the GitHub repo under metadata/cfar/CFAR_Dean_metadata.xlsx. TRB repertoire sequencing data from the 192 peripheral blood samples are publicly available on the Adaptive immunoSEQ Analyzer portal (https://clients.adaptivebiotech.com/pub/towlerton-2022-hiv). The control dataset was sampled from the 786 TRB repertoire sequencing datasets from bone-marrow donors available on the Adaptive immunoSEQ Analyzer portal (Adaptive Biotechnologies, Seattle, WA; https://clients.adaptivebiotech.com/pub/Dean-2015-GenomeMed).

Before running the workflow, please edit run.sh to set the following variables:

SCRATCH=/path/to/scratch/directory # Nextflow caches many intermediate files; without a scratch directory, your run directory will fill up quickly.

PARAMS=/path/to/params/file.yaml # Please make sure to edit the cfar_template.yaml with the correct paths for all the inputs.

You will also need to edit nextflow.config to set the correct queue name for your Slurm configuration.

To run the workflow, run the following command:

./run.sh
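The contents of run.sh are not reproduced in this README. As a rough, hypothetical sketch of how the two variables could map onto a Nextflow invocation, the snippet below prints a command that uses the entry point cfarExperimentOne together with the standard Nextflow run options -entry, -params-file and -work-dir. The helper name nextflow_cmd is an assumption for illustration, and the actual script in the repository may differ.

```shell
# Hypothetical sketch of a launcher; the repository's actual run.sh may differ.
# Placeholder paths are used when the variables are not already set.
SCRATCH="${SCRATCH:-/path/to/scratch/directory}"
PARAMS="${PARAMS:-/path/to/params/file.yaml}"

# Print the Nextflow command for a scratch directory ($1) and a parameter file ($2).
#   -entry        selects the workflow entry point defined in main.nf
#   -params-file  points to the YAML file based on cfar_template.yaml
#   -work-dir     keeps Nextflow's cached intermediate files in the scratch directory
nextflow_cmd() {
    printf 'nextflow run main.nf -entry cfarExperimentOne -params-file %s -work-dir %s\n' "$2" "$1"
}

# Dry run: show the command instead of executing it.
nextflow_cmd "$SCRATCH" "$PARAMS"
```

Printing the command first makes it easy to confirm that both paths are correct before submitting anything to the cluster.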
