Integrated analysis of single cell RNA-sequencing (scRNA-seq) data from multiple batches or studies is often necessary in order to learn functional changes in cellular states upon experimental perturbations or cell type relationships in a developmental lineage. Here we introduce a new algorithm (RPCI) that uses the gene-eigenvectors from a reference dataset to establish a global frame for projecting all datasets, with a clear advantage in preserving genuine gene expression differences in matching cell types between samples, such as those present in cells at distinct developmental stages or in perturbated vs control studies. This R package “RISC” (Robust Integration of Sinlgle Cell RNA-seq data implements the RPCI algorithm, with additional functions for scRNA-seq analysis, such as clustering cells, identifying cluster marker genes, detecting differentially expressed genes between experimental conditions, and importantly outputting integrated gene expression values for downstream data analysis.
install.packages(c("Matrix", "matrixStats", "irlba", "doParallel", "foreach", "data.table", "Rtsne", "umap", "MASS", "pbmcapply", "Rcpp", "RcppEigen", "densityClust", "FNN", "igraph", "RColorBrewer", "ggplot2", "gridExtra", "pheatmap"))
install_github("https://github.com/bioinfoDZ/RISC.git")
The RISC package can also be downloaded and installed mannually Link
install.packages("/Path/to/RISC_1.0.tar.gz", repos = NULL, type = "source")
Here we provide a vignettes which shows the key steps in analyzing example scRNA-seq datasets from the basal or squamous carcinoma patients before and after anti-PD-1 therapy (GSE123813). Link
We also provide an example how to convert Seurat object to RISC object Link
(1) RISC package: "RISC_1.0.tar.gz"
(2) Vignette for GSE123813: "GSE123813_Vignette.pdf"
(3) GSE123813 directory contains the informaiton of cell-type, patients and treatment.
file position, "/GSE123813/Raw_Data/bcc_annotation.tsv"