Skip to content

Commit 3bd45b4

Browse files
authored
Add files via upload
1 parent 8e44e18 commit 3bd45b4

9 files changed

+175
-44
lines changed

.Rbuildignore

+2-2
Original file line numberDiff line numberDiff line change
@@ -1,2 +1,2 @@
1-
^.*\.Rproj$
2-
^\.Rproj\.user$
1+
^.*\.Rproj$
2+
^\.Rproj\.user$

.gitignore

+5-4
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,5 @@
1-
.Rproj.user
2-
.Rhistory
3-
.RData
4-
.Ruserdata
1+
.Rproj.user
2+
.Rhistory
3+
.RData
4+
.Ruserdata
5+
inst/doc

CaSpER.Rproj

+21-17
Original file line numberDiff line numberDiff line change
@@ -1,17 +1,21 @@
1-
Version: 1.0
2-
3-
RestoreWorkspace: Default
4-
SaveWorkspace: Default
5-
AlwaysSaveHistory: Default
6-
7-
EnableCodeIndexing: Yes
8-
UseSpacesForTab: Yes
9-
NumSpacesForTab: 2
10-
Encoding: UTF-8
11-
12-
RnwWeave: Sweave
13-
LaTeX: pdfLaTeX
14-
15-
BuildType: Package
16-
PackageUseDevtools: Yes
17-
PackageInstallArgs: --no-multiarch --with-keep.source
1+
Version: 1.0
2+
3+
RestoreWorkspace: Default
4+
SaveWorkspace: Default
5+
AlwaysSaveHistory: Default
6+
7+
EnableCodeIndexing: Yes
8+
UseSpacesForTab: Yes
9+
NumSpacesForTab: 2
10+
Encoding: UTF-8
11+
12+
RnwWeave: Sweave
13+
LaTeX: pdfLaTeX
14+
15+
AutoAppendNewline: Yes
16+
StripTrailingWhitespace: Yes
17+
18+
BuildType: Package
19+
PackageUseDevtools: Yes
20+
PackageInstallArgs: --no-multiarch --with-keep.source
21+
PackageRoxygenize: rd,collate,namespace,vignette

DESCRIPTION

+15-17
Original file line numberDiff line numberDiff line change
@@ -1,17 +1,15 @@
1-
Package: CaSpER
2-
Type: Package
3-
Title: Identify large-scale CNV events from single cell or bulk RNA-Seq data
4-
Version: 0.1.0
5-
Author: Akdes Serin Harmanci, Arif O. Harmanci
6-
Maintainer: Akdes Serin Harmanci <[email protected]>
7-
Description: More about what it does (maybe more than one line)
8-
Use four spaces when indenting paragraphs within the Description.
9-
License:
10-
Encoding: UTF-8
11-
LazyData: true
12-
LinkingTo: Rcpp
13-
Depends: Rcpp, signal, pheatmap, RColorBrewer, HMMcopy, IRanges, grid, GenomeGraphs, ggplot2, reshape, mclust, ggpubr, scales, gridExtra, igraph, intergraph, ggnetwork, philentropy, ape, biomaRt, limma, GO.db, org.Hs.eg.db, GOstats
14-
RoxygenNote: 6.0.1
15-
Suggests: knitr,
16-
rmarkdown
17-
VignetteBuilder: knitr
1+
Package: CaSpER
2+
Type: Package
3+
Title: Identify large-scale CNV events from single cell or bulk RNA-Seq data
4+
Version: 0.1.0
5+
Author: Akdes Serin Harmanci, Arif O. Harmanci
6+
Maintainer: Akdes Serin Harmanci <[email protected]>
7+
Description: Identification, visualization and integrative analysis of CNV events in multiscale resolution using single-cell or bulk RNA sequencing data
8+
Encoding: UTF-8
9+
LazyData: true
10+
LinkingTo: Rcpp
11+
Depends: Rcpp, signal, pheatmap, RColorBrewer, HMMcopy, IRanges, grid, GenomeGraphs, ggplot2, reshape, mclust, ggpubr, scales, gridExtra, igraph, intergraph, ggnetwork, philentropy, ape, biomaRt, limma, GO.db, org.Hs.eg.db, GOstats
12+
RoxygenNote: 6.1.1
13+
Suggests: knitr,
14+
rmarkdown
15+
VignetteBuilder: knitr

NAMESPACE

+7-2
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
export(CreateCasperObject)
22
export(lohCallMedianFilterByChr)
33
export(lohCallMedianFilter)
4-
export(processMedianFiltering)
4+
export(ProcessData)
55
export(PerformMedianFilter)
66
export(PerformMedianFilterByChr)
77
export(CenterSmooth)
@@ -36,7 +36,12 @@ export(plotSCellCNVTree)
3636
export(generateAnnotation)
3737
export(getDiffExprGenes)
3838
export(generateEnrichmentSummary)
39-
export(go.enrichment.BP)
39+
export(extractSegmentSummary)
40+
export(splitByOverlap)
41+
export(extractLargeScaleEvents)
42+
export(mergeScalesAndGenerateFinalEventSummary)
43+
export(goEnrichmentBP)
44+
export(gene.matrix)
4045
import(igraph)
4146
import(signal)
4247
import(pheatmap)

README.md

+10-2
Original file line numberDiff line numberDiff line change
@@ -26,8 +26,8 @@ You can install CaSpER R package using the following R commands:
2626
Install dependencies first:
2727

2828
``` r
29-
You may need to install libcurl-devel, libopenssl-devel and libxml2-devel
30-
ex: sudo yum -y install libxml2-devel.x86_64
29+
You may need to install libcurl-devel, libopenssl-devel openssl-devel and libxml2-devel
30+
ex: sudo yum -y install libxml2-devel libcurl-devel libopenssl-devel openssl-devel
3131

3232
source("https://bioconductor.org/biocLite.R")
3333
biocLite(c('HMMcopy', 'GenomeGraphs', 'biomaRt', 'limma', 'GO.db', 'org.Hs.eg.db', 'GOstats'))
@@ -100,3 +100,11 @@ Tutorials
100100
1. [Yale meningioma Bulk RNA-Seq dataset](/demo/meningioma.R)
101101
2. [TCGA-GBM Bulk RNA-Seq dataset](/demo/tcga_GBM.R)
102102
3. [GBM Single-cell RNA-Seq dataset](/demo/sCellGBM.R)
103+
104+
105+
For running the examples, you can use Rscript command:
106+
Rscript /demo/meningioma.R
107+
Rscript /demo/tcga_GBM.R
108+
Rscript /demo/sCellGBM.R
109+
110+
Each run generally takes about 5-20 minutes.

README.txt

+106
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,106 @@
1+
CaSpER
2+
---
3+
4+
CaSpER is an algorithm for identification, visualization and integrative analysis of CNV events in multiscale resolution using single-cell or bulk RNA sequencing data.
5+
6+
It takes as input:
7+
8+
- Mapped RNA-Seq reads (Bam files)
9+
10+
- Normalized expression matrix
11+
12+
and outputs:
13+
14+
- CNV events in multiscale resolution
15+
16+
- Mutually exclusive and co-occurent CNV events
17+
18+
19+
Installation
20+
----------
21+
22+
CaSpER is developed under R version 3.4.3.
23+
24+
You can install CaSpER R package using the following R commands:
25+
26+
Install dependencies first:
27+
28+
You need to install libcurl-devel, libopenssl-devel, openssl-devel, and libxml2-devel
29+
>> sudo yum -y install libxml2-devel libcurl-devel libopenssl-devel openssl-devel
30+
31+
Next it is necessary to install R package dependencies
32+
>> source("https://bioconductor.org/biocLite.R")
33+
>> biocLite(c('HMMcopy', 'GenomeGraphs', 'biomaRt', 'limma', 'GO.db', 'org.Hs.eg.db', 'GOstats'))
34+
35+
Finally, to install CaSpER R package, use following:
36+
>> require(devtools)
37+
>> install_github("akdess/CaSpER")
38+
39+
These commands are included in install_dependencies.sh and install_dependencies.R scripts.
40+
41+
Please note that you may need to be root to install these dependencies.
42+
43+
For extracting B-allele frequencies from RNA-Seq bam files download BAFExtract C++ source or binary code from https://github.com/akdess/BAFExtract.
44+
45+
After downloading source code type the following:
46+
47+
cd BAFExtract
48+
make clean
49+
make
50+
51+
The executable is built under directory /bin.
52+
53+
BAFExtract is developed and tested under CentOS 7.
54+
55+
Usage
56+
----------
57+
58+
59+
Step 1. Merge single cell RNA-Seq bam files (required for single-cell studies)
60+
61+
>> bamtools merge -list <bam_file_names> -out <sample_name>_merged.bam
62+
>> samtools index <sample_name>_merged.bam
63+
64+
Step 2. Extract BAF values from RNA-Seq bam files
65+
66+
>> samtools view <bam_file> | ./BAFExtract -generate_compressed_pileup_per_SAM stdin <genome_list> <sample_dir> 50 0; ./BAFExtract -get_SNVs_per_pileup <genome_list> <sample_dir> <genome_fasta_pileup_dir> 20 4 0.1 <output_baf_file>
67+
where
68+
<sample_dir>: the name of sample directory
69+
<output_baf_file>: final output
70+
71+
You can download and unzip genome_fasta_pileup_dir files from :
72+
73+
[for hg38](https://www.dropbox.com/s/ysrcfcnk7z8gyit/hg38.zip?dl=0)
74+
75+
[for hg19](https://www.dropbox.com/s/a3u8f2f8ufm5wdj/hg19.zip?dl=0)
76+
77+
You can download genome_list files from :
78+
79+
[for hg38](https://www.dropbox.com/s/rq7v67tiou1qwwg/hg38.list?dl=0)
80+
81+
[for hg19](https://www.dropbox.com/s/jcmt23nmuzm6poz/hg19.list?dl=0)
82+
83+
84+
Example run for BAFExtract:
85+
86+
[download example rna-seq bam file](https://www.dropbox.com/s/1vl6iip0b8jwu66/SRR1295366.sorted.bam?dl=0)
87+
88+
```{bash}
89+
mkdir test; samtools view SRR1295366.sorted.bam | ./bin/BAFExtract -generate_compressed_pileup_per_SAM stdin hg38.list test 50 0; ./bin/BAFExtract -get_SNVs_per_pileup hg38.list test ./hg38/ 20 4 0.1 test.baf
90+
```
91+
92+
Step 3. Run CaSpER R package (See tutorials below)
93+
94+
Tutorials
95+
----------
96+
97+
1. [Yale meningioma Bulk RNA-Seq dataset](/demo/meningioma.R)
98+
2. [TCGA-GBM Bulk RNA-Seq dataset](/demo/tcga_GBM.R)
99+
3. [GBM Single-cell RNA-Seq dataset](/demo/sCellGBM.R)
100+
101+
For running the examples, you can use Rscript command:
102+
Rscript /demo/meningioma.R
103+
Rscript /demo/tcga_GBM.R
104+
Rscript /demo/sCellGBM.R
105+
106+
Each run generally takes about 5-20 minutes.

install_dependencies.R

+7
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,7 @@
1+
#source("https://bioconductor.org/biocLite.R")
2+
#biocLite(c('HMMcopy', 'GenomeGraphs', 'biomaRt', 'limma', 'GO.db', 'org.Hs.eg.db', 'GOstats'))
3+
4+
install.packages('devtools');
5+
6+
require(devtools)
7+
install_github("akdess/CaSpER")

install_dependencies.sh

+2
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,2 @@
1+
sudo yum -y install libcurl-devel libopenssl-devel libxml2-devel openssl-devel
2+

0 commit comments

Comments
 (0)