Skip to content

Commit

Permalink
sync assignment and answers pages
Browse files Browse the repository at this point in the history
  • Loading branch information
malachig committed Jul 18, 2023
1 parent 940dce6 commit 4027b73
Show file tree
Hide file tree
Showing 2 changed files with 30 additions and 20 deletions.
20 changes: 10 additions & 10 deletions _posts/0003-08-01-Integrated_Assignment.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,12 +25,12 @@ Experimental information and other things to keep in mind:
- Libraries were prepared using standard Illumina protocols
- For this exercise we will be using a subset of the reads (first 1,000,000 reads from each pair).
- The files are named based on their SRR id's, and obey the following key:
- SRR7155055 = transfected sample 1
- SRR7155056 = transfected sample 2
- SRR7155057 = transfected sample 3
- SRR7155058 = control sample 1
- SRR7155059 = control sample 2
- SRR7155060 = control sample 3
- SRR7155055 = CBSLR knockdown sample 1 (T1 - aka transfected 1)
- SRR7155056 = CBSLR knockdown sample 2 (T2 - aka transfected 2)
- SRR7155057 = CBSLR knockdown sample 3 (T3 - aka transfected 3)
- SRR7155058 = control sample 1 (C1 - aka control 1)
- SRR7155059 = control sample 2 (C2 - aka control 2)
- SRR7155060 = control sample 3 (C3 - aka control 3)

Experimental descriptions from the study authors:

Expand Down Expand Up @@ -122,14 +122,14 @@ In order to make visualization easier, we're going to merge each of our bams int
```bash
#merge the bams for visulization purposes
cd $RNA_INT_ALIGN_DIR
java -Xmx2g -jar $PICARD MergeSamFiles OUTPUT=transfected.bam INPUT=SRR7155055.bam INPUT=SRR7155056.bam INPUT=SRR7155057.bam
java -Xmx2g -jar $PICARD MergeSamFiles OUTPUT=cbslr-knockdown.bam INPUT=SRR7155055.bam INPUT=SRR7155056.bam INPUT=SRR7155057.bam
java -Xmx2g -jar $PICARD MergeSamFiles OUTPUT=control.bam INPUT=SRR7155058.bam INPUT=SRR7155059.bam INPUT=SRR7155060.bam
```

Try viewing genes such as TP53 to get a sense of how the data is aligned. To do this:
- Load up IGV
- Change the reference genome to "Human hg38" in the top-left category
- Click on File > Load from URL, and in the File URL enter: "http://##.oicrcbw.ca/rnaseq/integrated_assignment/hisat2/transfected.bam". Repeat this step and enter "http://##.oicrcbw.ca/rnaseq/integrated_assignment/hisat2/control.bam" to load the other bam, where ## is your student number for the AWS instance.
- Click on File > Load from URL, and in the File URL enter: "http://##.oicrcbw.ca/rnaseq/integrated_assignment/hisat2/cbslr-knockdown.bam". Repeat this step and enter "http://##.oicrcbw.ca/rnaseq/integrated_assignment/hisat2/control.bam" to load the other bam, where ## is your student number for the AWS instance.
- Right-click on the alignments track in the middle, and Group alignments by "Library"
- Jump to TP53 by typing it into the search bar above

Expand All @@ -153,8 +153,8 @@ Try viewing genes such as TP53 to get a sense of how the data is aligned. To do

**Goals:**

- Perform differential analysis between the transfected and control samples
- Check if is differentially expressed
- Perform differential analysis between the CBSLR knockdown and control samples
- Check if CBSLR (tcons_00001221) on chromosome 1, is itself differentially expressed

First create a file that lists our 6 expression files, then view that file, then start an R session. Adapt the R tutorial file has been provided in the github repo for part 1 of the tutorial: Tutorial_Part1_ballgown.R. Modify it to fit the goals of this assignment then run it.

Expand Down
30 changes: 20 additions & 10 deletions _posts/0009-08-01-Integrated_Assignment_Answers.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,26 +11,36 @@ date: 0009-08-01

# Integrated Assignment answers

**Background:** The use of cell lines are often implemented in order to study different experimental conditions. One such kind of study is the effects of shRNA on expression profiles, to determine whether these effects target specific genes. Experimental models for these include using control shRNA to account for any expression changes that may occur from just the introduction of these molecules.
**Background:** Cell lines are often used to study different experimental conditions and to study the function of specific genes by various perturbation approaches. One such type of study involves knocking down expression of a target of interest by shRNA and then using RNA-seq to measure the impact on gene expression. These eperiments often include use of a control shRNA to account for any expression changes that may occur from just the introduction of these molecules. Differential expression is performed by comparing biological replicates of shRNA knockdown vs shRNA control.

**Objectives:** In this assignment, we will be using a subset of the GSE114360 dataset, which consists of 6 RNA sequence files on the SGC-7901 gastric cancer cell line, (3 transfected with tcons_00001221 shRNA, and 3 control shRNA), and determine the number of differentially expressed genes.
**Objectives:** In this assignment, we will be using a subset of the [GSE114360 dataset](https://www.ncbi.nlm.nih.gov/bioproject/PRJNA471072), which consists of 6 RNA-seq datasets generated from a cell line (3 transfected with shRNA, and 3 controls). Our goal will be to determine differentially expressed genes.

Experimental information and other things to keep in mind:

- The libraries are prepared as paired end.
- The samples are sequenced on a Illumina 4000.
- The samples are sequenced on an Illumina 4000.
- Each read is 150 bp long
- The dataset is located here: [GSE114360](https://www.ncbi.nlm.nih.gov/bioproject/PRJNA471072)
- 3 samples transfected with target shRNA and 3 samples with control shRNA
- Libraries were prepared using standard Illumina protocols
- For this exercise we will be using all a subset of the reads (first 1000000 reads from each pair).
- For this exercise we will be using a subset of the reads (first 1,000,000 reads from each pair).
- The files are named based on their SRR id's, and obey the following key:
- SRR7155055 = transfected sample 1
- SRR7155056 = transfected sample 2
- SRR7155057 = transfected sample 3
- SRR7155058 = control sample 1
- SRR7155059 = control sample 2
- SRR7155060 = control sample 3
- SRR7155055 = CBSLR knockdown sample 1 (T1 - aka transfected 1)
- SRR7155056 = CBSLR knockdown sample 2 (T2 - aka transfected 2)
- SRR7155057 = CBSLR knockdown sample 3 (T3 - aka transfected 3)
- SRR7155058 = control sample 1 (C1 - aka control 1)
- SRR7155059 = control sample 2 (C2 - aka control 2)
- SRR7155060 = control sample 3 (C3 - aka control 3)

Experimental descriptions from the study authors:

Experimental details from the [paper](https://pubmed.ncbi.nlm.nih.gov/35499052/):
"An RNA transcriptome-sequencing analysis was performed in shRNA-NC or shRNA-CBSLR-1 MKN45 cells cultured under hypoxic conditions for 24 h (Fig. 2A)."

Experimental details from the GEO submission:
"An RNA transcriptome sequencing analysis was performed in MKN45 cells that were transfected with tcons_00001221 shRNA or control shRNA."

Note that according to [GeneCards](https://www.genecards.org/cgi-bin/carddisp.pl?gene=CBSLR) and [HGNC](https://www.genenames.org/data/gene-symbol-report/#!/hgnc_id/55459), *CBSLR* and *tcons_00001221* refer to the same thing.

## PART 0 : Obtaining Data and References

Expand Down

0 comments on commit 4027b73

Please sign in to comment.