Skip to content

Commit f4dc734

Browse files
author
pubudu
committed
hands-on descriptions: Objectives, datasets and steps
1 parent 9eafb15 commit f4dc734

7 files changed

+84
-2
lines changed

.DS_Store

0 Bytes
Binary file not shown.

content/2.NumPy_Data_Types.md

+23
Original file line numberDiff line numberDiff line change
@@ -352,6 +352,29 @@ Understanding the distinctions between Python's general-purpose types and NumPy'
352352

353353
## Hands-on
354354

355+
**Basic RNA Expression Analysis:**
356+
357+
***Objective:***
358+
359+
* Validate differential expression of immune-related genes between patient groups [previously classified as immunologically strong (`istrong`) and immunologically weak (`iweak`)](https://pubmed.ncbi.nlm.nih.gov/30393759/)
360+
* Apply an alternative analytical approach using Z-ratio methodology to complement standard differential expression tools like `EdgeR`
361+
* Confirm whether genes that contribute to the Immune-score show consistent expression patterns between the two patient groups
362+
* Provide a computational verification method that ranks immune-related genes based on their relative expression differences between the patient groups
363+
364+
***Dataset description:***
365+
366+
* `test_data/Sample_group_info.csv`
367+
* Patients were divided into `istrong` (immunity-strong) and `iweak` (immunity-weak) groups based on their Immunity score
368+
* The Immunity score was calculated by averaging the gene expression values of the 17 identified immune-genes
369+
* `test_data/count_matrix.csv`: Gene expression raw counts
370+
371+
***Steps:***
372+
373+
1. Data Loading
374+
1. Load sample group information (`iweak` vs `istrong`)
375+
2. Load gene expression count matrix
376+
2. Examine array information
377+
355378
:::{exercise} Hands-on
356379

357380
```python

content/4.Advance_indexing_filtering.md

+10
Original file line numberDiff line numberDiff line change
@@ -351,6 +351,16 @@ print(f"A: {a_count}, T: {t_count}, G: {g_count}, C: {c_count}")
351351

352352
## Hands-on
353353

354+
***Steps:***
355+
356+
1. Data Loading and visualization
357+
1. Load sample group information (`iweak` vs `istrong`)
358+
2. Load gene expression count matrix
359+
3. Examine array information
360+
2. Sample Identification
361+
1. Filter samples by group (`iweak`/`istrong`)
362+
2. Match count matrix columns with sample IDs
363+
354364
:::{exercise} Hands-on
355365

356366
```python

content/5.Essential_array_operations.md

+16-2
Original file line numberDiff line numberDiff line change
@@ -372,6 +372,21 @@ print(f"Row sums: {row_sums}") # [6 15]
372372

373373
## Hands-on
374374

375+
***Steps:***
376+
377+
1. Data Loading and visualization
378+
1. Load sample group information (`iweak` vs `istrong`)
379+
2. Load gene expression count matrix
380+
3. Examine array information
381+
2. Sample Identification
382+
1. Filter samples by group (`iweak`/`istrong`)
383+
2. Match count matrix columns with sample IDs
384+
3. Data Preprocessing
385+
1. Convert count matrix to numeric values
386+
2. Apply log2 transformation: log2(counts + 1)
387+
4. Statistical Analysis
388+
1. Calculate mean and std for each gene within each group
389+
375390
:::{exercise} Hands-on
376391

377392
```python
@@ -461,7 +476,6 @@ istrong_std = cm[:,cm_strong_cols].std(1) ## STD of istrong samples
461476

462477
:::{solution} Visual representation
463478

464-
**Visual representation - Calculating mean and STD of each gene in `iweak` group:**
479+
**Visual representation - Calculating mean and SD of each gene in `iweak` and `istrong` group:**
465480
![alt text](image-15.png)
466-
467481
:::

content/6.Vectorized_Operations_in_NumPy.md

+35
Original file line numberDiff line numberDiff line change
@@ -615,6 +615,41 @@ Vectorized operations in NumPy are a powerful tool for data analysis in computat
615615

616616
## Hands-on
617617

618+
***Objective:***
619+
620+
* Validate differential expression of immune-related genes between patient groups [previously classified as immunologically strong (`istrong`) and immunologically weak (`iweak`)](https://pubmed.ncbi.nlm.nih.gov/30393759/)
621+
* Apply an alternative analytical approach using Z-ratio methodology to complement standard differential expression tools like `EdgeR`
622+
* Confirm whether genes that contribute to the Immune-score show consistent expression patterns between the two patient groups
623+
* Provide a computational verification method that ranks immune-related genes based on their relative expression differences between the patient groups
624+
625+
***Dataset description:***
626+
627+
* `test_data/Sample_group_info.csv`
628+
* Patients were divided into `istrong` (immunity-strong) and `iweak` (immunity-weak) groups based on their Immunity score
629+
* The Immunity score was calculated by averaging the gene expression values of the 17 identified immune-genes
630+
* `test_data/count_matrix.csv`: Gene expression raw counts
631+
632+
***Steps:***
633+
634+
1. Data Loading and visualization
635+
1. Load sample group information (`iweak` vs `istrong`)
636+
2. Load gene expression count matrix
637+
3. Examine array information
638+
2. Sample Identification
639+
1. Filter samples by group (`iweak`/`istrong`)
640+
2. Match count matrix columns with sample IDs
641+
3. Data Preprocessing
642+
1. Convert count matrix to numeric values
643+
2. Apply log2 transformation: log2(counts + 1)
644+
4. Statistical Analysis
645+
1. Calculate mean and std for each gene within each group
646+
2. Compute Z-scores within each sample group
647+
3. Calculate Z-score differences between groups
648+
4. Compute standard deviation of all differences
649+
5. Differential Expression - verification
650+
1. Calculate Z-ratio: difference / std_difference
651+
2. Rank genes by Z-ratio (highest to lowest)
652+
618653
:::{exercise} Hands-on
619654

620655
```python

content/image-15.png

922 KB
Loading

content/image-17.png

-152 KB
Loading

0 commit comments

Comments
 (0)