Skip to content

Latest commit

 

History

History
94 lines (44 loc) · 4.38 KB

README.md

File metadata and controls

94 lines (44 loc) · 4.38 KB

Huntington's Disease RNA-Seq Data: Insights and Analysis

Final Project for BF591 (R for Biological Sciences)

This project involved creating an RShiny application to facilitate the exploration of a differential expression dataset derived from post-mortem Huntington’s Disease brain samples. The data is publicly available via GEO, and the project references findings published in this paper. The app's design is customized with CSS, bypassing the default RShiny themes for a tailored aesthetic.

Screenshot 2024-12-16 at 01 53 17

Project Objectives and Dataset

The app provides interactive tools for analyzing a dataset related to Huntington’s Disease. Various input files support the app’s functionality, including:

Metadata: Information about samples/experiments (GSE64810_series_matrix.txt).

Normalized Counts: Processed expression data (GSE64810_mlhd_DESeq2_norm_counts_adjust.csv).

Differential Expression: Outlier-trimmed analysis results (GSE64810_mlhd_DESeq2_diffexp_DESeq2_outlier_trimmed_adjust.csv).

Gene Conversion: File mapping Ensembl IDs to gene symbols (human_id2gene.txt).

Pathway Analysis Results: FGSEA outcomes (fgsea_results.csv), generated using a hallmark gene set (c2.all.v2023.1.Hs.symbols.gmt).

App Features and Tabs

1. Sample Metadata

This section allows users to explore metadata with filterable columns, a search bar, and dynamic visualizations:

Summary Tab: Displays column properties, including data types and calculated statistics for numerical fields.

Metadata Tab: Provides an interactive view of the complete metadata file with horizontal scrolling.

Violin Plot Tab: Generates violin plots based on user-selected continuous variables via dropdowns.

Screenshot 2024-12-16 at 01 54 58 Screenshot 2024-12-16 at 01 55 09

2. Counts Matrix

This tab facilitates exploration of normalized counts data through:

Filtered Data: A summary of genes passing/failing user-defined variance thresholds.

Scatter Plot: Visualizes Ranked Median vs Log Variance and Ranked Median vs Number of Zeros, with color-coded data points.

Heatmap: Displays expression patterns across samples.

PCA: Visualizes principal components with adjustable settings for the number of PCs.

Screenshot 2024-12-16 at 01 57 17 Screenshot 2024-12-16 at 01 57 26 Screenshot 2024-12-16 at 01 57 35

3. Differential Expression

Users can interactively explore differentially expressed genes with:

Metadata Tab: Full data display with filtering and search capabilities.

Plot Tab Includes:

Volcano Plot: Customizable padj thresholds, axis selection, and dynamic color settings.

Filtered Table: Displays genes passing the user-defined filters.

Screenshot 2024-12-16 at 02 14 27 Screenshot 2024-12-16 at 02 14 47

4. GSEA (Gene Set Enrichment Analysis)

This section highlights FGSEA results:

Barplot Tab: Displays pathway enrichment scores filtered by user-defined thresholds.

Table Tab: Lists pathways passing the threshold, with options for pathway type and downloadable results.

Scatter Plot Tab: Plots normalized enrichment scores (NES) against -log10(padj) values.

Screenshot 2024-12-16 at 02 17 53 Screenshot 2024-12-16 at 02 18 18 Screenshot 2024-12-16 at 02 19 33