EECS 6893 Big Data Analytics - Final Project
Project ID: 201712-18
Authors: Jing Ai (ja3130), Michael Nguyen (mn2769), Haoquan Zhao (hz2441)
Alzheimer’s Disease affect 1 in 3 seniors in the US and is one of the fastest rising part of the healthcare budget. Gaining better understanding of disease patterns and achieving accurate diagnosis showing the disease progression are crucial problems to address. Our project aims to identify the Alzheimer’s Disease biomarker combinations with the highest diagnostic power and examine the disease patterns of patients at different disease stages. The novelty of our project is that we performed a comprehensive analysis that integrated clinical, genomic and imaging data and included patients of multiple disease stages (Normal, Early Mild Cognitive Impairment, Late Mild Cognitive Impairment and Diagnosed Alzheimer’s), as previous studies have only focused their analysis on one modality and binary phenotypes (diseased/not diseased).
The Alzheimer's Disease Neuroimaging Initiative (ADNI) data collection is a publicly available data collection consist of clinical, genetic and imaging datasets based on studies of approximately 1,550 participants including Alzheimer’s disease patients, mild cognitive impairment subjects and elderly controls across 3 multi-year cohorts (ADNI1, ADNI GO, ADNI2) between 2004 and 2017. More information on the data collection can be found here: http://adni.loni.usc.edu/data-samples/
Clinical
Imaging - radiology measurements
Genetic
Combined
Multi-layer Perceptron for Merged Data Classification
Convolutional Neural Network(CNN) for Raw Images Classification
Random Forest for classification and assessing feature importance
Spearman’s correlation for estimating feature correlations
tSNE for visualizing high dimensional data