IDEARS - Integrated Disease Explanation and Associations Risk Scoring

Applies to the UK Biobank (UKB) datasets, covering dementia, Alzheimer's disease (AD) and Parkinson's disease (PD) classification with SHAP-based model interpretation.

Overview

This is the codebase for IDEARS (Integrated Disease Explanation and Associations Risk Scoring). Its overall architecture is shown below.

How to Run

To simplify configuration, install Anaconda and set the project up in a virtual environment.

Install Anaconda:

https://www.anaconda.com/products/individual

Create the environment:

conda env create -f conda_env.yml

Activate the environment:

conda activate conda-env

Then run startlocal_woDocker.bat on Windows or startlocal_woDocker.sh on Linux.
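
If you would rather start the platform from Python than pick the script by hand, the choice between the two start scripts can be sketched as below. This is only an illustration: the helper names `start_script` and `launch` are hypothetical and not part of the codebase, and `launch` assumes it is called from the repository root with `cmd` or `bash` available.

```python
import platform
import subprocess


def start_script(system=None):
    """Return the start script named in this README for the given OS."""
    system = system or platform.system()
    if system == "Windows":
        return "startlocal_woDocker.bat"
    if system == "Linux":
        return "startlocal_woDocker.sh"
    # The README only documents Windows and Linux.
    raise ValueError(f"no start script documented for {system!r}")


def launch(system=None):
    """Run the start script; assumes the current directory is the repository root."""
    system = system or platform.system()
    script = start_script(system)
    # Assumption: the .bat file is run through cmd and the .sh file through bash.
    command = ["cmd", "/c", script] if system == "Windows" else ["bash", script]
    return subprocess.run(command, check=True)
```

Call `launch()` only after the conda environment has been activated, since the start scripts presumably rely on the packages installed there.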

Codebase Structure

data_gen.py performs ETL on the data and creates the model datasets.
data_proc.py handles additional data processing, including the creation of normalised datasets.
ml.py runs the models, including logistic regression and XGBoost, and handles model interpretability using SHAP.
analysis.py creates charts and performs additional statistical tests, including paired t-tests.
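
To make two of the steps above concrete, here is a minimal, dependency-free sketch of dataset normalisation and a paired t-test. It is not taken from data_proc.py or analysis.py: the function names are hypothetical, z-score scaling is an assumption about what "normalised" means here, and the scores are placeholder values rather than results from this project. In practice `scipy.stats.ttest_rel` computes the same statistic together with a p-value.

```python
import math
import statistics


def zscore_normalise(values):
    """Rescale a numeric column to mean 0 and sample standard deviation 1."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1)
    if sd == 0:
        raise ValueError("cannot normalise a constant column")
    return [(v - mean) / sd for v in values]


def paired_t_statistic(a, b):
    """Return (t, degrees_of_freedom) for a paired t-test of a against b."""
    if len(a) != len(b) or len(a) < 2:
        raise ValueError("need two equal-length samples with at least 2 pairs")
    diffs = [x - y for x, y in zip(a, b)]
    sd = statistics.stdev(diffs)
    if sd == 0:
        raise ValueError("all paired differences are identical")
    # t = mean difference divided by its standard error.
    t = statistics.fmean(diffs) / (sd / math.sqrt(len(diffs)))
    return t, len(diffs) - 1


# Placeholder per-fold scores for two hypothetical models (not study results).
scores_model_a = [0.80, 0.82, 0.79, 0.81, 0.83]
scores_model_b = [0.78, 0.80, 0.78, 0.79, 0.80]
t_stat, dof = paired_t_statistic(scores_model_a, scores_model_b)
print(f"t = {t_stat:.3f}, degrees of freedom = {dof}")
```

The test is paired because both models are scored on the same folds, so the comparison is made on the per-fold differences rather than on two independent samples.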

The Jupyter notebooks used for AD are:

AD_ml_part_1.ipynb
Master_ml.ipynb

Overview

Import modules etc.

Directory Tree and Explanations

This folder contains the implementation of the IDEARS platform.

Enquiries

Michael Allwright - [email protected]

