StateTransitionEnrichment

To find state transitions that are enriched or depleted in a given set of genes (compared to the background), we analyzed our data with binomial regression in a Bayesian framework.

$$Y_{t,x} \sim \mathrm{Binomial}(\theta_{t,x},\, N_x)$$

$$\theta_{t,x} = \mathrm{logistic}(\alpha_t + \beta_t \, x)$$

$Y_{t,x}$ is the number of genes with transition $t$ in condition $x$, where $x = 0$ denotes the background and $x = 1$ the selected group. It depends on the number of genes in condition $x$ ($N_x$) and on the probability $\theta_{t,x}$ that transition $t$ occurs in a gene annotated to condition $x$. The influence of the condition on $\theta$ is expressed by the effect size $\beta_t$, which is the log odds ratio between the selected group and the background. The uncertainty of an enrichment (positive log odds ratio) or a depletion (negative log odds ratio) is represented by the posterior probability that $\beta_t$ is negative or positive, respectively.
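As an illustration of how the log odds ratio and its sign probability can be obtained for one transition, here is a minimal sketch in Python. It is not the repository's implementation: instead of fitting the logistic regression, it assumes independent Beta(1, 1) priors on the two probabilities and samples them directly, which yields the same quantity, logit(θ for x = 1) minus logit(θ for x = 0). The function name and the counts are hypothetical.

```python
import numpy as np

def log_odds_ratio_posterior(y_bg, n_bg, y_sel, n_sel, draws=20000, seed=0):
    """Posterior draws of beta = logit(theta_1) - logit(theta_0).

    Assumption: independent Beta(1, 1) priors on theta_0 (background) and
    theta_1 (selected group). The priors used in the repository may differ.
    """
    rng = np.random.default_rng(seed)
    # Conjugate update: Beta(1 + successes, 1 + failures)
    theta_bg = rng.beta(1 + y_bg, 1 + n_bg - y_bg, size=draws)
    theta_sel = rng.beta(1 + y_sel, 1 + n_sel - y_sel, size=draws)
    logit = lambda p: np.log(p) - np.log1p(-p)
    return logit(theta_sel) - logit(theta_bg)

# Hypothetical counts: 300 of 10000 background genes and 30 of 200 selected
# genes show the transition.
beta = log_odds_ratio_posterior(y_bg=300, n_bg=10000, y_sel=30, n_sel=200)

p_positive = float(np.mean(beta > 0))   # posterior probability of enrichment
p_negative = 1.0 - p_positive           # uncertainty of calling it enriched
print(f"median beta = {np.median(beta):.2f}, P(beta < 0) = {p_negative:.4f}")
```

A small P(beta < 0) means an enrichment is well supported; for a depletion, the corresponding uncertainty is P(beta > 0).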

Explaining the Analysis

The analysis consists of three branches (see dag.svg):

Testing the accuracy of the model.
Testing the plausibility of the results.
Actual analysis of the genes of interest.

The accuracy of the model is assessed by posterior predictive checks (PPC) and by the ability of the model to recover known parameters from simulated data. The plausibility of the results is assessed by analyzing a set of genes (housekeeping genes, HKG) in which certain transitions are expected to be enriched or depleted.
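The two accuracy checks can be sketched as follows. This is a simplified stand-in rather than the pipeline's code: data are simulated from known parameters, the posterior is approximated with independent Beta(1, 1) priors (an assumption), and the check is whether the credible interval contains the simulated effect and whether replicated counts resemble the observed ones. All parameter values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)

def logistic(z):
    return 1.0 / (1.0 + np.exp(-z))

def logit(p):
    return np.log(p) - np.log1p(-p)

# Known parameters used to simulate data (hypothetical values).
alpha_true, beta_true = -3.0, 1.0
n = np.array([10000, 200])                  # N_0 (background), N_1 (selected)
theta_true = logistic(alpha_true + beta_true * np.array([0, 1]))
y = rng.binomial(n, theta_true)             # simulated Y_{t,0}, Y_{t,1}

# Posterior draws under independent Beta(1, 1) priors (assumed).
draws = 20000
theta = rng.beta(1 + y, 1 + n - y, size=(draws, 2))
beta_post = logit(theta[:, 1]) - logit(theta[:, 0])

# Recovery check: does the 95% credible interval contain the simulated beta?
lo, hi = np.quantile(beta_post, [0.025, 0.975])
recovered = bool(lo <= beta_true <= hi)

# Posterior predictive check: replicate counts from the posterior draws and
# compare them with the observed counts, separately for each condition.
y_rep = rng.binomial(n, theta)              # shape (draws, 2)
ppc_p = np.mean(y_rep >= y, axis=0)         # posterior predictive p-values

print(f"95% interval for beta: [{lo:.2f}, {hi:.2f}], recovered: {recovered}")
print(f"posterior predictive p-values: {ppc_p}")
```

Posterior predictive p-values close to 0 or 1 would indicate that the model fails to reproduce the observed counts. A single recovery run is only indicative; repeating the simulation over many seeds shows how often the interval covers the true value.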

Explanations of the individual plots can be found in the *.rst files in the report/ folder.

Prerequisite

Software

The tool is built to run in a conda environment with a Snakemake pipeline. The required packages are listed in "condaPackages.txt".

Input-Data

ListOfGoTerms: a plain-text file; for the format, see the example file GOthreeComb.go.
journal.pone.0000898.s001.csv: list of housekeeping genes (HKG) from https://doi.org/10.1371/journal.pone.0000898. It should be saved in the Data folder and is lifted over to the mouse genome (file can be requested).
Jan2020gcsc3.mutations.RData: genes and their corresponding transitions (file can be requested).

Executing the code with

$ snakemake
