Skip to content

insilico/OceanWorldsBiosig

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

OceanWorldsBiosig

The data and analysis software will be made public upon publication of the manuscript.

Support for submitted paper: Interpretable Machine Learning Biosignature Detection from Ocean Worlds Analogue CO2 Isotopologue Data

IRMS Data

This repository contains benchmark ocean world analogue exerimental Isotope Ratio Mass Spec (IRMS) data for biotic and abiotic samples. Download directly from this site to use the data. The data can be found in two places: new_results/data and paper_results/data.

Analysis Script

We also provide an analysis R script to classify biotic versus abiotic samples (i.e., biosignature models) using LASSO penalized NPDR (nearest-neighbor projected distance regression) with random forest proximity metric and random forest for classification. To perform the analysis pipeline from the beginning, run new_results/npdr_rf_biosig.R in RStudio. Slight differences from the paper results may arise due to cross-validated parameter tuning. Thus, we also provide the reported paper (submitted) results in paper_results.

Install the NPDR R library as follows.

# install.packages("devtools") # uncomment to install devtools
library(devtools)
devtools::install_github("insilico/npdr")
library(npdr)

Other possible dependencies for running analysis script:

install.packages(c("ranger", "reshape2", "dplyr", "caret", "glmnet"))
install.packages(c("speedglm", "wordspace", "doParallel", "foreach"))

Authors

Lily Clough and Brett McKinney

Contact

[email protected]

Websites

Related references

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published