From b870d03db1ce91219dc2969ddc6758d34d9f194f Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Sat, 19 Jan 2019 22:23:07 -0800 Subject: [PATCH 01/15] Pushing background Planning on adding two more sections that expand on the points of the last paragraph. Will need help editing these points and making text more concise, to leave room for remaining two paragraphs. Also looking to adjust some text from the previous gene expression paragraphs and text surrounding latent space prediction. --- content/04.study.md | 8 ++++++++ content/citation-tags.tsv | 4 ++++ 2 files changed, 12 insertions(+) diff --git a/content/04.study.md b/content/04.study.md index 05c5054f..cdac2b56 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -47,6 +47,14 @@ Deep learning applied to gene expression data is still in its infancy, but the f Many previously untestable hypotheses can now be interrogated as deep learning enables analysis of increasing amounts of data generated by new technologies. For example, the effects of cellular heterogeneity on basic biology and disease etiology can now be explored by single-cell RNA-seq and high-throughput fluorescence-based imaging, techniques we discuss below that will benefit immensely from deep learning approaches. +### DNA Methylation + +DNA Methylation (DNAm), a process of epigenetic alteration that can change gene transcription without modifying sequence, is an important mechanism for understanding the development of oncogenesis. It has also shed light on the processes involving initial differentiation of stem cells, aging and pathogenesis in response to environmental exposures. + +Traditional epigenetic computational approaches focus on estimate metrics pertaining to prognosis. For instance, it has been shown that DNA methylation could be used to calculate immune cell type proportions, a proxy for the patient’s immune profile, in patient subpopulations. Such approaches are important for adjusting the results of epigenome-wide association studies for cell-type composition [@tag:Teschendorff2017]. These methods typically use a restricted set of candidate methylation sites (CpG sites), a reference library from which to make predictions [@tag:Houseman2012]. While utilizing reference-based libraries demonstrates strong predictive value for immune cell type estimation, these methods severely restrict the amount of underlying biology that can be understood and correlated with disease manifestations and phenotypes. When a referenced-based library is not available for use, unsupervised methods are able to estimate these immune profiles [@tag:TitusReview2017]. Methods that do not rely on these reference libraries [@tag:Houseman2014] are hindered by being unable to fully capture the nonlinearity of the methylation data. + +There are many promising deep learning approaches that serve to expand the number of sites that can be studied by capturing the complex interactions between different methylated regions of DNA and extract the complete set of informative biologically relevant features. The main approaches focus on: 1) estimating regions of methylation status and imputing missing methylation values, 2) performing classification and regression tasks, and 3) understanding latent embeddings of methylation states from which to extract biologically meaningful features, infer interpolated disease states, and uncover relevant CpG sites for the above prediction tasks. + ### Splicing Pre-mRNA transcripts can be spliced into different isoforms by retaining or skipping subsets of exons or including parts of introns, creating enormous spatiotemporal flexibility to generate multiple distinct proteins from a single gene. diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index 9da21983..2dc26c7f 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -96,6 +96,8 @@ Hinton2015_dk arxiv:1503.02531v1 Hochreiter doi:10.1093/bioinformatics/btm247 Hoff doi:10.1093/nar/gkp327 Horton1992_assessment doi:10.1093/nar/20.16.4331 +Houseman2012 doi:10.1186/1471-2105-13-86 +Houseman2014 doi:10.1093/bioinformatics/btu029 Hubara2016_qnn arxiv:1609.07061 Huddar2016_predicting doi:10.1109/ACCESS.2016.2618775 Hughes2016_macromol_react doi:10.1021/acscentsci.6b00162 @@ -245,7 +247,9 @@ Tan2014_psb doi:10.1142/9789814644730_0014 Tan2015_adage doi:10.1128/mSystems.00025-15 Tan2016_eadage doi:10.1101/078659 TAC-ELM doi:10.1142/S0219720012500151 +Teschendorff2017 doi:10.2217/epi-2016-0153 TensorFlow arxiv:1603.04467 +TitusReview2017 doi:10.1093/hmg/ddx275 Torracinta2016_deep_snp doi:10.1101/097469 Torracinta2016_sim doi:10.1101/079087 Tu1996_anns doi:10.1016/S0895-4356(96)00002-9 From 1063bb58c880c86570dface54a9d1b9c996b56cd Mon Sep 17 00:00:00 2001 From: Joshua Levy Date: Sat, 19 Jan 2019 22:42:32 -0800 Subject: [PATCH 02/15] Make DNAm Background More Readable for Editing --- content/04.study.md | 18 +++++++++++++----- 1 file changed, 13 insertions(+), 5 deletions(-) diff --git a/content/04.study.md b/content/04.study.md index cdac2b56..4b04253f 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -49,11 +49,19 @@ For example, the effects of cellular heterogeneity on basic biology and disease ### DNA Methylation -DNA Methylation (DNAm), a process of epigenetic alteration that can change gene transcription without modifying sequence, is an important mechanism for understanding the development of oncogenesis. It has also shed light on the processes involving initial differentiation of stem cells, aging and pathogenesis in response to environmental exposures. - -Traditional epigenetic computational approaches focus on estimate metrics pertaining to prognosis. For instance, it has been shown that DNA methylation could be used to calculate immune cell type proportions, a proxy for the patient’s immune profile, in patient subpopulations. Such approaches are important for adjusting the results of epigenome-wide association studies for cell-type composition [@tag:Teschendorff2017]. These methods typically use a restricted set of candidate methylation sites (CpG sites), a reference library from which to make predictions [@tag:Houseman2012]. While utilizing reference-based libraries demonstrates strong predictive value for immune cell type estimation, these methods severely restrict the amount of underlying biology that can be understood and correlated with disease manifestations and phenotypes. When a referenced-based library is not available for use, unsupervised methods are able to estimate these immune profiles [@tag:TitusReview2017]. Methods that do not rely on these reference libraries [@tag:Houseman2014] are hindered by being unable to fully capture the nonlinearity of the methylation data. - -There are many promising deep learning approaches that serve to expand the number of sites that can be studied by capturing the complex interactions between different methylated regions of DNA and extract the complete set of informative biologically relevant features. The main approaches focus on: 1) estimating regions of methylation status and imputing missing methylation values, 2) performing classification and regression tasks, and 3) understanding latent embeddings of methylation states from which to extract biologically meaningful features, infer interpolated disease states, and uncover relevant CpG sites for the above prediction tasks. +DNA Methylation (DNAm), a process of epigenetic alteration that can change gene transcription without modifying sequence, is an important mechanism for understanding the development of oncogenesis. +It has also shed light on the processes involving initial differentiation of stem cells, aging and pathogenesis in response to environmental exposures. + +Traditional epigenetic computational approaches focus on estimate metrics pertaining to prognosis. +For instance, it has been shown that DNA methylation could be used to calculate immune cell type proportions, a proxy for the patient’s immune profile, in patient subpopulations. +Such approaches are important for adjusting the results of epigenome-wide association studies for cell-type composition [@tag:Teschendorff2017]. +These methods typically use a restricted set of candidate methylation sites (CpG sites), a reference library from which to make predictions [@tag:Houseman2012]. +While utilizing reference-based libraries demonstrates strong predictive value for immune cell type estimation, these methods severely restrict the amount of underlying biology that can be understood and correlated with disease manifestations and phenotypes. +When a referenced-based library is not available for use, unsupervised methods are able to estimate these immune profiles [@tag:TitusReview2017]. +Methods that do not rely on these reference libraries [@tag:Houseman2014] are hindered by being unable to fully capture the nonlinearity of the methylation data. + +There are many promising deep learning approaches that serve to expand the number of sites that can be studied by capturing the complex interactions between different methylated regions of DNA and extract the complete set of informative biologically relevant features. +The main approaches focus on: 1) estimating regions of methylation status and imputing missing methylation values, 2) performing classification and regression tasks, and 3) understanding latent embeddings of methylation states from which to extract biologically meaningful features, infer interpolated disease states, and uncover relevant CpG sites for the above prediction tasks. ### Splicing From 8d3301d1258365c120e24d9260a13bb86008a43c Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Thu, 9 May 2019 10:46:22 -0400 Subject: [PATCH 03/15] Deep Review PR # 2 --- content/04.study.md | 28 +++++++++++++++------------- content/citation-tags.tsv | 19 +++++++++++++++---- 2 files changed, 30 insertions(+), 17 deletions(-) diff --git a/content/04.study.md b/content/04.study.md index 4b04253f..0fde4f34 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -49,19 +49,21 @@ For example, the effects of cellular heterogeneity on basic biology and disease ### DNA Methylation -DNA Methylation (DNAm), a process of epigenetic alteration that can change gene transcription without modifying sequence, is an important mechanism for understanding the development of oncogenesis. -It has also shed light on the processes involving initial differentiation of stem cells, aging and pathogenesis in response to environmental exposures. - -Traditional epigenetic computational approaches focus on estimate metrics pertaining to prognosis. -For instance, it has been shown that DNA methylation could be used to calculate immune cell type proportions, a proxy for the patient’s immune profile, in patient subpopulations. -Such approaches are important for adjusting the results of epigenome-wide association studies for cell-type composition [@tag:Teschendorff2017]. -These methods typically use a restricted set of candidate methylation sites (CpG sites), a reference library from which to make predictions [@tag:Houseman2012]. -While utilizing reference-based libraries demonstrates strong predictive value for immune cell type estimation, these methods severely restrict the amount of underlying biology that can be understood and correlated with disease manifestations and phenotypes. -When a referenced-based library is not available for use, unsupervised methods are able to estimate these immune profiles [@tag:TitusReview2017]. -Methods that do not rely on these reference libraries [@tag:Houseman2014] are hindered by being unable to fully capture the nonlinearity of the methylation data. - -There are many promising deep learning approaches that serve to expand the number of sites that can be studied by capturing the complex interactions between different methylated regions of DNA and extract the complete set of informative biologically relevant features. -The main approaches focus on: 1) estimating regions of methylation status and imputing missing methylation values, 2) performing classification and regression tasks, and 3) understanding latent embeddings of methylation states from which to extract biologically meaningful features, infer interpolated disease states, and uncover relevant CpG sites for the above prediction tasks. +#### Inference, Imputation, and Prediction + +Deep learning approaches are beginning to help address some of the current limitations of feature-by-feature analysis approaches to DNA methylation data, and may help uncover additional important features necessary to understand the biological underpinnings behind different pathological states. +One of the more popular applications is the prediction of the degree of methylation at CpG sites neighboring measured sites. +DeepSignal employs a convolutional neural network to construct features from raw electrical Nanopore signals from sites near a methylated base, and concatenates uses a bi-directional recurrent neural network on DNA sequences of the aligned signals to detect methylation [@tag:Ni2018]. +DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional GRUs [@tag:Angermueller2017] and methods like DAPL and DeepMethyl incorporate sequence and topological structure [@tag:Qiu2018] [@tag:Khwaja2017] [@tag:Wang2016_methyl] [@tag:Fu2019]. +In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018][@tag:Korfiatis2017]. +While these examples of methylation imputation and inference methods have value it is imperative to recognize limitations of imputing cytosine modifications. +Not only does imputing DNA methylation share similar limitations with genotype imputation, but there is additional complexity in correlation of DNA methylation that can depend on cell type and other variables related with DNA methylation that can vary by sample. +As the abundance of available tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing), grows, the accuracy of imputation methods for DNA methylation is expected to increase. + +Once DNA methylation is measured, deep learning approaches can also be used to perform classification and regression tasks. +For instance, one group employed a Deep Neural Network (DNN) to predict triglyceride concentrations pre- and post-treatment from approximately 450K features (differential DNAm levels) from the Illumina 450K microarray, and used the Dropout technique to generalize the model [@tag:Islam2018] [@tag:Darst2018]. +Another study transformed methylation profiles of about ten thousand TCGA samples to perform classification tasks to differentiate 32 different cancer types using the concatenation of various Convolutional Neural Network Maps and learn important patterns of differentially methylated regions that were used to make the classifications [@tag:Chatterjee2018]. +Finally, the prediction of cancer subtypes using DNAm was proposed based on a deep autoencoder. The system exploited content retrieval mechanisms to additionally understand the cancer cell type differentiation of the predicted cancer types [@tag:Khwaja2018] based on methylation of CpG islands. ### Splicing diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index 2dc26c7f..00aada8d 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -10,6 +10,7 @@ Asgari doi:10.1371/journal.pone.0141287 blast doi:10.1016/S0022-2836(05)80360-2 Angermueller2016_dl_review doi:10.15252/msb.20156651 Angermueller2016_single_methyl doi:10.1186/s13059-017-1189-z +Angermueller2017 doi:10.1186/s13059-017-1189-z Artemov2016_clinical doi:10.1101/095653 Arvaniti2016_rare_subsets doi:10.1101/046508 Bach2015_on doi:10.1371/journal.pone.0130140 @@ -27,6 +28,7 @@ Bracken2016_mirna doi:10.1038/nrg.2016.134 Boza doi:10.1371/journal.pone.0178751 Buggenthin2017_imaged_lineage doi:10.1038/nmeth.4182 Burlina2016_amd doi:10.1109/ISBI.2016.7493240 +Chatterjee2018 arxiv:1807.09617 Caruana2014_need arxiv:1312.6184 Caruana2015_intelligible url:https://dl.acm.org/citation.cfm?id=2788613 Chaudhary2017_multiom_liver_cancer doi:10.1101/114892 @@ -46,6 +48,7 @@ Codella2016_ensemble_melanoma arxiv:1610.04662 Consortium2012_encode doi:10.1038/nature11247 CudNN arxiv:1410.0759 Dahl2014_multi_qsar arxiv:1406.1231 +Darst2018 doi:10.1186/s12863-018-0646-3 Dean2012_nips_downpour url:http://research.google.com/archive/large_deep_networks_nips2012.html DeepChem url:https://github.com/deepchem/deepchem Deming2016_genetic arxiv:1605.07156 @@ -70,6 +73,7 @@ Esteva2017_skin_cancer_nature doi:10.1038/nature21056 Faruqi url:http://alifar76.github.io/sklearn-metrics/ Finnegan2017_maximum doi:10.1101/105957 Fong2017_perturb doi:10.1109/ICCV.2017.371 +Fu2019 doi:10.1109/TCBB.2019.2909237 Gal2015_dropout arxiv:1506.02142 Gaublomme2015_th17 doi:10.1016/j.cell.2015.11.009 Gargeya2017_dr doi:10.1016/j.ophtha.2017.02.008 @@ -96,12 +100,11 @@ Hinton2015_dk arxiv:1503.02531v1 Hochreiter doi:10.1093/bioinformatics/btm247 Hoff doi:10.1093/nar/gkp327 Horton1992_assessment doi:10.1093/nar/20.16.4331 -Houseman2012 doi:10.1186/1471-2105-13-86 -Houseman2014 doi:10.1093/bioinformatics/btu029 Hubara2016_qnn arxiv:1609.07061 Huddar2016_predicting doi:10.1109/ACCESS.2016.2618775 Hughes2016_macromol_react doi:10.1021/acscentsci.6b00162 Iglovikov2017_baa doi:10.1101/234120 +Islam2018 doi:10.1186/s12919-018-0121-1 Ithapu2015_efficient doi:10.1016/j.jalz.2015.01.010 Jafari2016_skin_lesions doi:10.1007/s11548-017-1567-8 Jha2017_integrative_models doi:10.1101/104869 @@ -123,9 +126,12 @@ Koh2016_denoising doi:10.1101/052118 Koh2017_understanding arxiv:1703.04730 Kooi2016_mamm_lesions doi:10.1016/j.media.2016.07.007 Kooi2017_mamm_tl doi:10.1002/mp.12110 +Korfiatis2017 doi:10.1007/s10278-017-0009-z Kraus2017_deeploc doi:10.15252/msb.20177551 Krizhevsky2013_nips_cnn url:https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf Krizhevsky2014_weird_trick arxiv:1404.5997 +Khwaja2017 doi:10.1109/BIOCAS.2017.8325078 +Khwaja2018 arxiv:1810.01243 Lacey2016_dl_fpga arxiv:1602.04283 Lakhani2017_radiography doi:10.1148/radiol.2017162326 Lanchantin2016_motif arxiv:1608.03644 @@ -133,6 +139,7 @@ Lee2016_deeptarget arxiv:1603.09123v2 Lee2016_emr_oct_amd doi:10.1101/094276 Lei2016_rationalizing arxiv:1606.04155 Leibig2016_dr doi:10.1101/084210 +Levy-Jurgenson2018 doi:10.1101/491357 Li2014_minibatch doi:10.1145/2623330.2623612 Li2016_variation doi:10.1126/science.aad9417 Liang2015_exprs_cancer doi:10.1109/TCBB.2014.2377729 @@ -164,6 +171,7 @@ McHardy2 doi:10.7717/peerj.1603 Metaphlan doi:10.1038/nmeth.2066 Meng2016_mllib arxiv:1505.06807 Min2016_deepenhancer doi:10.1109/BIBM.2016.7822593 +Momeni2018 doi:10.1101/438341 Moritz2015_sparknet arxiv:1511.06051 Mordvintsev2015_inceptionism url:http://googleresearch.blogspot.co.uk/2015/06/inceptionism-going-deeper-into-neural.html Mrzelj url:https://repozitorij.uni-lj.si/IzpisGradiva.php?id=85515 @@ -171,6 +179,7 @@ matis doi:10.1016/S0097-8485(96)80015-5 nbc doi:10.1093/bioinformatics/btq619 Murdoch2017_automatic arxiv:1702.02540 Nemati2016_rl doi:10.1109/EMBC.2016.7591355 +Ni2018 doi:10.1101/385849 Nguyen2014_adversarial arxiv:1412.1897v4 Ngiam2011 url:https://ai.stanford.edu/~ang/papers/icml11-MultimodalDeepLearning.pdf Nie2016_3d_survival doi:10.1007/978-3-319-46723-8_25 @@ -182,7 +191,9 @@ onecodex url:https://www.onecodex.com/ Papernot2017_pate url:https://openreview.net/forum?id=HkwoSDPgg Park2016_deepmirgene arxiv:1605.00017 Parnamaa2017 doi:10.1534/g3.116.033654 +Pan2018 doi:10.1101/438218 Pawlowski2016 doi:10.1101/085118 +Peng2019 doi:10.1101/527044. Pereira2016_docking doi:10.1021/acs.jcim.6b00355 PerezSianes2016_screening doi:10.1007/978-3-319-40126-3_2 Phymm doi:10.1038/nmeth.1358 @@ -191,6 +202,7 @@ Pratt2016_dr doi:10.1016/j.procs.2016.07.014 Quang2017_factor doi:10.1101/151274 Qin2017_onehot doi:10.1371/journal.pcbi.1005403 Qiu2017_graph_embedding doi:10.1101/110668 +Qiu2018 doi:10.1101/406066 Ragoza2016_protein arxiv:1612.02751 RAD2010_view_cc doi:10.1145/1721654.1721672 Radford_dcgan arxiv:1511.06434v2 @@ -247,9 +259,7 @@ Tan2014_psb doi:10.1142/9789814644730_0014 Tan2015_adage doi:10.1128/mSystems.00025-15 Tan2016_eadage doi:10.1101/078659 TAC-ELM doi:10.1142/S0219720012500151 -Teschendorff2017 doi:10.2217/epi-2016-0153 TensorFlow arxiv:1603.04467 -TitusReview2017 doi:10.1093/hmg/ddx275 Torracinta2016_deep_snp doi:10.1101/097469 Torracinta2016_sim doi:10.1101/079087 Tu1996_anns doi:10.1016/S0895-4356(96)00002-9 @@ -259,6 +269,7 @@ Vera2016_sc_analysis doi:10.1146/annurev-genet-120215-034854 Vervier doi:10.1093/bioinformatics/btv683 Wallach2015_atom_net arxiv:1510.02855 Wang2016_breast_cancer arxiv:1606.05718 +Wang2016_methyl doi:10.1038/srep19598 Wang2016_protein_contact doi:10.1371/journal.pcbi.1005324 Wasson1985_clinical doi:10.1056/NEJM198509263131306 WayGreene2017_eval arxiv:1711.04828 From e0ad1ed7a0f24abf346571c0f5ed4ca27559d9b6 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Thu, 9 May 2019 17:34:49 -0400 Subject: [PATCH 04/15] Tab-delimited hopefully. --- content/citation-tags.tsv | 30 +++++++++++++++--------------- 1 file changed, 15 insertions(+), 15 deletions(-) diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index 00aada8d..e68fe288 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -10,7 +10,7 @@ Asgari doi:10.1371/journal.pone.0141287 blast doi:10.1016/S0022-2836(05)80360-2 Angermueller2016_dl_review doi:10.15252/msb.20156651 Angermueller2016_single_methyl doi:10.1186/s13059-017-1189-z -Angermueller2017 doi:10.1186/s13059-017-1189-z +Angermueller2017 doi:10.1186/s13059-017-1189-z Artemov2016_clinical doi:10.1101/095653 Arvaniti2016_rare_subsets doi:10.1101/046508 Bach2015_on doi:10.1371/journal.pone.0130140 @@ -28,7 +28,7 @@ Bracken2016_mirna doi:10.1038/nrg.2016.134 Boza doi:10.1371/journal.pone.0178751 Buggenthin2017_imaged_lineage doi:10.1038/nmeth.4182 Burlina2016_amd doi:10.1109/ISBI.2016.7493240 -Chatterjee2018 arxiv:1807.09617 +Chatterjee2018 arxiv:1807.09617 Caruana2014_need arxiv:1312.6184 Caruana2015_intelligible url:https://dl.acm.org/citation.cfm?id=2788613 Chaudhary2017_multiom_liver_cancer doi:10.1101/114892 @@ -48,7 +48,7 @@ Codella2016_ensemble_melanoma arxiv:1610.04662 Consortium2012_encode doi:10.1038/nature11247 CudNN arxiv:1410.0759 Dahl2014_multi_qsar arxiv:1406.1231 -Darst2018 doi:10.1186/s12863-018-0646-3 +Darst2018 doi:10.1186/s12863-018-0646-3 Dean2012_nips_downpour url:http://research.google.com/archive/large_deep_networks_nips2012.html DeepChem url:https://github.com/deepchem/deepchem Deming2016_genetic arxiv:1605.07156 @@ -73,7 +73,7 @@ Esteva2017_skin_cancer_nature doi:10.1038/nature21056 Faruqi url:http://alifar76.github.io/sklearn-metrics/ Finnegan2017_maximum doi:10.1101/105957 Fong2017_perturb doi:10.1109/ICCV.2017.371 -Fu2019 doi:10.1109/TCBB.2019.2909237 +Fu2019 doi:10.1109/TCBB.2019.2909237 Gal2015_dropout arxiv:1506.02142 Gaublomme2015_th17 doi:10.1016/j.cell.2015.11.009 Gargeya2017_dr doi:10.1016/j.ophtha.2017.02.008 @@ -104,7 +104,7 @@ Hubara2016_qnn arxiv:1609.07061 Huddar2016_predicting doi:10.1109/ACCESS.2016.2618775 Hughes2016_macromol_react doi:10.1021/acscentsci.6b00162 Iglovikov2017_baa doi:10.1101/234120 -Islam2018 doi:10.1186/s12919-018-0121-1 +Islam2018 doi:10.1186/s12919-018-0121-1 Ithapu2015_efficient doi:10.1016/j.jalz.2015.01.010 Jafari2016_skin_lesions doi:10.1007/s11548-017-1567-8 Jha2017_integrative_models doi:10.1101/104869 @@ -126,12 +126,12 @@ Koh2016_denoising doi:10.1101/052118 Koh2017_understanding arxiv:1703.04730 Kooi2016_mamm_lesions doi:10.1016/j.media.2016.07.007 Kooi2017_mamm_tl doi:10.1002/mp.12110 -Korfiatis2017 doi:10.1007/s10278-017-0009-z +Korfiatis2017 doi:10.1007/s10278-017-0009-z Kraus2017_deeploc doi:10.15252/msb.20177551 Krizhevsky2013_nips_cnn url:https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf Krizhevsky2014_weird_trick arxiv:1404.5997 -Khwaja2017 doi:10.1109/BIOCAS.2017.8325078 -Khwaja2018 arxiv:1810.01243 +Khwaja2017 doi:10.1109/BIOCAS.2017.8325078 +Khwaja2018 arxiv:1810.01243 Lacey2016_dl_fpga arxiv:1602.04283 Lakhani2017_radiography doi:10.1148/radiol.2017162326 Lanchantin2016_motif arxiv:1608.03644 @@ -139,7 +139,7 @@ Lee2016_deeptarget arxiv:1603.09123v2 Lee2016_emr_oct_amd doi:10.1101/094276 Lei2016_rationalizing arxiv:1606.04155 Leibig2016_dr doi:10.1101/084210 -Levy-Jurgenson2018 doi:10.1101/491357 +Levy-Jurgenson2018 doi:10.1101/491357 Li2014_minibatch doi:10.1145/2623330.2623612 Li2016_variation doi:10.1126/science.aad9417 Liang2015_exprs_cancer doi:10.1109/TCBB.2014.2377729 @@ -171,7 +171,7 @@ McHardy2 doi:10.7717/peerj.1603 Metaphlan doi:10.1038/nmeth.2066 Meng2016_mllib arxiv:1505.06807 Min2016_deepenhancer doi:10.1109/BIBM.2016.7822593 -Momeni2018 doi:10.1101/438341 +Momeni2018 doi:10.1101/438341 Moritz2015_sparknet arxiv:1511.06051 Mordvintsev2015_inceptionism url:http://googleresearch.blogspot.co.uk/2015/06/inceptionism-going-deeper-into-neural.html Mrzelj url:https://repozitorij.uni-lj.si/IzpisGradiva.php?id=85515 @@ -179,7 +179,7 @@ matis doi:10.1016/S0097-8485(96)80015-5 nbc doi:10.1093/bioinformatics/btq619 Murdoch2017_automatic arxiv:1702.02540 Nemati2016_rl doi:10.1109/EMBC.2016.7591355 -Ni2018 doi:10.1101/385849 +Ni2018 doi:10.1101/385849 Nguyen2014_adversarial arxiv:1412.1897v4 Ngiam2011 url:https://ai.stanford.edu/~ang/papers/icml11-MultimodalDeepLearning.pdf Nie2016_3d_survival doi:10.1007/978-3-319-46723-8_25 @@ -191,9 +191,9 @@ onecodex url:https://www.onecodex.com/ Papernot2017_pate url:https://openreview.net/forum?id=HkwoSDPgg Park2016_deepmirgene arxiv:1605.00017 Parnamaa2017 doi:10.1534/g3.116.033654 -Pan2018 doi:10.1101/438218 +Pan2018 doi:10.1101/438218 Pawlowski2016 doi:10.1101/085118 -Peng2019 doi:10.1101/527044. +Peng2019 doi:10.1101/527044. Pereira2016_docking doi:10.1021/acs.jcim.6b00355 PerezSianes2016_screening doi:10.1007/978-3-319-40126-3_2 Phymm doi:10.1038/nmeth.1358 @@ -202,7 +202,7 @@ Pratt2016_dr doi:10.1016/j.procs.2016.07.014 Quang2017_factor doi:10.1101/151274 Qin2017_onehot doi:10.1371/journal.pcbi.1005403 Qiu2017_graph_embedding doi:10.1101/110668 -Qiu2018 doi:10.1101/406066 +Qiu2018 doi:10.1101/406066 Ragoza2016_protein arxiv:1612.02751 RAD2010_view_cc doi:10.1145/1721654.1721672 Radford_dcgan arxiv:1511.06434v2 @@ -269,7 +269,7 @@ Vera2016_sc_analysis doi:10.1146/annurev-genet-120215-034854 Vervier doi:10.1093/bioinformatics/btv683 Wallach2015_atom_net arxiv:1510.02855 Wang2016_breast_cancer arxiv:1606.05718 -Wang2016_methyl doi:10.1038/srep19598 +Wang2016_methyl doi:10.1038/srep19598 Wang2016_protein_contact doi:10.1371/journal.pcbi.1005324 Wasson1985_clinical doi:10.1056/NEJM198509263131306 WayGreene2017_eval arxiv:1711.04828 From 5a85c4cba5b350cf2e04aa2d8ad336b650facef4 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Thu, 9 May 2019 21:00:07 -0400 Subject: [PATCH 05/15] Fixed typo in citations --- content/citation-tags.tsv | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index e68fe288..7de3c341 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -193,7 +193,7 @@ Park2016_deepmirgene arxiv:1605.00017 Parnamaa2017 doi:10.1534/g3.116.033654 Pan2018 doi:10.1101/438218 Pawlowski2016 doi:10.1101/085118 -Peng2019 doi:10.1101/527044. +Peng2019 doi:10.1101/527044 Pereira2016_docking doi:10.1021/acs.jcim.6b00355 PerezSianes2016_screening doi:10.1007/978-3-319-40126-3_2 Phymm doi:10.1038/nmeth.1358 From f73bf91458e2d1227a2a14e54fc8f263fbc249c8 Mon Sep 17 00:00:00 2001 From: Joshua Levy Date: Fri, 19 Jul 2019 22:30:17 -0400 Subject: [PATCH 06/15] Update content/04.study.md Co-Authored-By: Casey Greene --- content/04.study.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/04.study.md b/content/04.study.md index 0fde4f34..0d424c98 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -57,7 +57,7 @@ DeepSignal employs a convolutional neural network to construct features from raw DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional GRUs [@tag:Angermueller2017] and methods like DAPL and DeepMethyl incorporate sequence and topological structure [@tag:Qiu2018] [@tag:Khwaja2017] [@tag:Wang2016_methyl] [@tag:Fu2019]. In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018][@tag:Korfiatis2017]. While these examples of methylation imputation and inference methods have value it is imperative to recognize limitations of imputing cytosine modifications. -Not only does imputing DNA methylation share similar limitations with genotype imputation, but there is additional complexity in correlation of DNA methylation that can depend on cell type and other variables related with DNA methylation that can vary by sample. +Imputing DNA methylation has complexities above and beyond genotype imputation: correlation of DNA methylation marks can depend on cell types and other factors that can vary by sample. As the abundance of available tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing), grows, the accuracy of imputation methods for DNA methylation is expected to increase. Once DNA methylation is measured, deep learning approaches can also be used to perform classification and regression tasks. From d4966516385a40548dd62da44896d04574c12cf3 Mon Sep 17 00:00:00 2001 From: Joshua Levy Date: Fri, 19 Jul 2019 22:30:39 -0400 Subject: [PATCH 07/15] Update content/04.study.md Co-Authored-By: Casey Greene --- content/04.study.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/04.study.md b/content/04.study.md index 0d424c98..b819e01f 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -58,7 +58,7 @@ DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018][@tag:Korfiatis2017]. While these examples of methylation imputation and inference methods have value it is imperative to recognize limitations of imputing cytosine modifications. Imputing DNA methylation has complexities above and beyond genotype imputation: correlation of DNA methylation marks can depend on cell types and other factors that can vary by sample. -As the abundance of available tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing), grows, the accuracy of imputation methods for DNA methylation is expected to increase. +As the number of tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing) grows, the accuracy of DNA methylation imputation is expected to increase. Once DNA methylation is measured, deep learning approaches can also be used to perform classification and regression tasks. For instance, one group employed a Deep Neural Network (DNN) to predict triglyceride concentrations pre- and post-treatment from approximately 450K features (differential DNAm levels) from the Illumina 450K microarray, and used the Dropout technique to generalize the model [@tag:Islam2018] [@tag:Darst2018]. From 6a8c0fa420c4c753147dd164c3e5cdc44b5897bf Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Fri, 19 Jul 2019 23:37:46 -0400 Subject: [PATCH 08/15] Added 2 studies, and summarized a bit more text --- content/04.study.md | 10 +++++----- content/citation-tags.tsv | 2 ++ 2 files changed, 7 insertions(+), 5 deletions(-) diff --git a/content/04.study.md b/content/04.study.md index b819e01f..c439206e 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -54,16 +54,16 @@ For example, the effects of cellular heterogeneity on basic biology and disease Deep learning approaches are beginning to help address some of the current limitations of feature-by-feature analysis approaches to DNA methylation data, and may help uncover additional important features necessary to understand the biological underpinnings behind different pathological states. One of the more popular applications is the prediction of the degree of methylation at CpG sites neighboring measured sites. DeepSignal employs a convolutional neural network to construct features from raw electrical Nanopore signals from sites near a methylated base, and concatenates uses a bi-directional recurrent neural network on DNA sequences of the aligned signals to detect methylation [@tag:Ni2018]. -DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional GRUs [@tag:Angermueller2017] and methods like DAPL and DeepMethyl incorporate sequence and topological structure [@tag:Qiu2018] [@tag:Khwaja2017] [@tag:Wang2016_methyl] [@tag:Fu2019]. -In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018][@tag:Korfiatis2017]. +DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional GRUs [@tag:Angermueller2017] and methods like DAPL, MRCNN and DeepMethyl incorporate sequence and topological structure [@tag:Qiu2018] [@tag:Tian2019] [@tag:Khwaja2017] [@tag:Wang2016_methyl] [@tag:Fu2019]. +In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018] [@tag:Korfiatis2017]. While these examples of methylation imputation and inference methods have value it is imperative to recognize limitations of imputing cytosine modifications. Imputing DNA methylation has complexities above and beyond genotype imputation: correlation of DNA methylation marks can depend on cell types and other factors that can vary by sample. As the number of tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing) grows, the accuracy of DNA methylation imputation is expected to increase. Once DNA methylation is measured, deep learning approaches can also be used to perform classification and regression tasks. -For instance, one group employed a Deep Neural Network (DNN) to predict triglyceride concentrations pre- and post-treatment from approximately 450K features (differential DNAm levels) from the Illumina 450K microarray, and used the Dropout technique to generalize the model [@tag:Islam2018] [@tag:Darst2018]. -Another study transformed methylation profiles of about ten thousand TCGA samples to perform classification tasks to differentiate 32 different cancer types using the concatenation of various Convolutional Neural Network Maps and learn important patterns of differentially methylated regions that were used to make the classifications [@tag:Chatterjee2018]. -Finally, the prediction of cancer subtypes using DNAm was proposed based on a deep autoencoder. The system exploited content retrieval mechanisms to additionally understand the cancer cell type differentiation of the predicted cancer types [@tag:Khwaja2018] based on methylation of CpG islands. +For instance, Deep Neural Networks (DNN) have been employed on DNA methylation data to predict triglyceride concentrations pre- and post-treatment [@tag:Islam2018] [@tag:Darst2018] and differentiate cancer subtypes [@tag:Chatterjee2018] [@tag:Khwaja2018] while outperforming other methods such as Support Vector Machine (SVM). +Modular approaches to methylation prediction, such as MethylNet, have been able to predict age, cellular proportions and cancer subtypes, outperforming SVM and Elastic Net models while remaining concordant with expected biology [@tag:Levy2019]. +These approaches aim to make embedding, hyperparameter selection, regression, classification and model interpretation tasks more tractable for epigenetics researchers and machine learning scientists. ### Splicing diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index 7de3c341..a4b9686a 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -139,6 +139,7 @@ Lee2016_deeptarget arxiv:1603.09123v2 Lee2016_emr_oct_amd doi:10.1101/094276 Lei2016_rationalizing arxiv:1606.04155 Leibig2016_dr doi:10.1101/084210 +Levy2019 doi:10.1101/692665 Levy-Jurgenson2018 doi:10.1101/491357 Li2014_minibatch doi:10.1145/2623330.2623612 Li2016_variation doi:10.1126/science.aad9417 @@ -260,6 +261,7 @@ Tan2015_adage doi:10.1128/mSystems.00025-15 Tan2016_eadage doi:10.1101/078659 TAC-ELM doi:10.1142/S0219720012500151 TensorFlow arxiv:1603.04467 +Tian2019 doi:10.1186/s12864-019-5488-5 Torracinta2016_deep_snp doi:10.1101/097469 Torracinta2016_sim doi:10.1101/079087 Tu1996_anns doi:10.1016/S0895-4356(96)00002-9 From cc2437f8fc20d61581742c3961240a0fa429ffe1 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Sun, 21 Jul 2019 21:44:03 -0400 Subject: [PATCH 09/15] Response to recommendations, methylation section --- content/04.study.md | 3 ++- content/citation-tags.tsv | 4 ++-- 2 files changed, 4 insertions(+), 3 deletions(-) diff --git a/content/04.study.md b/content/04.study.md index c439206e..17106539 100644 --- a/content/04.study.md +++ b/content/04.study.md @@ -52,13 +52,14 @@ For example, the effects of cellular heterogeneity on basic biology and disease #### Inference, Imputation, and Prediction Deep learning approaches are beginning to help address some of the current limitations of feature-by-feature analysis approaches to DNA methylation data, and may help uncover additional important features necessary to understand the biological underpinnings behind different pathological states. -One of the more popular applications is the prediction of the degree of methylation at CpG sites neighboring measured sites. +One of the more popular applications is imputing the degree of methylation at CpG sites that are within a few thousand base pairs of measured sites or present in similar samples. DeepSignal employs a convolutional neural network to construct features from raw electrical Nanopore signals from sites near a methylated base, and concatenates uses a bi-directional recurrent neural network on DNA sequences of the aligned signals to detect methylation [@tag:Ni2018]. DeepCpG applies a similar method using scBS-Seq, DNA sequence and Bidirectional GRUs [@tag:Angermueller2017] and methods like DAPL, MRCNN and DeepMethyl incorporate sequence and topological structure [@tag:Qiu2018] [@tag:Tian2019] [@tag:Khwaja2017] [@tag:Wang2016_methyl] [@tag:Fu2019]. In addition to this, Gene expression has been used to infer and impute methylation states [@tag:Peng2019] [@tag:Levy-Jurgenson2018], methylation of genes predicted from promoter methylation [@tag:Pan2018], and convolutional models have been able to predict methylation status from images [@tag:Momeni2018] [@tag:Korfiatis2017]. While these examples of methylation imputation and inference methods have value it is imperative to recognize limitations of imputing cytosine modifications. Imputing DNA methylation has complexities above and beyond genotype imputation: correlation of DNA methylation marks can depend on cell types and other factors that can vary by sample. As the number of tissue types and cell types with whole-genome bisulfite sequencing (and oxidative bisulfite sequencing) grows, the accuracy of DNA methylation imputation is expected to increase. +While these methods reduce the computational overhead at comparable performance to other popular methylation imputation methods such as K-Nearest Neighbors, Random Forest, Singular Value Decomposition and Multiple Imputation by Chained Equations, the software implementations will need to become more user-friendly to gain widespread adoption. Once DNA methylation is measured, deep learning approaches can also be used to perform classification and regression tasks. For instance, Deep Neural Networks (DNN) have been employed on DNA methylation data to predict triglyceride concentrations pre- and post-treatment [@tag:Islam2018] [@tag:Darst2018] and differentiate cancer subtypes [@tag:Chatterjee2018] [@tag:Khwaja2018] while outperforming other methods such as Support Vector Machine (SVM). diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv index a4b9686a..2a6e3170 100644 --- a/content/citation-tags.tsv +++ b/content/citation-tags.tsv @@ -139,7 +139,7 @@ Lee2016_deeptarget arxiv:1603.09123v2 Lee2016_emr_oct_amd doi:10.1101/094276 Lei2016_rationalizing arxiv:1606.04155 Leibig2016_dr doi:10.1101/084210 -Levy2019 doi:10.1101/692665 +Levy2019 doi:10.1101/692665 Levy-Jurgenson2018 doi:10.1101/491357 Li2014_minibatch doi:10.1145/2623330.2623612 Li2016_variation doi:10.1126/science.aad9417 @@ -261,7 +261,7 @@ Tan2015_adage doi:10.1128/mSystems.00025-15 Tan2016_eadage doi:10.1101/078659 TAC-ELM doi:10.1142/S0219720012500151 TensorFlow arxiv:1603.04467 -Tian2019 doi:10.1186/s12864-019-5488-5 +Tian2019 doi:10.1186/s12864-019-5488-5 Torracinta2016_deep_snp doi:10.1101/097469 Torracinta2016_sim doi:10.1101/079087 Tu1996_anns doi:10.1016/S0895-4356(96)00002-9 From 397fddc9df3107632d19c347b4fa957ae32b2b17 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Wed, 4 Mar 2020 15:05:54 -0500 Subject: [PATCH 10/15] Update metadata.yaml --- content/metadata.yaml | 33 +++++++++++++++++++++++++++++++++ 1 file changed, 33 insertions(+) diff --git a/content/metadata.yaml b/content/metadata.yaml index dbf55c15..b2de155a 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -70,6 +70,39 @@ authors: coi: string: "None" lastapproved: !!str 2017-05-26 + - github: jlevy44 + name: Joshua J. Levy + orcid: 0000-0001-8050-1291 + email: joshua.j.levy.gr@dartmouth.edu + affiliations: + - Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH + funders: Burroughs Wellcome Fund Big Data in the Life Sciences training grant at Dartmouth + v1: "Drafted one or more subsections" + v2: "" + coi: + string: "None" + - github: brockclarke + name: Brock C. Christensen + orcid: 0000-0003-3022-426X + email: brock.christensen@dartmouth.edu + affiliations: + - Department of Epidemiology, Geisel School of Medicine, Dartmouth College, Lebanon, NH + funders: NIH R01CA216265, Dartmouth College Neukom Institute for Computational Science CompX award + v1: "Drafted one or more subsections" + v2: "" + coi: + string: "None" + - github: AlexanderTitus + name: Alexander J. Titus + orcid: 0000-0002-0145-9564 + email: alexander.j.titus@gmail.com + affiliations: + - Office of the Under Secretary of Defense for Research & Engineering, Department of Defense, Washington, DC + funders: NIH T32LM012204 + v1: "Drafted one or more subsections" + v2: "" + coi: + string: "None" - github: gwaygenomics name: Gregory P. Way orcid: 0000-0002-0503-9348 From ebf0e5587ee190d14bffdfffc1f5e600aad4f58c Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Wed, 4 Mar 2020 15:08:25 -0500 Subject: [PATCH 11/15] Delete citation-tags.tsv --- content/citation-tags.tsv | 318 -------------------------------------- 1 file changed, 318 deletions(-) delete mode 100644 content/citation-tags.tsv diff --git a/content/citation-tags.tsv b/content/citation-tags.tsv deleted file mode 100644 index 2a6e3170..00000000 --- a/content/citation-tags.tsv +++ /dev/null @@ -1,318 +0,0 @@ -tag citation -Abe doi:10.1101/gr.634603 -Abramoff2016_dr doi:10.1167/iovs.16-19964 -Agarwal2015_targetscan doi:10.7554/eLife.05005 -Aitchison2017 url:http://papers.nips.cc/paper/6940-model-based-bayesian-inference-of-neural-activity-and-connectivity-from-all-optical-interrogation-of-a-neural-circuit -Alipanahi2015_predicting doi:10.1038/nbt.3300 -AltaeTran2016_one_shot doi:10.1021/acscentsci.6b00367 -Amit2017_breast_mri doi:10.1117/12.2249981 -Asgari doi:10.1371/journal.pone.0141287 -blast doi:10.1016/S0022-2836(05)80360-2 -Angermueller2016_dl_review doi:10.15252/msb.20156651 -Angermueller2016_single_methyl doi:10.1186/s13059-017-1189-z -Angermueller2017 doi:10.1186/s13059-017-1189-z -Artemov2016_clinical doi:10.1101/095653 -Arvaniti2016_rare_subsets doi:10.1101/046508 -Bach2015_on doi:10.1371/journal.pone.0130140 -Bahdanu2014_neural arxiv:1409.0473 -Baskin2015_drug_disc doi:10.1080/17460441.2016.1201262 -Bar2015_nonmed_tl doi:10.1117/12.2083124 -Barash2010_splicing_code doi:10.1038/nature09000 -Baxt1991_myocardial doi:10.7326/0003-4819-115-11-843 -BeaulieuJones2016_ehr_encode doi:10.1016/j.jbi.2016.10.007 -Bengio2015_prec arxiv:1412.7024 -Berezikov2011_mirna doi:10.1038/nrg3079 -Bergstra2011_hyper url:https://papers.nips.cc/paper/4443-algorithms-for-hyper-parameter-optimization.pdf -Bergstra2012_random url:http://www.jmlr.org/papers/v13/bergstra12a.html -Bracken2016_mirna doi:10.1038/nrg.2016.134 -Boza doi:10.1371/journal.pone.0178751 -Buggenthin2017_imaged_lineage doi:10.1038/nmeth.4182 -Burlina2016_amd doi:10.1109/ISBI.2016.7493240 -Chatterjee2018 arxiv:1807.09617 -Caruana2014_need arxiv:1312.6184 -Caruana2015_intelligible url:https://dl.acm.org/citation.cfm?id=2788613 -Chaudhary2017_multiom_liver_cancer doi:10.1101/114892 -Che2015_distill arxiv:1512.03542 -Che2016_rnn arxiv:1606.01865 -Chen2015_hashing arxiv:1504.04788 -Chen2016_exprs_yeast doi:10.1186/s12859-015-0852-1 -Chen2016_gene_expr doi:10.1093/bioinformatics/btw074 -Chen2015_trans_species doi:10.1093/bioinformatics/btv315 -Choi2016_retain arxiv:1608.05745 -Choi2016_gram arxiv:1611.07012 -Chollet2016_xception arxiv:1610.02357 -Chryssolouris1996_confidence doi:10.1109/72.478409 -Ciresan2013_mitosis doi:10.1007/978-3-642-40763-5_51 -Coates2013_cots_hpc url:http://www.jmlr.org/proceedings/papers/v28/coates13.html -Codella2016_ensemble_melanoma arxiv:1610.04662 -Consortium2012_encode doi:10.1038/nature11247 -CudNN arxiv:1410.0759 -Dahl2014_multi_qsar arxiv:1406.1231 -Darst2018 doi:10.1186/s12863-018-0646-3 -Dean2012_nips_downpour url:http://research.google.com/archive/large_deep_networks_nips2012.html -DeepChem url:https://github.com/deepchem/deepchem -Deming2016_genetic arxiv:1605.07156 -Ding doi:10.1186/s12859-015-0753-3 -Ditzler doi:10.1186/s12859-015-0793-8 -Ditzler2 doi:10.1109/TNNLS.2014.2320415 -Ditzler3 doi:10.1109/TNB.2015.2461219 -Dhungel2015_struct_pred_mamm doi:10.1007/978-3-319-24553-9_74 -Dhungel2016_mamm doi:10.1007/978-3-319-46723-8_13 -Dhungel2017_mamm_min_interv doi:10.1016/j.media.2017.01.009 -Dream_tf_binding url:https://www.synapse.org/#!Synapse:syn6131484/wiki/402026 -Dragonn url:http://kundajelab.github.io/dragonn/ -Duvenaud2015_graph_conv url:http://papers.nips.cc/paper/5954-convolutional-networks-on-graphs-for-learning-molecular-fingerprints -Edwards2015_growing_pains doi:10.1145/2771283 -Ehran2009_visualizing url:http://www.iro.umontreal.ca/~lisa/publications2/index.php/publications/show/247 -Elephas url:https://github.com/maxpumperla/elephas -Errington2014_reproducibility doi:10.7554/eLife.04333 -Eser2016_fiddle doi:10.1101/081380 -Esfahani2016_melanoma doi:10.1109/EMBC.2016.7590963 -Essinger2010_taxonomic doi:10.1109/IJCNN.2010.5596644 -Esteva2017_skin_cancer_nature doi:10.1038/nature21056 -Faruqi url:http://alifar76.github.io/sklearn-metrics/ -Finnegan2017_maximum doi:10.1101/105957 -Fong2017_perturb doi:10.1109/ICCV.2017.371 -Fu2019 doi:10.1109/TCBB.2019.2909237 -Gal2015_dropout arxiv:1506.02142 -Gaublomme2015_th17 doi:10.1016/j.cell.2015.11.009 -Gargeya2017_dr doi:10.1016/j.ophtha.2017.02.008 -Gawad2016_singlecell doi:10.1038/nrg.2015.16 -Geras2017_multiview_mamm doi:10.1038/nrg.2015.16 -Gerstein2016_scaling doi:10.1186/s13059-016-0917-0 -Ghandi2014_enhanced doi:10.1371/journal.pcbi.1003711 -Ghosh1992_sequence doi:10.1117/12.140112 -Glorot2011_domain url:http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.231.3442 -Goldsborough2017_cytogan doi:10.1101/227645 -Gomezb2016_automatic arxiv:1610.02415v1 -Graphlab doi:10.14778/2212351.2212354 -Groop1986_islet doi:10.2337/diab.35.2.237 -Gultepe2014_sepsis doi:10.1136/amiajnl-2013-001815 -Gulshan2016_dt doi:10.1001/jama.2016.17216 -Gupta2015_exprs_yeast doi:10.1101/031906 -Gupta2015_prec arxiv:1502.02551 -Guetterman url:http://www.fasebj.org/content/30/1_Supplement/406.3 -Hadjas2015_cc arxiv:1504.04343 -He2015_images arxiv:1512.03385 -Hinton2006_autoencoders doi:10.1126/science.1127647 -Hinton2015_dark_knowledge arxiv:1503.02531 -Hinton2015_dk arxiv:1503.02531v1 -Hochreiter doi:10.1093/bioinformatics/btm247 -Hoff doi:10.1093/nar/gkp327 -Horton1992_assessment doi:10.1093/nar/20.16.4331 -Hubara2016_qnn arxiv:1609.07061 -Huddar2016_predicting doi:10.1109/ACCESS.2016.2618775 -Hughes2016_macromol_react doi:10.1021/acscentsci.6b00162 -Iglovikov2017_baa doi:10.1101/234120 -Islam2018 doi:10.1186/s12919-018-0121-1 -Ithapu2015_efficient doi:10.1016/j.jalz.2015.01.010 -Jafari2016_skin_lesions doi:10.1007/s11548-017-1567-8 -Jha2017_integrative_models doi:10.1101/104869 -Johnson2017_integ_cell arxiv:1705.00092 -JuanMateu2016_t1d doi:10.1530/EJE-15-0916 -Kahng2017_activis arxiv:1704.01942 -Kalinin2018_pgx arxiv:1801.08570 -Karlin doi:10.1128/jb.179.12.3899-3913.1997 -Karpathy2015_visualizing arxiv:1506.02078 -Katzman2016_deepsurv arxiv:1606.00931 -Kearnes2016_admet arxiv:1606.08793 -Kearnes2016_graph_conv doi:10.1007/s10822-016-9938-8 -Kelley2016_basset doi:10.1101/gr.200535.115 -Keras url:https://github.com/fchollet/keras -Kizek doi:10.1016/j.bjid.2015.08.013 -Kindermans2016_investigating arxiv:1611.07270 -Knights doi:10.1111/j.1574-6976.2010.00251.x -Koh2016_denoising doi:10.1101/052118 -Koh2017_understanding arxiv:1703.04730 -Kooi2016_mamm_lesions doi:10.1016/j.media.2016.07.007 -Kooi2017_mamm_tl doi:10.1002/mp.12110 -Korfiatis2017 doi:10.1007/s10278-017-0009-z -Kraus2017_deeploc doi:10.15252/msb.20177551 -Krizhevsky2013_nips_cnn url:https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf -Krizhevsky2014_weird_trick arxiv:1404.5997 -Khwaja2017 doi:10.1109/BIOCAS.2017.8325078 -Khwaja2018 arxiv:1810.01243 -Lacey2016_dl_fpga arxiv:1602.04283 -Lakhani2017_radiography doi:10.1148/radiol.2017162326 -Lanchantin2016_motif arxiv:1608.03644 -Lee2016_deeptarget arxiv:1603.09123v2 -Lee2016_emr_oct_amd doi:10.1101/094276 -Lei2016_rationalizing arxiv:1606.04155 -Leibig2016_dr doi:10.1101/084210 -Levy2019 doi:10.1101/692665 -Levy-Jurgenson2018 doi:10.1101/491357 -Li2014_minibatch doi:10.1145/2623330.2623612 -Li2016_variation doi:10.1126/science.aad9417 -Liang2015_exprs_cancer doi:10.1109/TCBB.2014.2377729 -Lin2017_why_dl_works arxiv:1608.08225v3 -Lipton2016_missing arxiv:1606.04130 -Lipton2015_lstm arxiv:1510.07641 -Litjens2016_histopath_survey doi:10.1038/srep26286 -Litjens2017_medimage_survey doi:10.1016/j.media.2017.07.005 -Lisboa2006_review doi:10.1016/j.neunet.2005.10.007 -Liu doi:10.1371/journal.pone.0053253 -Liu2016_towards arxiv:1604.07043 -Liu2016_sc_transcriptome doi:10.12688/f1000research.7223.1 -Lodato2015_neurons doi:10.1126/science.aab1785 -Lowe2012_kaggle url:http://blogs.sciencemag.org/pipeline/archives/2012/12/11/did_kaggle_predict_drug_candidate_activities_or_not -lmat doi:10.1093/bioinformatics/btt389 -Lundberg2016_an arxiv:1611.07478 -Lusci2015_irv doi:10.1186/s13321-015-0110-6 -Lusci2013_rnn doi:10.1021/ci400187y -Ma2015_qsar_merck doi:10.1021/ci500747n -Maaten2008_tsne url:http://www.jmlr.org/papers/v9/vandermaaten08a.html -Mahendran2014_understanding arxiv:1412.0035 -Mahendran2016_salient doi:10.1007/978-3-319-46466-4_8 -Mahendran2016_visualizing doi:10.1007/s11263-016-0911-8 -Mahmood doi:10.1016/S0140-6736(13)61752-3 -Mapreduce doi:10.1145/1327452.1327492 -Mayr2016_deep_tox doi:10.3389/fenvs.2015.00080 -McHardy doi:10.1038/nmeth976 -McHardy2 doi:10.7717/peerj.1603 -Metaphlan doi:10.1038/nmeth.2066 -Meng2016_mllib arxiv:1505.06807 -Min2016_deepenhancer doi:10.1109/BIBM.2016.7822593 -Momeni2018 doi:10.1101/438341 -Moritz2015_sparknet arxiv:1511.06051 -Mordvintsev2015_inceptionism url:http://googleresearch.blogspot.co.uk/2015/06/inceptionism-going-deeper-into-neural.html -Mrzelj url:https://repozitorij.uni-lj.si/IzpisGradiva.php?id=85515 -matis doi:10.1016/S0097-8485(96)80015-5 -nbc doi:10.1093/bioinformatics/btq619 -Murdoch2017_automatic arxiv:1702.02540 -Nemati2016_rl doi:10.1109/EMBC.2016.7591355 -Ni2018 doi:10.1101/385849 -Nguyen2014_adversarial arxiv:1412.1897v4 -Ngiam2011 url:https://ai.stanford.edu/~ang/papers/icml11-MultimodalDeepLearning.pdf -Nie2016_3d_survival doi:10.1007/978-3-319-46723-8_25 -NIH2016_genome_cost url:https://www.genome.gov/27565109/the-cost-of-sequencing-a-human-genome/ -Nih_curiosity url:https://www.nigms.nih.gov/Education/Documents/curiosity.pdf -Olivecrona2017_drug_design arxiv:1704.07555 -Osokin2017_biogan arxiv:1708.04692 -onecodex url:https://www.onecodex.com/ -Papernot2017_pate url:https://openreview.net/forum?id=HkwoSDPgg -Park2016_deepmirgene arxiv:1605.00017 -Parnamaa2017 doi:10.1534/g3.116.033654 -Pan2018 doi:10.1101/438218 -Pawlowski2016 doi:10.1101/085118 -Peng2019 doi:10.1101/527044 -Pereira2016_docking doi:10.1021/acs.jcim.6b00355 -PerezSianes2016_screening doi:10.1007/978-3-319-40126-3_2 -Phymm doi:10.1038/nmeth.1358 -Poplin2016_deepvariant doi:10.1101/092890 -Pratt2016_dr doi:10.1016/j.procs.2016.07.014 -Quang2017_factor doi:10.1101/151274 -Qin2017_onehot doi:10.1371/journal.pcbi.1005403 -Qiu2017_graph_embedding doi:10.1101/110668 -Qiu2018 doi:10.1101/406066 -Ragoza2016_protein arxiv:1612.02751 -RAD2010_view_cc doi:10.1145/1721654.1721672 -Radford_dcgan arxiv:1511.06434v2 -Rajkomar2017_radiographs doi:10.1007/s10278-016-9914-9 -Rakhlin2018_histology doi:10.1101/259911 -Ramsundar2015_multitask_drug arxiv:1502.02072 -Ranganath2016_deep arxiv:1608.02158 -Raina2009_gpu doi:10.1145/1553374.1553486 -Ribeiro2016_lime arxiv:1602.04938 -Rogers2010_fingerprints doi:10.1021/ci100050t -Roth2015_view_agg_cad doi:10.1109/TMI.2015.2482920 -Romero2017_diet url:https://openreview.net/pdf?id=Sk-oDY9ge -Rosenberg2015_synthetic_seqs doi:10.1016/j.cell.2015.09.054 -Russakovsky2015_imagenet doi:10.1007/s11263-015-0816-y -Sa2015_buckwild pmcid:PMC4907892 -Salzberg doi:10.1186/1471-2105-11-544 -Schatz2010_dna_cloud doi:10.1038/nbt0710-691 -Schmidhuber2014_dnn_overview doi:10.1016/j.neunet.2014.09.003 -Scotti2016_missplicing doi:10.1038/nrg.2015.3 -Segata doi:10.1371/journal.pcbi.1004977 -Segler2017_drug_design arxiv:1701.01329 -Seide2014_parallel doi:10.1109/ICASSP.2014.6853593 -Setty2015_seqgl doi:10.1371/journal.pcbi.1004271 -Selvaraju2016_grad arxiv:1610.02391 -Serden doi:10.1016/S0168-8510(02)00208-7 -Shaham2016_batch_effects doi:10.1093/bioinformatics/btx196 -Shapely doi:10.1515/9781400881970-018 -Shen2017_medimg_review doi:10.1146/annurev-bioeng-071516-044442 -Shin2016_cad_tl doi:10.1109/TMI.2016.2528162 -Shrikumar2017_learning arxiv:1704.02685 -Shrikumar2017_reversecomplement doi:10.1101/103663 -Simonyan2013_deep arxiv:1312.6034 -Singh2017_attentivechrome arxiv:1708.00339 -Singh2016_deepchrome arxiv:1607.02078 -Singh2016_tsk doi:10.1109/TCBB.2016.2609918 -Silver2016_alphago doi:10.1038/nature16961 -Sonderby doi:10.1007/978-3-319-21233-3_6 -Soueidan doi:10.1515/metgen-2016-0001 -Spark doi:10.1145/2934664 -Speech_recognition url:http://www.businessinsider.com/ibm-edges-closer-to-human-speech-recognition-2017-3 -Springenberg2014_striving arxiv:1412.6806 -Stein2010_cloud doi:10.1186/gb-2010-11-5-207 -Stenstrom2005_latent doi:10.2337/diabetes.54.suppl_2.S68 -Stormo2000_dna doi:10.1093/bioinformatics/16.1.16 -Stratnikov doi:10.1186/2049-2618-1-11 -Strobelt2016_visual arxiv:1606.07461 -Su2015_gpu arxiv:1507.01239 -Subramanian2016_bace1 doi:10.1021/acs.jcim.6b00290 -Sun2016_ensemble arxiv:1606.00575 -Sundararajan2017_axiomatic arxiv:1703.01365 -Sutskever arxiv:1409.3215 -Swamidass2009_irv doi:10.1021/ci8004379 -Tan2014_psb doi:10.1142/9789814644730_0014 -Tan2015_adage doi:10.1128/mSystems.00025-15 -Tan2016_eadage doi:10.1101/078659 -TAC-ELM doi:10.1142/S0219720012500151 -TensorFlow arxiv:1603.04467 -Tian2019 doi:10.1186/s12864-019-5488-5 -Torracinta2016_deep_snp doi:10.1101/097469 -Torracinta2016_sim doi:10.1101/079087 -Tu1996_anns doi:10.1016/S0895-4356(96)00002-9 -Unterthiner2014_screening url:http://www.bioinf.at/publications/2014/NIPS2014a.pdf -Vanhoucke2011_cpu url:https://research.google.com/pubs/pub37631.html -Vera2016_sc_analysis doi:10.1146/annurev-genet-120215-034854 -Vervier doi:10.1093/bioinformatics/btv683 -Wallach2015_atom_net arxiv:1510.02855 -Wang2016_breast_cancer arxiv:1606.05718 -Wang2016_methyl doi:10.1038/srep19598 -Wang2016_protein_contact doi:10.1371/journal.pcbi.1005324 -Wasson1985_clinical doi:10.1056/NEJM198509263131306 -WayGreene2017_eval arxiv:1711.04828 -WayGreene2017_tybalt doi:10.1101/174474 -Word2Vec arxiv:1301.3781 -wgsquikr doi:10.1371/journal.pone.0091784 -Wu2017_molecule_net doi:10.1039/C7SC02664A -Xiang doi:10.1016/S0167-9473(99)00098-5 -Xiong2011_bayesian doi:10.1093/bioinformatics/btr444 -Xiong2015_splicing_code doi:10.1126/science.1254806 -Xu2015_show arxiv:1502.03044 -Yasushi2016_cgbvs_dnn doi:10.1002/minf.201600045 -yok doi:10.1186/1471-2105-12-20 -Yoon2016_cancer_reports doi:10.1007/978-3-319-47898-2_21 -Yosinski2014 url:https://papers.nips.cc/paper/5347-how-transferable-are-features-in-deep-neural-networks -Yosinksi2015_understanding arxiv:1506.06579 -Yu2016_melanoma_resnet doi:10.1109/TMI.2016.2642839 -Zeiler2013_visualizing arxiv:1311.2901 -Zeng2015 doi:10.1186/s12859-015-0553-9 -Zeng2016_convolutional doi:10.1093/bioinformatics/btw255 -Zhang2015_multitask_tl doi:10.1145/2783258.2783304 -Zhang2017_generalization arxiv:1611.03530v2 -Zhou2015_deep_sea doi:10.1038/nmeth.3547 -Zhu2016_advers_mamm doi:10.1101/095786 -Zhu2016_mult_inst_mamm doi:10.1101/095794 -Zintgraf2017_visualizing arxiv:1702.04595 -goodfellow2016deep url:http://www.deeplearningbook.org/ -li2016joint url:https://dl.acm.org/citation.cfm?id=3061018 -world2004international url:http://www.who.int/classifications/icd/en/ -ghahramani_protect arxiv:1707.02476 -uncertainty_types arxiv:1703.04977 -uncertainty_multi_task arxiv:1705.07115 -guo_calibration arxiv:1706.04599 -platt_scaling url:http://citeseer.ist.psu.edu/viewdoc/summary?doi=10.1.1.41.1639 -out_dist_baseline arxiv:1610.02136 -temp_out_dist arxiv:1706.02690 -ai_safety arxiv:1606.06565 -strong_adversary arxiv:1705.07263 -retinopathy_uncertainty doi:10.1038/s41598-017-17876-z -bayesian_hypernets arxiv:1710.04759 -mcclure_bayesian arxiv:1611.01639 -uncertainty_ensembles arxiv:1612.01474 -domain_adapt_uncertainty arxiv:1505.07818 -gal_thesis url:http://www.cs.ox.ac.uk/people/yarin.gal/website/thesis/thesis.pdf From 58adbfc6bd2355b496ce3904bb966803d60e461e Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Thu, 5 Mar 2020 15:14:13 -0500 Subject: [PATCH 12/15] Update metadata.yaml --- content/metadata.yaml | 15 +++++++++------ 1 file changed, 9 insertions(+), 6 deletions(-) diff --git a/content/metadata.yaml b/content/metadata.yaml index b2de155a..966e49e3 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -77,10 +77,11 @@ authors: affiliations: - Program in Quantitative Biomedical Sciences, Geisel School of Medicine at Dartmouth, Lebanon, NH funders: Burroughs Wellcome Fund Big Data in the Life Sciences training grant at Dartmouth - v1: "Drafted one or more subsections" - v2: "" + v1: "" + v2: "Drafted one or more subsections" coi: string: "None" + lastapproved: !!str 2019-07-22 - github: brockclarke name: Brock C. Christensen orcid: 0000-0003-3022-426X @@ -88,10 +89,11 @@ authors: affiliations: - Department of Epidemiology, Geisel School of Medicine, Dartmouth College, Lebanon, NH funders: NIH R01CA216265, Dartmouth College Neukom Institute for Computational Science CompX award - v1: "Drafted one or more subsections" - v2: "" + v1: "" + v2: "Drafted one or more subsections" coi: string: "None" + lastapproved: !!str 2019-08-01 - github: AlexanderTitus name: Alexander J. Titus orcid: 0000-0002-0145-9564 @@ -99,10 +101,11 @@ authors: affiliations: - Office of the Under Secretary of Defense for Research & Engineering, Department of Defense, Washington, DC funders: NIH T32LM012204 - v1: "Drafted one or more subsections" - v2: "" + v1: "" + v2: "Drafted one or more subsections" coi: string: "None" + lastapproved: !!str 2019-07-23 - github: gwaygenomics name: Gregory P. Way orcid: 0000-0002-0503-9348 From 91f684ecf0860f58cab43fe4c6120047a9208dd6 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Thu, 5 Mar 2020 22:40:27 -0500 Subject: [PATCH 13/15] Update metadata.yaml --- content/metadata.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/metadata.yaml b/content/metadata.yaml index 966e49e3..e0fe872a 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -99,7 +99,7 @@ authors: orcid: 0000-0002-0145-9564 email: alexander.j.titus@gmail.com affiliations: - - Office of the Under Secretary of Defense for Research & Engineering, Department of Defense, Washington, DC + - Department of Epidemiology, Geisel School of Medicine, Dartmouth College, Lebanon, NH funders: NIH T32LM012204 v1: "" v2: "Drafted one or more subsections" From 992f01c4dcab3b09e3e8e203772d785003550ae9 Mon Sep 17 00:00:00 2001 From: jlevy44 Date: Sun, 8 Mar 2020 15:32:11 -0400 Subject: [PATCH 14/15] Update metadata.yaml --- content/metadata.yaml | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/content/metadata.yaml b/content/metadata.yaml index e0fe872a..9ad7970e 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -97,7 +97,7 @@ authors: - github: AlexanderTitus name: Alexander J. Titus orcid: 0000-0002-0145-9564 - email: alexander.j.titus@gmail.com + email: alexander.j.titus@dartmouth.edu affiliations: - Department of Epidemiology, Geisel School of Medicine, Dartmouth College, Lebanon, NH funders: NIH T32LM012204 From 9198ad082a1b2fd93e90ae1736abffbd4c022e7d Mon Sep 17 00:00:00 2001 From: Casey Greene Date: Mon, 9 Mar 2020 12:22:07 -0400 Subject: [PATCH 15/15] update COI dates to match PR comments --- content/metadata.yaml | 8 ++++---- 1 file changed, 4 insertions(+), 4 deletions(-) diff --git a/content/metadata.yaml b/content/metadata.yaml index 9ad7970e..07063935 100644 --- a/content/metadata.yaml +++ b/content/metadata.yaml @@ -81,7 +81,7 @@ authors: v2: "Drafted one or more subsections" coi: string: "None" - lastapproved: !!str 2019-07-22 + lastapproved: !!str 2020-03-04 - github: brockclarke name: Brock C. Christensen orcid: 0000-0003-3022-426X @@ -93,7 +93,7 @@ authors: v2: "Drafted one or more subsections" coi: string: "None" - lastapproved: !!str 2019-08-01 + lastapproved: !!str 2020-03-05 - github: AlexanderTitus name: Alexander J. Titus orcid: 0000-0002-0145-9564 @@ -105,7 +105,7 @@ authors: v2: "Drafted one or more subsections" coi: string: "None" - lastapproved: !!str 2019-07-23 + lastapproved: !!str 2020-03-07 - github: gwaygenomics name: Gregory P. Way orcid: 0000-0002-0503-9348 @@ -478,4 +478,4 @@ authors: v2: "Drafted sub-sections, edited the manuscript, reviewed pull requests, and coordinated co-authors" coi: string: "None" - lastapproved: !!str 2017-05-26 + lastapproved: !!str 2020-03-09