Skip to content

Latest commit

 

History

History
10 lines (9 loc) · 1.18 KB

01.synopsis.md

File metadata and controls

10 lines (9 loc) · 1.18 KB

Abstract {.page_break_before}

The advent of high-throughput profiling methods (such as genomics or imaging) has accelerated basic research and made deep molecular characterization of patient samples routine. These approaches provide a rich portrait of genes, molecular pathways, and cell types involved in disease phenotypes. Machine learning (ML) can be a useful tool to extract disease-relevant patterns from high dimensional datasets. However, depending on the complexity of the biological question, machine learning often requires many samples to identify recurrent and biologically meaningful patterns. Rare diseases are inherently limited in clinical cases and thus have few samples to study. In this perspective, we outline the challenges and emerging solutions for using ML for small sample sets, specifically that of rare diseases. Advances in ML methods for rare diseases are likely to be informative for applications beyond rare diseases such as incorporating genomics into predictive modeling in precision medicine in which sample sizes are small but datasets are high-dimensional. We propose that the methods community prioritizes the development of ML techniques for rare disease research.