Skip to content

Latest commit

 

History

History
17 lines (15 loc) · 1.3 KB

NEWS.md

File metadata and controls

17 lines (15 loc) · 1.3 KB

Unreleased

OutSeekR 1.0.0 - 2024-11-15

Added

  • Implementation of core Outlier Detection Algorithm, a statistical approach for detecting transcript-level outliers in RNA-seq or related data types, leveraging normalized data (e.g., FPKM) and several statistical metrics.
  • Five distinct statistics for robustly assessing outliers:
    • Z-scores using mean and standard deviation.
    • Z-scores using median and median absolute deviation.
    • Z-scores with 5%-trimmed mean and standard deviation.
    • Fraction of observations in the smaller cluster from K-means (K=2).
    • Cosine similarity between extreme observed values and theoretical distribution quantiles.
  • Comprehensive null simulation functionality. Generates null datasets mimicking the observed data distribution (without outliers) through generalized additive modeling of four potential distributions.
  • Outlier p-value calculation by comparing rank products from observed and null data across multiple rounds, refining the detection by iteratively removing the most extreme outliers.
  • Support for false discovery rate (FDR) correction to control for multiple testing.
  • Optimization for high-performance analysis using future.apply to enable parallelization, compatible with various computing environments.
  • Sample outliers data and usage demonstration.