DNAnalyzer-modified

Democratizing AI-Powered DNA Analysis

On-device genomic insights for everyone, everywhere


About DNAnalyzer

DNAnalyzer is a fiscally sponsored 501(c)(3) nonprofit (EIN: 81-2908499) revolutionizing DNA analysis by making machine learning-powered genomic insights accessible through efficient on-device computation.

Founded by Piyush Acharya and co-led with @LimesKey, our platform has attracted 45 computational biologists and computer scientists from institutions including Microsoft Research, University of Macedonia, and Northeastern University.

Our impact has been recognized by the organizers of the AI Engineer World's Fair (backed by OpenAI, Microsoft, Google DeepMind, and Anthropic) and the CEO of Forem.


Why It Matters

| Current Reality | DNAnalyzer's Mission |
| --- | --- |
| $100 average cost for DNA sequencing | Free on-device analysis |
| Up to $600 for basic health insights | Accessible to underserved communities |
| 78% of companies share genetic data with third parties | Private, secure analysis that stays on your device |
| Data breaches expose millions (23andMe: 6.9M users in 2023) | No central database of sensitive genetic information |

"Unlike a credit card number or password, stolen or misused genetic information cannot be changed."


Core Features

Start & Stop Codons

Identify protein-coding regions and analyze the 20 different amino acids in polypeptide chains
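
To make the idea concrete, here is a minimal Python sketch of locating regions bounded by a start codon (ATG) and a stop codon (TAA, TAG, or TGA) in the standard genetic code. It is an illustration only, not DNAnalyzer's implementation: the function name `find_orfs` and the `min_codons` parameter are hypothetical, and it scans only the forward strand.

```python
START = "ATG"
STOPS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=1):
    """Return sorted (start, end) pairs for forward-strand open reading frames.

    `end` is exclusive and includes the stop codon. `min_codons` counts the
    codons before the stop codon, including the start codon.
    Hypothetical helper for illustration; not part of DNAnalyzer.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        # Walk the sequence one codon at a time in this reading frame.
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == START:
                start = i
            elif start is not None and codon in STOPS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)


if __name__ == "__main__":
    print(find_orfs("CCATGAAATAGGG"))
```

A fuller analysis would also scan the reverse complement and translate each region into amino acids.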

High Coverage Regions

Detect promoter sequences with high GC content (45-60%), which are likely to carry crucial genomic information
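
As a rough sketch of how such regions could be located, the Python below computes the GC fraction of a sequence and reports sliding windows that fall inside the 45-60% range quoted above. The function names, the window size, and the choice of a simple sliding window are assumptions made for illustration; DNAnalyzer's own method may differ.

```python
def gc_fraction(seq):
    """Fraction of bases in `seq` that are G or C (0.0 for an empty sequence)."""
    seq = seq.upper()
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def gc_windows(seq, window=10, low=0.45, high=0.60):
    """Start indexes of every window whose GC fraction lies within [low, high].

    Hypothetical helper: the window size and thresholds are illustrative defaults.
    """
    return [
        i
        for i in range(len(seq) - window + 1)
        if low <= gc_fraction(seq[i:i + window]) <= high
    ]


if __name__ == "__main__":
    print(gc_fraction("ATGC"))
    print(gc_windows("ATGCATGCAT", window=4))
```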

Neural Disorder Analysis

Identify genetic signatures related to neurodevelopmental disorders like autism, ADHD, and schizophrenia

Core Promoter Elements

Find key promoter sequences (BRE, TATA, INR, DPE) responsible for initiating transcription
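
One common way to search for elements like these is to match IUPAC consensus strings against the sequence. The Python sketch below does that with regular expressions. The consensus strings used here (BRE `SSRCGCC`, TATA `TATAWAAR`, INR `YYANWYY`, DPE `RGWYV`) are commonly cited forms chosen for illustration; they are assumptions, not necessarily the patterns DNAnalyzer uses, and `find_motifs` is a hypothetical name.

```python
import re

# IUPAC nucleotide codes mapped to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}

# Illustrative consensus strings (assumed, see the note above).
MOTIFS = {
    "BRE": "SSRCGCC",
    "TATA": "TATAWAAR",
    "INR": "YYANWYY",
    "DPE": "RGWYV",
}


def find_motifs(seq, motifs=MOTIFS):
    """Return {motif name: [start indexes]} for each consensus match in `seq`."""
    seq = seq.upper()
    hits = {}
    for name, consensus in motifs.items():
        pattern = "".join(IUPAC[base] for base in consensus)
        # A zero-width lookahead lets overlapping matches be reported too.
        hits[name] = [m.start() for m in re.finditer(f"(?={pattern})", seq)]
    return hits


if __name__ == "__main__":
    print(find_motifs("GGGCGCCTATAAAAGG"))
```

Consensus matching ignores the position of each element relative to the transcription start site, which a real promoter search would normally take into account.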

FASTA Support

Process multi-line and single-line FASTA database files through upload or web linking
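
For readers unfamiliar with the format, the sketch below shows one way to read FASTA text in Python so that single-line and multi-line records end up in the same form. `parse_fasta` is a hypothetical helper written for this example, not DNAnalyzer's parser, and it assumes each record header is unique.

```python
def parse_fasta(text):
    """Parse FASTA text into {header: sequence}.

    Sequence lines belonging to one record are joined, so multi-line and
    single-line records are handled the same way.
    """
    records = {}
    header = None
    chunks = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            # A new header closes the previous record, if there was one.
            if header is not None:
                records[header] = "".join(chunks)
            header = line[1:].strip()
            chunks = []
        elif header is not None:
            chunks.append(line.upper())
    if header is not None:
        records[header] = "".join(chunks)
    return records


if __name__ == "__main__":
    print(parse_fasta(">seq1\nATG\nCCC\n>seq2\nGGTT\n"))
```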

Command-Line Interface

Access all core features through our powerful Methionine CLI (Met CLI) for scripting and automation

Coming Soon: Web-based user interface for enhanced accessibility


Quick DNA Introduction

DNA: The Programming Language of Life

DNA exists in most cells of the body and contains the blueprint for creating over 200 distinct cell types. Like a programming language exclusive to living organisms, it encodes the instructions for all biological processes.

Databases: The Foundation of Analysis

A DNA database is crucial for interpreting DNA sequences. By leveraging machine learning, we can make predictions on previously unseen DNA sequences, forming the foundation of modern genomic analysis.


Getting Started

Ready to explore your DNA? Follow our comprehensive guide to get started:

# Clone the repository
git clone https://github.com/VerisimilitudeX/DNAnalyzer.git

# Navigate to project directory
cd DNAnalyzer

# Build the project (the Gradle wrapper downloads dependencies on first run)
./gradlew build

For detailed instructions, please refer to our Getting Started Guide.


Roadmap

| Upcoming Development | Description |
| --- | --- |
| Optimized SQL Database | High-performance vertical scaling database to store genomic data from thousands of species |
| Enhanced Neural Network | Support for genotyped data from 3rd-party DNA testing services ($99 compatibility) |
| DIAMOND Implementation | Combining DIAMOND's performance with BLAST's powerful algorithm |

Contributing

We welcome contributions from developers and researchers of all skill levels!


Citations

View our detailed citations document for all in-line references.

If you use DNAnalyzer in your research, please cite:

@software{Acharya_DNAnalyzer_ML-Powered_DNA_2022,
  author = {Acharya, Piyush},
  doi = {10.5281/zenodo.14556577},
  month = oct,
  title = {{DNAnalyzer: ML-Powered DNA Analysis Platform}},
  url = {https://github.com/VerisimilitudeX/DNAnalyzer},
  version = {3.5.0-beta.0},
  year = {2022}
}

⚖ Terms of Use

The use of this application is entirely at your own discretion and responsibility. While the DNAnalyzer team is committed to addressing significant issues, we disclaim liability for losses, damages, or consequences arising from the use of this application.

For questions or concerns, please contact us at [email protected].

Copyright © Piyush Acharya 2025. DNAnalyzer is a fiscally sponsored 501(c)(3) nonprofit organization (EIN: 81-2908499) and is licensed under the MIT License.


Project Growth

Star History Chart

Support DNAnalyzer

Every referral helps fund our nonprofit mission

23andMe

Get 10% off your order
DNAnalyzer earns $20 per referral

23andMe Referral

Ancestry® Membership

Get up to 24% off membership
DNAnalyzer earns $10 per referral

Ancestry Referral