GeneBlocks is a Python library for comparing DNA sequences. It can be used to:
- Find common blocks in a group of DNA sequences, to factorize them (e.g. only analyze or synthetize each common block once).
- Highlight differences between sequences (insertions, deletions, mutations).
- Transfer Genbank features from one record to another sharing similar subsequences.
At the Edinburgh Genome Foundry, we use GeneBlocks to optimize sequence assembly, explore sets of non-annotated sequences, or visualize the differences between different versions of a sequence, and re-annotate records coming from third parties such as DNA manufacturers.
PIP installation:
pip install geneblocks
Github Page:
https://github.com/Edinburgh-Genome-Foundry/geneblocks
Live demos:
Transfer Genbank features between records
License: MIT, Copyright Edinburgh Genome Foundry
Geneblocks is part of the EGF Codons synthetic biology software suite for DNA design, manufacturing and validation.