A Haskell program that builds phylogenetic trees from DNA sequences using progressive alignment and UPGMA clustering. Originally developed with Ivar Mira and Alfred Nilsson as a project in a course (Program Design and Data Structures).
Fix buggy tracebackFix labeling of sequences in final tree- Write tests and debug
- Score gap opening and extension differently
- Incorporate fasta file parser
- Implement nearest neighbour clustering
To run the program, load main.hs into GHCI and run:
treeBuilder sequences
where sequences is a list of DNA strings to construct the phylogenetic tree from. The resulting phylogenetic tree is displayed in Newick format in the terminal. It can be visualized using an online tool.