This project has the codes and materials for the subfamily classification of a CAZyme family.
Project title: Subfamily classification and analysis of CAZy family
Goal: Classify the CAZy family into subfamilies, analyze and visualize them using Bioinformatics tools and methods.
In this tutorial, CAZy family GH31 was used to explain bioinformatics methods to analyze and visualize them.
Paper Link: A subfamily classification to choreograph the diverse activities within glycoside hydrolase family 31.
DOI: https://doi.org/10.1016/j.jbc.2023.103038
- Dataset and preprocess
- Domain annotation using HMM and dbCAN
- Extraction of modules (based on the annotation)
- Construction of sequence similarity networks (SSN) using SSNpipe and analyzing SSNs based on characterized IDs from CAZy and EC numbers.
- Visualization of SSN networks using Cytoscape
- Phylogenentic analysis
- Interpretation and discussion