If you use any part of this repository, we shall be very grateful if you cite our paper Analyzing hCov genome sequences: Applying Machine Intelligence and beyond
- Please install Tensorflow version: 2.2.0. (Other 2.x versions should work, but have not been tested. Use gpu for better performance)
- Please install Keras version: 2.3.1.
- Other python libraries used for this project can be installed by running the following command
pip install -r requirements.txt
- Keep the input sequence fasta file (file used in this analysis can be downloaded from here ) and the info file (sample file present in the input directory) in the Input directory
- The input_processing.py script handles the input processing and labelling task. It requires 3 mandatory parameters and 1 optional parameters. They are:
- input
- info_file
- label
- old
- There are four options for labelling:
- Death
- CFR_Recovery
- CFR_confirmed_cases
- CFR_Infrastructure
- Set old parameter to 1 to use the old Training/Testing Accession ID's for input preprocessing and labelling. Don't use this parameter for generating new Training/Testing set. The models are pre-trained with Death Labelling.
- Sample command:
python input_processing.py --input <input_fasta_file_name> --info_file <info_file_name> --label <label_option> --old 1(optional)
-
Navigate to the <Clustering_and_Phylogenetic_Analysis> directory.
-
The controller.py script is a one-stop service for all of related analysis. It requires 2 parameters:
- label (Same as the Input Folder Options)
- method
-
There are 4 options for method:
Method Description Euclidean Simple Euclidean distance-based method among the 3-mers of the genome sequence Novel_Fast_Vector 18-dimensional Novel Fast Vector Sequence Comparison Analysis Accumulated_Fast_Vector 18-dimenasional Accumulated Fast Vector Sequence Comparison Analysis MAW Minimum Absent Word Analysis -
Only MAW requires additional 4 parameters:
- Minimum_MAW_Length
- Maximum_MAW_Length
- Distance_Method
- Fasta File Name (Must be kept in the Input Directory)
-
Sample Command:
python controller.py --label <label_option> --method <method_option>