Visualization of Viral Proteome based on Mitochondrial Localization.
Focuses on:
- Probability distribution of Viral Proteins based on Mitochondrial Targeting Signal Peptide and Trans-membrane Domains
- Cleavage Sites distribution of Viral Proteins based on probability of containing mitochondrial target sequence
- Comparison of viral proteome mitochondrial localization prediction results against Human and Mouse Mitochondrial Proteins (mitoCarta)
- Virologs (viral protein with host homolog) to uncover new serviceable aspects of protein based on sequence(structural) comparison.
Goal of the project is to provide an interactive application for users to access data analysis of the entire Viral Proteome based on (1) orthologs to Human Proteins, (2) Subcellular Localization, (3) Gene Expression (cell-type, regulation eg. ISG) and various host filters. In addition, provide interactive data plots for visualization of viral proteome results based on subcellular localization including distribution plots, (1)probability of being mitochondrial protein, (2)number of cleavage sites of protein, and (3) probability of containing trans-membrane domains, and to obtain a complete list of viral protein information (including Accession ID, Protein Name, Probability of Mitochondrial signal, Virus Type). This would help researchers acquire viral proteins that mimic the host (human) mitochondrial localized proteins.
Virolog_WebDev |-- documentation +-- ViralAnalysisWebAppArchitecture.jpg |-- filters |-- isg +-- files pertaining to isg filter |-- mitofates +-- files pertaining to mitofates filter |-- tmhmm +-- files pertaining to tmhmm filter |-- word++ +-- files pertaining to word++ filter |-- cleanCsvResults.py |-- tmhmm_mitofates_viralResults.csv |-- tmhmm_mitofates_viralShortResults.csv +-- tmhmmMitofatesComparison.py |-- genedata |-- csvdata |-- gene_taxonomy.csv |-- mitofates_viral1.csv |-- mitofatesResults_Human.MitoCarta2.0.csv +-- mitofatesResults_Mouse.MitoCarta2.0.csv |-- data |-- gene_identifiers.txt |-- mitofatesResults_Human.MitoCarta2.0.xlsx |-- mitofatesResults_Mouse.MitoCarta2.0.xlsx |-- mitofatesResults_viral1.xlsx +-- mitofatesResults_viral2.xlsx |-- databasedumps +-- viralanalysisdb2018.sql |-- outputdata |-- gene_taxonomy.csv |-- mitofates_viral_cleaned.csv |-- mitofates_viral1-cleaned2.csv |-- mitofatesResults_Human.MitoCarta2.0-cleaned.csv +-- mitofatesResults_Mouse.MitoCarta2.0-cleaned.csv |-- cleavage_sites.py |-- datacleaner.py |-- presequence_probability.py |-- protein_info.py +-- virus_taxonomy.py |-- HeatMapGeneration |-- example_heat_map.html |-- geneExpressionData_test.csv |-- geneExpressionData.csv |-- heat_map.py |-- README.md +-- requirements.txt |-- static |-- css |-- dashboard.css |-- index.css +-- layout.css |-- js |-- datatables.min.js +-- index.js |-- templates |-- partials |-- cleavagesites.html |-- presequenceprobability.html +-- viraltaxonomy.html |-- dashboard.html +-- layout.html |-- create-tables.sql |-- Human_Genome.sql |-- main.py |-- README.md +-- requirements.txt
It is assumed that this repo has been cloned locally and you are in the base directory for the cloned repo. Follow the steps at the following url: https://www.digitalocean.com/community/tutorials/how-to-serve-flask-applications-with-gunicorn-and-nginx-on-ubuntu-16-04
-
Create a python virtual environment and activate it (how?).
-
Run the following the command after activating the virtualenv:
pip install -r requirements.txt
-
Start the application using
python main.py
-
Go to http://localhost:5000 in your browser.
The app currently uses a mix of csv files and mysql database to store the data. It's in the process of migrating from csv to mysql. The database is currently hosted on the same cloud server as that of the app.