Skip to content

Condense and summarize MicrobeAnnotator metabolic pathway tab files from many genomes into a big heatmap CSV files

Notifications You must be signed in to change notification settings

AhmedElsherbini/MicrobeAnnotator_stats

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

MicrobeAnnotator_stats

Kindly if you find this repo useful for your work, cite & star this repo

What is this script?

MicrobeAnnotator is a great tool for metabolic pathway annotation. However, if you need to collect pathway completeness stats from a cohort of genomes into one condensed CSV (to make a heatmap), you are in the right repo!

What do you need?

You shall have the tab files in one directory

What about dependencies?

Pandas, csv, argparse

Then, effortlessly you can type in your beautiful terminal

python3 microanno_stats.py -i .

"-i /--input_dir" is your path to the directory for your binary file.

In my case, the script was in the same directory as the Python file. Therefore, I wrote . as my input directory.

What do you get?

Currently, there are two files.

  1. pathway_group.csv The main output is a collection in the 3rd column which is referred as "pathway_group" in the basic tab output of MicrobeAnnotator.
  2. detailed_pathway.csv. This is simply a detailed pathway collection in the 2nd column which is referred to as "name" in the basic tab output of MicrobeAnnotator.

You can use the two files in your favorite visualization tool to produce a publication-ready figure.

I hope this helps.

Thanks

About

Condense and summarize MicrobeAnnotator metabolic pathway tab files from many genomes into a big heatmap CSV files

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages