masterBLASTer

A SLURM tool for performing parallel BLAST searches.

Requirements:

HPC cluster running SLURM job scheduler
BLAST
Python 3.5+ (with pandas and numpy packages)
Preindexed BLAST protein database

DESCRIPTION:

This tool provides a way of submitting parallel BLASTp searches from a large protein FASTA file. It consists of a set of SLURM .srun files that each perform a different step.

The three basic steps are:

Split the FASTA file into several smaller blocks
Perform seperate BLAST searches with each block
Combine the results and outputing a result file

INSTRUCTIONS:

Copy all tool files into the project's working directory.
Edit the config.sh file to match the project parameters. These will be your SLURM account, SLURM partition, the project FASTA file, the blast database to use, the output file, and the number of blocks to split the file into.
Start the project3_master.srun file supplying the account and partition to run the master task from. Example: 'sbatch -A youraccount -p yourpartition project3_master.srun'
The master SLURM script will direct the rest of the program. In your SLURM queue you should see the master job as well as the helper jobs (they will be held until the previous step completes). Simply, wait until all jobs are finished and the results will be in your project directory with the specified output file name.

RESULTS:

The tool currently returns a tab-delimited file that includes each unique gene hit from the database and the number of hits for each gene sorted in descending order.

Sample File

For testing, a sample FASTA file is provided (some.pep) as well as an example output (results.tsv)

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
Hort503-01-S2018-P03.docx		Hort503-01-S2018-P03.docx
LICENSE		LICENSE
README.md		README.md
blast_out_test		blast_out_test
config.sh		config.sh
debug.log		debug.log
master_BLASTer.png		master_BLASTer.png
merge.py		merge.py
project3_blast.srun		project3_blast.srun
project3_master.srun		project3_master.srun
project3_merge.srun		project3_merge.srun
project3_split.srun		project3_split.srun
results.tsv		results.tsv
some.pep		some.pep

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

masterBLASTer

DESCRIPTION:

INSTRUCTIONS:

RESULTS:

Sample File

About

Releases

Packages

Contributors 2

Languages

License

mtmcgowan/masterBLASTer

Folders and files

Latest commit

History

Repository files navigation

masterBLASTer

DESCRIPTION:

INSTRUCTIONS:

RESULTS:

Sample File

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages