tools

UNIX tools for simple, repetitive bioinformatics tasks

tally_short_sequences.sh

Finds sequences shorter than specified length within a fasta file and prints the counts for each sample

Usage: bash tally_short_sequences.sh Fasta_Filename Max_Read_Length

assign_taxonomy_from_blast_results.sh

Takes .txt formatted BLAST results from a fasta file and wraps entrez_qiime.py to assign NCBI taxonomic lineages in QIIME format for the top BLAST hit for each OTU.
This script assumes you have a local copy of NCBI taxonomy and entrez_qiime.py

Usage: bash assign_taxonomy_from_blast_results.sh blast_results_text_file ABS/PATH/TO/NCBI/TAXONOMY/FILES/DIRECTORY/ ABS/PATH/TO/DRIECTORY/FOR/entrez_qiime.py

make_qiime_database_from_fasta.sh

Takes fasta file from NCBI search and creates a QIIME-compatible taxonomy database along with a sequence database that can be used to assign OTUs and taxonomy within QIIME. This allows simple construction of custom databases based on edirect NCBI search results. This script assumes you have a local copy of NCBI taxonomy and entrez_qiime.py

Usage: bash make_qiime_database_from_fasta.sh /ABS/PATH/TO/INPUT_FASTA_FILE /ABS/PATH/TO/DIRECTORY/FOR/entrez_qiime.py /ABS/PATH/TO/NCBI/TAXONOMY/DIRECTORY/ /ABS/PATH/TO/OUTPUT/DIRECTORY/

SRA_Download_and_Process.sh

Takes downloaded accession list and table from the Sequence Read Archive Read Selector Tool and automatically downloads the associated fastq files from those accessions. It then removes low-quality reads, and generates a QIIME compatible concatenated fasta and valid mapping file, ready for OTU-Picking. This allows you to go straight from selecting projects of interest on SRA to picking OTUs in QIIME with any uploaded metadata. This script assumes local copies of QIIME, the fastx_toolkit, and sratools.

Name		Name	Last commit message	Last commit date
Latest commit History 29 Commits
GDM_Unifrac		GDM_Unifrac
DB.fasta.gz		DB.fasta.gz
Funguild_in_R.R		Funguild_in_R.R
README.md		README.md
SRA_Download_and_Process.sh		SRA_Download_and_Process.sh
Taxonomy.txt.gz		Taxonomy.txt.gz
assign_ncbi_taxonomy_from_blast_results.sh		assign_ncbi_taxonomy_from_blast_results.sh
bbdml_helper.R		bbdml_helper.R
count_seqs.sh		count_seqs.sh
countseqs.sh		countseqs.sh
fastq_encoding.sh		fastq_encoding.sh
make_ITSx_Parts_slurm.sh		make_ITSx_Parts_slurm.sh
make_qiime_database_from_fasta.sh		make_qiime_database_from_fasta.sh
tally_short_sequences.sh		tally_short_sequences.sh
xml_to_top-blast_hit.sh		xml_to_top-blast_hit.sh
xml_to_top-blast_hit.sh~		xml_to_top-blast_hit.sh~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tools

tally_short_sequences.sh

assign_taxonomy_from_blast_results.sh

make_qiime_database_from_fasta.sh

SRA_Download_and_Process.sh

About

Releases

Packages

Contributors 2

Languages

gzahn/tools

Folders and files

Latest commit

History

Repository files navigation

tools

tally_short_sequences.sh

assign_taxonomy_from_blast_results.sh

make_qiime_database_from_fasta.sh

SRA_Download_and_Process.sh

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages