PyOmiX - Analysis Workflow

Analysis Workflow Course Project - Programming for Bioinformatics (Bioinformatics Professional Diploma)

GROUP MEMBERS:

Ahmed Omar Lamloum
Mohamed Magdy AboelEla (Team Leader)
Usama Bakry
Waleed Faheem Amer

OVERVIEW:

The overall purpose of PyOmiX is to create an analysis workflow that generate a simple phylogeny trees from multiple sequence alignment files for a list of SWISS-Prot ids using Clustal Omega, throughout a series of steps as described in the following flowchart.

INPUTS:

python pyomix.py -i <swiss-prot ids file dir> -d <database fasta file> -o <output dir>

OUTPUTS:

Directory with a subdirectory for each ID from the input list.
In each directory:
- Sequence fasta file from UniProt.
- Alignment file from Diamond.
- Sequences fasta file for accessions numbers from NCBI.
- Mulitple sequence alignment file from Clustal Omega.
- Phylogenetic tree from Clustal Omega.

SUBTASKS:

Function to make directories for swiss-prot ids. (done)
- Input: ids file.
- Output: list of ids directories.
Function to get fasta file (sequence) using request from UniProt.
- Input: swiss-prot id.
- Output: sequence fasta file.
Function to align sequence using Diamond. (done)
- Input: sequence fasta file and database file.
- Output: alignment file.
Function to get fasta file (sequence) using request from NCBI.
- Input: accession number.
- Output: sequence fasta file.
Function to merge multiple fasta files in one fasta file.
- Input: list of fasta files.
- Output: fasta file.
Function to perform multiple sequence alignment and get a phylogenetic tree.
- Input: fasta file.
- Output: alignment file and phylogenetic tree file.
Implement python script to run it on the command line.

EXTERNAL MODULES AND PROGRAMMES TO BE USED:

Diamond Aligner.
Clustal Omega (clustalo.py)

EXPECTED DIFFICULTIES:

Unsuitability of the extracted phylogenetic tree from Clustal Omega, so, we will use Simple Phylogeny Tree module instead.
Implementation the python script to run it on command line.

TASKS TO BE COMPLETED BY THE 4TH OF MAY:

Function to make directories for swiss-prot ids. (done)
Function to get fasta file (sequence) using request from UniProt.
Function to align sequence using Diamond. (done)
Function to get fasta file (sequence) using request from NCBI.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.idea		.idea
modules		modules
.gitignore		.gitignore
README.md		README.md
pyomix.py		pyomix.py
workflow		workflow
workflow.jpg		workflow.jpg
workflow.png		workflow.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PyOmiX - Analysis Workflow

GROUP MEMBERS:

OVERVIEW:

INPUTS:

OUTPUTS:

SUBTASKS:

EXTERNAL MODULES AND PROGRAMMES TO BE USED:

EXPECTED DIFFICULTIES:

TASKS TO BE COMPLETED BY THE 4TH OF MAY:

About

Releases

Packages

Languages

a-lamloum/pyomix

Folders and files

Latest commit

History

Repository files navigation

PyOmiX - Analysis Workflow

GROUP MEMBERS:

OVERVIEW:

INPUTS:

OUTPUTS:

SUBTASKS:

EXTERNAL MODULES AND PROGRAMMES TO BE USED:

EXPECTED DIFFICULTIES:

TASKS TO BE COMPLETED BY THE 4TH OF MAY:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages