genes are predicted simultaneously in several aligned genomes
INTRODUCTION
INSTALLATION
RUNNING AUGUSTUS IN CGP MODE
BUILDING THE NEWICK PARSER FROM SCRATCH
(not needed unless you run into compiler errors related to 'parse.cc' or 'lex.cc')
AUTHORS AND CONTACT
REFERENCES
LICENSES
The cgp mode is an extension to AUGUSTUS that takes an alignment of two or more genomes and simultaneously predicts genes in all of them. Beside the genomes and the alignment, a phylogenetic tree of the species is required input. AUGUSTUS-cgp can either be used
- de novo, or
- with extrinsic evidence for any subset of species
Such evidence includes for example already existing and trusted gene structures or hints from RNA-Seq alignments.
Both genomes and extrinsic evidence can either be read in from a flat file or
alternatively retrieved from a MySQL or SQLite database.
All three approaches are described below in more detail.
This manual assumes that you are already familiar with AUGUSTUS and that you know how to use AUGUSTUS for gene prediction in a single genome.
See these instructions for a complete overview.
Once all dependencies are available, you can compile AUGUSTUS using make.
make
In case you had previously compiled AUGUSTUS with disabled cgp mode first you have to call
make clean
After compilation has finished, the command bin/augustus should be executable and print a usage message.