Updated README.md
spficklin committed Nov 11, 2016
1 parent 4818491 commit 59a4a78
Showing 1 changed file with 12 additions and 0 deletions.
12 changes: 12 additions & 0 deletions terms/README.md
@@ -1 +1,13 @@
The shell scripts in this directory assist in creating the terms list (needed for the --terms argument) for FUNC-E. Each script corresponds to a different vocabulary or database of controlled terms. Each script was verified to work at the time it was added. However, if the corresponding database changes its file formats or URLs, the script may break.

The following scripts are provided:

- AraCyc.sh: creates a term list from TAIR's AraCyc database.
- GO.sh: creates a term list from the Gene Ontology.
- IPR.sh: creates a term list from InterPro.
- KEGG.sh: creates a term list of KEGG orthologs and pathways.
- PO.sh: creates a term list from the Plant Ontology.
- Pfam.sh: creates a term list from the Pfam database.
- RiceCyc.sh: creates a term list from the RiceCyc database.

To use terms from any of these databases, you must have a mapping of the terms from the controlled vocabulary or database to the genes in your genomic background. Obtain these mappings from the site where your transcript or genome assembly is housed. If such mappings are not available, you can create them yourself using tools such as InterProScan, Blast2GO, or the KEGG Automatic Annotation Server (KAAS).
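
Before running an analysis, it can help to confirm that every term in your mapping also appears in the terms list. The following is a minimal sketch of such a check, not part of FUNC-E itself. The function name `missing_terms`, the file names, and the column layouts are assumptions made for illustration: the terms list is assumed to be tab-delimited with the term ID in the second column, and the mapping is assumed to be tab-delimited with the gene in the first column and the term ID in the second. Adjust the column numbers to match your actual files.

```shell
# Hypothetical layouts (assumptions, not confirmed by this README):
#   terms list: vocabulary <TAB> term ID <TAB> term name
#   mapping:    gene <TAB> term ID
#
# Usage: missing_terms TERMS_FILE MAPPING_FILE
# Prints each term ID used in the mapping that is absent from the terms list.
missing_terms() {
  awk -F'\t' '
    NR == FNR { known[$2] = 1; next }   # first file: remember every term ID
    !($2 in known) { print $2 }         # second file: report unknown term IDs
  ' "$1" "$2" | sort -u
}
```

For example, `missing_terms terms.tsv mapping.tsv` prints nothing when every mapped term is present in the terms list. Any term IDs it does print may point to a stale terms list or to a mapping built against a different release of the database.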
