-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathREADME.Format
58 lines (42 loc) · 1.71 KB
/
README.Format
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
ORFanDbFormat
(c)Yin Lan 2015
Contents
I. Introduction
II. Input Data
III. Running
IV. Output
==================================================================
I. Introduction
ORFanDbFormat is used to format databases for use with ORFanFinder.
If you are only looking to use one of the prepackaged databases,
you likely do not need to deal with this much. Simply download the
uniprot.hdb.gz file from cys.bios.niu.edu/orfanfinder. Otherwise, instructions
are below.
------------------------------------------------------------------
II. Input Data
ORFanDbFormat takes only a single input file. The file should be
tab-delimited, and contain the gene id in the first column, and
its taxonomy id in the second. Each of these lines should
correspond to a sequence in the database that you are running
BLASTs against, and there should be a line for every entry.
Data should look like:
gi|489223532|ref|WP_003131952.1| 1358
gi|66816243|ref|XP_642131.1| 352472
gi|66818355|ref|XP_642837.1| 352472
gi|446106212|ref|WP_000184067.1| 1386
gi|494110381|ref|WP_007051162.1| 1678
gi|446057344|ref|WP_000135199.1| 543
gi|489344060|ref|WP_003251213.1| 303
------------------------------------------------------------------
III. Running ORFanDbFormat
ORFanDbFormat takes two arguments, the input data explained above,
and the output file.
[-i input_filename] Filename of input data
<-o output_filename> Filename of output file. Optional.
Defaults to input_filename.hdb
For example:
ORFanDbFormat -i nr.tax -o nr.hdb
------------------------------------------------------------------
IV. Output
The output is a binary database. Supply it to ORFanFinder using
the -db argument.