Open
Description
from a user:
Just to followup on today's meeting on a couple of points. First, here
is a test case for the extremely long extraction times for ranges on some
of the nodes in the 200Mammmal alignment. I have a script that simply
reads ranges from a file and calls hal2fasta and times the run. I ran it
on a node which goes quickly overall and one that is extremely slow. Both
runs extract a similar amount of bp over 1000 ranges each:
Example normal run:
./benchExtract.pl 200Mammals/200m-v1.hal
fullTreeAnc208 fullTreeAnc208.bed
Total Ranges: 1000
Total Sequence: 350457 bp
Average hal2fasta extraction time: 15946.44 bp/sec
0.02 records/sec
Total runtime: 22.200921 secs
./benchExtract.pl 200Mammals/200m-v1.hal
Acomys_cahirinus Aconmys_cahirinus.bed
Total Ranges: 1000
Total Sequence: 353557 bp
Average hal2fasta extraction time: 1129.61 bp/sec
0.31 records/sec
Total runtime: 312.902046 secs
Metadata
Metadata
Assignees
Labels
No labels