Skip to content

Commit

Permalink
added links to papers
Browse files Browse the repository at this point in the history
  • Loading branch information
Funatiq committed Jun 23, 2021
1 parent 7f640ba commit 9b88566
Show file tree
Hide file tree
Showing 2 changed files with 5 additions and 4 deletions.
7 changes: 4 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -6,9 +6,10 @@ MetaCache is a classification system for mapping genomic sequences (short reads,

For an independend comparison to other tools in terms of classification accuracy see the [LEMMI](https://lemmi.ezlab.org) benchmarking site.

**MetaCache's CPU** version classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs.
**MetaCache's CPU version** classifies around 60 Million reads (of length 100) per minute against all complete bacterial, viral and archaea genomes from NCBI RefSeq Release 97 running with 88 threads on a workstation with 2 Intel(R) Xeon(R) Gold 6238 CPUs.

**MetaCache's [GPU version](docs/gpu_version.md)** classifies around 300 Million reads (of length 100) per minute against all complete bacterial, viral, fungal and archaea genomes from NCBI RefSeq Release 202 running on a workstation with 4 NVIDIA(R) Tesla(R) V100 GPUs (32 GB model).
[**MetaCache-GPU**](https://arxiv.org/abs/2106.08150) will be presented at ICPP '21.



Expand All @@ -24,10 +25,10 @@ make
./metacache-build-refseq
```
This will
* install the zlib library
* install the zlib library
* download the MetaCache source code from GitHub
* compile MetaCache (without GPU support)
* download the complete bacterial, viral and archaea genomes from the latest NCBI RefSeq release (this can take some time)
* download the complete bacterial, viral and archaea genomes from the latest NCBI RefSeq release (this can take some time)
* build a classification database

Once the default database is built you can classify reads:
Expand Down
2 changes: 1 addition & 1 deletion docs/gpu_version.md
Original file line number Diff line number Diff line change
Expand Up @@ -31,7 +31,7 @@ If you don't have enough GPU memory, you can use [database partitioning](docs/pa
* CUDA >= 11
* CUDA 10.2 and a self-provided version of [CUB](https://github.com/NVlabs/cub) (you also need to set the include path for CUB by supplying `INCLUDE=*your_cub_path*` when calling make)

* Hashtable library [warpcore](https://github.com/sleeepyjack/warpcore) and sorting library [bb_segsort](https://github.com/Funatiq/bb_segsort). Both repositories are included as submodules and need to be checked out in addition to MetaCache itself. You can do so by calling
* Hashtable library [warpcore (github)](https://github.com/sleeepyjack/warpcore) [(HIPC '20 paper)](https://ieeexplore.ieee.org/document/9406635) and sorting library [bb_segsort (github)](https://github.com/Funatiq/bb_segsort). Both repositories are included as submodules and need to be checked out in addition to MetaCache itself. You can do so by calling
```git submodule update --init --recursive```

* Support for gzipped FASTA/FASTQ files requires the zlib compression library to be installed on your system.
Expand Down

0 comments on commit 9b88566

Please sign in to comment.