[cudamapper] Accuracy improvements through chaining #565

edawson · 2020-09-21T19:02:43Z

In theory, this branch will help push our accuracy closer to that of minimap2 through changes in the chaining and scoring algorithms. It also includes some filtering improvements (which may be broken out into a separate PR).

Fixes a bug where the number of chains was not set correctly when generating anchmers. This caused many anchmers with full-length chains to be dropped. Also disables repeat masking by RLE for the moment to test the effect of the debugged anchmer chaining.

and [Guo et al.](https://vast.cs.ucla.edu/sites/default/files/publications/minimap2-acc-approved.pdf). Implements a transformation of minimap2's chaining algorithm similar to the one used in Guo et al. This involves a forward search (up to N overlaps) and a simple cumulative scoring algorithm. It is very similar to the windowed chaining algorithm used for anchmers, but it does not degrade when encountering repeated seeds (as anchmers and RLE do).
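For reviewers, a host-side sketch of the forward-search idea in this commit message may help. It is not the code in this branch: `SketchAnchor`, `ChainState`, `pair_score`, `chain_forward`, and the parameters `seed_len`, `max_gap`, and `max_lookahead` are hypothetical names, and the pair score is a simplified stand-in rather than minimap2's actual scoring function. It assumes anchors are already sorted by target and then query position.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <limits>
#include <vector>

// Hypothetical anchor type for illustration only (not cudamapper's Anchor).
struct SketchAnchor
{
    std::int32_t query_pos;
    std::int32_t target_pos;
};

struct ChainState
{
    std::vector<std::int32_t> score;       // best cumulative score of a chain ending at each anchor
    std::vector<std::int32_t> predecessor; // index of the previous anchor in that chain, -1 if none
};

constexpr std::int32_t kNoLink = std::numeric_limits<std::int32_t>::min();

// Simplified pair score (an assumption): matched span capped at seed_len,
// minus the difference between the query gap and the target gap.
inline std::int32_t pair_score(const SketchAnchor& a, const SketchAnchor& b,
                               std::int32_t seed_len, std::int32_t max_gap)
{
    const std::int32_t dq = b.query_pos - a.query_pos;
    const std::int32_t dt = b.target_pos - a.target_pos;
    if (dq <= 0 || dt <= 0 || dq > max_gap || dt > max_gap)
        return kNoLink; // not colinear, or too far apart to link
    return std::min(std::min(dq, dt), seed_len) - std::abs(dq - dt);
}

// Forward search: each anchor pushes its score to at most max_lookahead successors.
inline ChainState chain_forward(const std::vector<SketchAnchor>& anchors,
                                std::int32_t seed_len, std::int32_t max_gap,
                                std::int32_t max_lookahead)
{
    assert(max_lookahead > 0);
    const std::size_t n = anchors.size();
    ChainState st{std::vector<std::int32_t>(n, seed_len), std::vector<std::int32_t>(n, -1)};
    for (std::size_t i = 0; i < n; ++i)
    {
        const std::size_t end = std::min(n, i + 1 + static_cast<std::size_t>(max_lookahead));
        for (std::size_t j = i + 1; j < end; ++j)
        {
            const std::int32_t s = pair_score(anchors[i], anchors[j], seed_len, max_gap);
            if (s == kNoLink)
                continue;
            const std::int32_t candidate = st.score[i] + s;
            if (candidate > st.score[j])
            {
                st.score[j]       = candidate;
                st.predecessor[j] = static_cast<std::int32_t>(i);
            }
        }
    }
    return st;
}
```

Because the anchors are sorted, `score[i]` is final by the time anchor `i` is visited, so a single forward pass is enough. A smaller `max_lookahead` trades chain quality for less work per anchor.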

…g in overlapper_anchmer.cu. After refactoring the chaining loop to correctly utilize threads, this commit further reduces the amount of work by terminating the predecessor finding process as soon as a match is found (rather than continuing).

…es or which are contained. Adds methods and a procedure that filter overlaps that appear to be duplicates. Overlaps with greater than 80% reciprocal overlap are dropped, as are those that are completely contained within another overlap that occurs at a later index. Accuracy for the E. coli dataset has not improved significantly. However, the drosophila dataset

| Overlapper | Dataset | precision | recall | percent_correct | num_correct | num_records_mm2 | num_records_cudamapper |
| --- | --- | --- | --- | --- | --- | --- | --- |
| triggered | E. coli | 0.3248175182481752 | 0.9616427741185587 | 0.7530217566478646 | 5607 | 7743 | 15126 |
| minimap | E. coli | 0.41585440146207736 | 0.9120495931809376 | 0.7732936845086378 | 5461 | 7743 | 10850 |
| triggered | drosophila test | 0.11068702290076336 | 0.12323943661971831 | 0.8285714285714286 | 29 | 284 | 7 |
| minimap | drosophila test | 0.9819004524886877 | 0.852112676056338 | 0.8966942148760331 | 217 | 284 | 154 |
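The duplicate and containment filter described in this commit message could look roughly like the host-side sketch below. It is an illustration under stated assumptions, not the code in this branch: `SketchInterval`, `reciprocal_overlap`, `is_contained`, and `filter_duplicates` are hypothetical names, overlaps are reduced to one-dimensional half-open intervals on a single read, and the earlier of two near-duplicate overlaps is assumed to be the one that gets dropped.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical half-open interval [start, end) on one read; illustration only.
struct SketchInterval
{
    std::int32_t start;
    std::int32_t end;
};

// Fraction of the longer interval covered by the intersection.
// If this exceeds a threshold, both intervals are covered by more than that fraction.
inline double reciprocal_overlap(const SketchInterval& a, const SketchInterval& b)
{
    const std::int32_t inter = std::min(a.end, b.end) - std::max(a.start, b.start);
    if (inter <= 0)
        return 0.0;
    const std::int32_t longer = std::max(a.end - a.start, b.end - b.start);
    return static_cast<double>(inter) / longer;
}

// True if inner lies completely within outer.
inline bool is_contained(const SketchInterval& inner, const SketchInterval& outer)
{
    return inner.start >= outer.start && inner.end <= outer.end;
}

// Returns one keep flag per interval. Interval i is dropped if some later
// interval j is a near duplicate of it or completely contains it.
inline std::vector<bool> filter_duplicates(const std::vector<SketchInterval>& v, double threshold)
{
    assert(threshold > 0.0 && threshold <= 1.0);
    std::vector<bool> keep(v.size(), true);
    for (std::size_t i = 0; i < v.size(); ++i)
    {
        for (std::size_t j = i + 1; j < v.size(); ++j)
        {
            if (reciprocal_overlap(v[i], v[j]) > threshold || is_contained(v[i], v[j]))
            {
                keep[i] = false;
                break;
            }
        }
    }
    return keep;
}
```

With `threshold = 0.8` this mirrors the 80% cutoff mentioned above. The quadratic loop is only for clarity; a real implementation would compare overlaps within the same read pair only.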

predecessor search iterations during chaining.

- Reduces the number of search iterations during chaining to 32 (from 64). Minimap2 uses 50 and Guo et al. use 64; however, we may be able to get away with fewer since we have sorted anchors. We'll need to benchmark to find out.
- Multiple deletions to clean up code.

edawson added 30 commits August 11, 2020 09:25

[cudamapper] Add an overlapper_anchmer class which implements a basic…

0e525a9

… interface for anchmer-based overlapping.

[cudamapper] Basic anchmer chaining implementation.

5726a8a

Completes the basic functionality for the generate_anchmers kernel. Simply prints out anchmers currently.

[cudamapper] Implements overlap identification with anchmers.

3da7957

Provides a working (but unfinished) implementation of anchmer-based overlap generation. Filtering and sorting are not yet implemented.

[cudamapper] Implement overlap filtering and start RLE-encoding of ov…

934fcfb

…erlaps. Implements first-round overlap filtering using cub::Flagged and a masking kernel.
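As a rough host-side analogue of the mask-then-select pattern named in this commit, the sketch below builds a flag per overlap and then compacts the flagged items in order, which is the behavior `cub::DeviceSelect::Flagged` provides on the device. The helper names `make_length_mask` and `select_flagged`, the use of overlap length as the mask criterion, and the `min_length` parameter are assumptions for illustration; the actual kernel in this branch may test something else.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Masking step (hypothetical criterion): flag overlaps that are at least min_length long.
inline std::vector<std::uint8_t> make_length_mask(const std::vector<std::int32_t>& lengths,
                                                  std::int32_t min_length)
{
    std::vector<std::uint8_t> flags(lengths.size(), 0);
    for (std::size_t i = 0; i < lengths.size(); ++i)
        flags[i] = lengths[i] >= min_length ? 1 : 0;
    return flags;
}

// Selection step: copy items whose flag is nonzero, preserving their order.
template <typename T>
std::vector<T> select_flagged(const std::vector<T>& items, const std::vector<std::uint8_t>& flags)
{
    assert(items.size() == flags.size());
    std::vector<T> out;
    out.reserve(items.size());
    for (std::size_t i = 0; i < items.size(); ++i)
    {
        if (flags[i] != 0)
            out.push_back(items[i]);
    }
    return out;
}
```

Keeping the mask separate from the selection lets several filters write into the same flag array before a single compaction pass.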

[cudamapper] Implements overlap merging on GPU.

99e6bee

Implements a basic overlap fusion procedure on GPU, using cub's NonTrivialRuns and a set of CUDA kernels.
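To illustrate what a non-trivial-runs pass contributes to fusion, here is a host-side sketch that reports the offset and length of every run of equal adjacent keys longer than one element, which is what `cub::DeviceRunLengthEncode::NonTrivialRuns` computes on the device. `SketchRun` and `non_trivial_runs` are hypothetical names, and treating the key as a stand-in for "overlaps that should be fused" is an assumption about how the branch uses the result.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical result type: where a run starts and how many elements it spans.
struct SketchRun
{
    std::size_t offset;
    std::size_t length;
};

// Reports runs of equal adjacent keys with length > 1; singletons are skipped.
template <typename Key>
std::vector<SketchRun> non_trivial_runs(const std::vector<Key>& keys)
{
    std::vector<SketchRun> runs;
    std::size_t i = 0;
    while (i < keys.size())
    {
        std::size_t j = i + 1;
        while (j < keys.size() && keys[j] == keys[i])
            ++j;
        assert(j > i); // every run spans at least one element
        if (j - i > 1)
            runs.push_back({i, j - i});
        i = j;
    }
    return runs;
}
```

Each reported run marks a group of adjacent entries that a follow-up kernel could merge into a single fused overlap, while entries outside any run pass through unchanged.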

[cudamapper] Implement final overlap filtering on GPU.

6945786

[cudamapper] Disable CPU fusion by placing an early return in postpro…

9c4fe45

…cess_overlaps.

[cudamapper] Parameter tweaking to try to generate longer overlaps

57769e5

Turns off fusion in host code and removes the initial filtering mask for short overlaps in an attempt to generate longer intermediate and final overlaps.

[cudamapper] Implement mask for self-self mappings and reduce anchor …

6087dab

…equivalence distance from 150bp to 20bp.

[cudamapper] Disable initial filtering stage in overlapper_anchmer.

b787915

Disables initial filter for short overlaps before chaining anchmers. This improves recall compared to minimap2.

[cudamapper] Disable on-CPU fusing and reimplement anchmer chaining

41c5ebb

using overlapmers. Implements anchmer chaining using overlapmers, which are successively larger overlap windows. Disables CPU fusion again and runs several rounds of GPU-based overlapmer chaining instead.

[cudamapper] Turns debugging on for overlapper_anchmer and produces a

863e144

set of final overlaps that are only 70% intersection / 40% accurate compared to mm2.

[cudamapper] Alternative implementation of anchmers, with scoring.

0c689a0

This integrates a new implementation of anchmer-based overlap chaining which incorporates a simple scoring mechanism.

[cudamapper] Finish basic anchmer implementation.

4e1adf3

Completes an anchmer-based chaining algorithm (with scoring) that achieves 94% intersection / 68% correctness as reported by PAF-assess.

[cudamapper] Change the definition of identical anchors.

b0bde26

Changes the == operator of Anchors to prevent fusing two anchors if they have the same query position in read. Two adjacent anchors tend to have the same query_position_in_read when a repeat is present.

[cudamapper] Remove debugging output from overlapper_anchmer.cu.

8733d4b

[cudamapper] Remove all debugging in overlapper_anchmer.

8c6f465

Removes all the cerr debugging output, as PEF logs were exploding to large sizes.

[cudamapper] Add an approximate fast gap scoring function to overlapp…

cd8bf64

…er_anchmer.

[cudamapper] Implement scoring algorithm from minimap2.

ea9dc79

Implements minimap2's scoring algorithm for chaining.

[cudamapper] Clean up overlapper_anchmer and add a simple primary cha…

e1fd5ed

…in scoring function.

[cudamapper] Change the definition of minimum length in overlapper_an…

ec1c2ab

…chmer and refactor the initial ID check.

[cudamapper] Begin refactor of minimap2-inspired chaining to overlapp…

b0122d3

…er_minimap. Switches overlapping to a new overlapper (overlapper_minimap).

[cudamapper] Implement filtering of self mappings and relevant comman…

d7b0d3c

…d line options. Implements command line options and functions which filter read overlaps covering the entire read. If `-X` is passed, such overlaps are removed.

[cudamapper] Implement overlapper_minimap.

ec8350e

Implements chaining using a modified version of minimap2's chaining algorithm.

[cudamapper] Remove status report message in overlapper_minimapper.cu.

7a649b8

edawson and others added 6 commits September 29, 2020 12:49

[overlapper_minimap] Fix select_mask indexing for masking non-max cha…

1924059

…ins. Fixes local indexing issues when masking anchors that are not chain terminators. However, this is still not the most efficient way of doing traceback.

Merge pull request #1 from edawson/qtpairs-chainer

4f896c3

Qtpairs chainer

Merge branch 'dev-v0.6.0' into anchmer-fast-score

7d7bad8

squash changes

ee01ab0

small fix: bitwise op -> boolean

18119aa

Merge pull request #3 from nvvishanthi/nvvishanthi/chaining-scoring-f…

c2dc3e1

…ixes squash changes

mimaric assigned edawson Oct 11, 2020

mimaric added the cudamapper GPU-based overlapper label Oct 11, 2020

edawson and others added 18 commits October 12, 2020 22:06

[cudamapper] Remove OverlapperMinimap test file, refactor to use back…

1dffb46

…trace in chainerutils.

Merge pull request #5 from edawson/chainer-utils-backtrace

9413bf8

[cudamapper] Remove OverlapperMinimap test file, refactor to use back…

1. precision/recall improvements 2. removed extra syncthreads 3. othe…

40f5550

…r small changes

1. fix case where anchors in last tile are not tile-aligned 2. kind o…

1d8e6e0

…f add support for writing intermediate results from the ith tile to the i+1th tile

1. fix backtrace issue where potentially some anchors did not go thro…

333ef13

…ugh backtrace 2. add debug code for backtrace by serializing backtrace and cpu or gpu

Merge pull request #6 from nvvishanthi/nvvishanthi/chaining-precision…

6279bd6

…-recall-improvements 1. precision/recall improvements 2. removed extra syncthreads 3. othe…

[cudamapper] Implement a single-read-per-block style chaining impleme…

7dee298

…ntation that processes tiles from a given read sequentially.

[cudamapper] Reenable writing the score, predecessor, and anchor at t…

28e6e92

…he end of a chain so that it may be used in the next tile when running chain_anchors_in_tile.

[cudamapper] Toggle build options for pygenomeworks. Make offset tile…

6b97d54

…_size+1 when processing next read tile. Turn off postprocessing of overlaps.

[cudamapper] Comment out all filtering for testing.

ae6d597

[cudamapper] Attempt to fix index errors related to chain starts.

6f5f711

[chainerutils] Fix multiple bugs in chainer_utils.cu introduced by no…

fb8fb1b

…t properly placing CUB work in a cuda_stream.

[overlapper_minimap] Modify overlapper minimap to only chain tiles in…

be04ef6

… an entire read. Move functions to grid-stride loops with a macro-defined number of blocks/threads where possible.

[overlapper] Move reciprocal-overlap testing and duplicate removal to…

d14c835

… overlapper.cpp

[chainerutils] Fix wrong index use in backtrace_anchors_to_overlaps.

8f4168b

[overlapper_minimap] Set default for reciprocal overlap check to 50 o…

4e686d4

…verlaps and threshold to >= 0.9

Merge pull request #9 from edawson/tiler-chainer

b58f705

Tiler chainer

[chainer_utils] Revert attempted changes to backtrace anchors, and de…

b5a7c70

…crease block_count in overlapper_minimap. The reduction in block size seems to increase recall for t.fa, which is indicative of unstable behavior.

Base automatically changed from dev-v0.6.0 to dev February 8, 2021 22:44

ohadmo closed this Feb 8, 2021

ohadmo deleted the branch NVIDIA-Genomics-Research:dev February 8, 2021 22:54

ohadmo reopened this Feb 8, 2021
