Arlo/tensor rt #1138

sheridana · 2023-01-20T00:12:43Z

Description

Adds unragging class for keras layers (useful if unragging is needed prior to model saving)
Adds WIP functionality for converting keras models to TensorRT models for faster inference when available.
Todo: clean up, add automatic system checks

Types of changes

Does this address any currently open issues?

1112

Outside contributors checklist

Review the guidelines for contributing to this repository
Read and sign the CLA and add yourself to the authors list
Make sure you are making a pull request against the develop branch (not main). Also you should start your branch off develop
Add tests that prove your fix is effective or that your feature works
Add necessary documentation (if appropriate)

Thank you for contributing to SLEAP!

❤️

codecov · 2023-01-20T00:39:18Z

Codecov Report

Merging #1138 (24d067a) into develop (b37b34f) will decrease coverage by 0.10%.
The diff coverage is 32.00%.

@@             Coverage Diff             @@
##           develop    #1138      +/-   ##
===========================================
- Coverage    70.05%   69.95%   -0.10%     
===========================================
  Files          131      131              
  Lines        22872    22916      +44     
===========================================
+ Hits         16022    16032      +10     
- Misses        6850     6884      +34

Impacted Files	Coverage Δ
sleap/nn/data/utils.py	`88.88% <28.57%> (-8.99%)`	⬇️
sleap/nn/inference.py	`78.41% <32.55%> (-1.54%)`	⬇️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

…/tensor-rt

- Support FP32 and FP16 precisions - Ensure tensor_rt arg is valid - Check if TensorRT was compiled correctly, fall back to normal prediction otherwise

sheridana · 2023-01-26T20:58:49Z

Example usage and benchmarking: https://github.com/sheridana/sleap_tensor_rt

Todos:

We probably want to use real data rather than a tracing batch to get full optimization
Figure out handling the changed dict keys after unragging in a more principled way
Get it to work for other predictors
Not sure best way to add tests for this since it is very system & hardware dependent...
Maybe add automatic check for precision availability? E.g if FP16 isn't available revert to FP32 immediately. This is already handled internally but is pretty verbose. Tricky to handle this because it's checked deep under the hood, maybe here
Add support for Int8?

sheridana added 2 commits January 19, 2023 19:07

Add unragging function for keras layers

bc4611f

Add WIP functionality to optionally support TensorRT

c15c53e

sheridana added 3 commits January 20, 2023 16:08

Only convert model if it hasn't yet been converted

d62fede

Merge branch 'develop' of https://github.com/talmolab/sleap into arlo…

380dc4a

…/tensor-rt

Update TensorRT conversion

24d067a

- Support FP32 and FP16 precisions - Ensure tensor_rt arg is valid - Check if TensorRT was compiled correctly, fall back to normal prediction otherwise

roomrys added the stale but not fixed Issues that have been backlogged for a long time, but may be addressed in the future label Mar 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Arlo/tensor rt #1138

Arlo/tensor rt #1138

sheridana commented Jan 20, 2023

codecov bot commented Jan 20, 2023 •

edited

Loading

sheridana commented Jan 26, 2023

Arlo/tensor rt #1138

Are you sure you want to change the base?

Arlo/tensor rt #1138

Conversation

sheridana commented Jan 20, 2023

Description

Types of changes

Does this address any currently open issues?

Outside contributors checklist

Thank you for contributing to SLEAP!

codecov bot commented Jan 20, 2023 • edited Loading

Codecov Report

sheridana commented Jan 26, 2023

codecov bot commented Jan 20, 2023 •

edited

Loading