Skip to content

NVDLA 1.2.0

Compare
Choose a tag to compare
@prasshantg prasshantg released this 28 May 21:03
· 10 commits to master since this release

Release 1.2.0

NVDLA 1.2.0 release is focused on INT8 precision and nv_large/nv_small configurations support along with performance optimizations

New features and improvements

  • INT8 precision support
    • Per-tensor and per-kernel scale factors
    • Symmetric scaling
  • nv_small and nv_large configuration support
  • FP16 winograd support
  • SDP fusion performance feature

Fixes and other changes

  • Failures in deconvolution for number of groups greater than one
  • Disable scale+bias fusion to BN as it is not required with SDP fusion enabled
  • Disable winograd for output layers of network as it causes size mismatch due to special requirements of winograd
  • Return error for pooling layers with pad size greater than kernel size

Known limitations

  • An INT8 calibration tool is not provided