hipCUB 2.11.0 for ROCm 5.1.0

@lawruble13 lawruble13 released this 30 Mar 17:28

Added

  • Device segmented sort
  • Warp merge sort, WarpMask, and thread sort from CUB 1.15.0 are now supported in hipCUB
  • Device three-way partition

Changed

  • Device_scan and device_segmented_scan: inclusive_scan now uses the input type as the accumulator type; exclusive_scan uses the initial-value type.
    • This particularly changes behaviour when a small input type is paired with a larger output type (e.g. short input, int output).
    • It also affects low-precision input paired with high-precision output (e.g. float input, double output).
  • Block merge sort no longer supports non-power-of-two block sizes
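
To illustrate why the accumulator-type change for scans matters, here is a minimal host-side sketch in plain C++. It does not call hipCUB; `inclusive_sum_with_accumulator` is a hypothetical helper that only emulates a sequential inclusive sum with an explicitly chosen accumulator type. It uses unsigned 16-bit input (rather than `short`) so that the wrap-around is well defined.

```cpp
#include <cstdint>
#include <vector>

// Hypothetical helper, not part of hipCUB: a sequential inclusive sum whose
// accumulator type (Acc) is chosen independently of the input (In) and
// output (Out) types.
template <typename Acc, typename In, typename Out>
std::vector<Out> inclusive_sum_with_accumulator(const std::vector<In>& input)
{
    std::vector<Out> output;
    output.reserve(input.size());

    Acc running{}; // partial sums are stored in the accumulator type
    for (const In& value : input)
    {
        // Narrowing to Acc on every step is what truncates or rounds the sum.
        running = static_cast<Acc>(running + value);
        output.push_back(static_cast<Out>(running));
    }
    return output;
}
```

With the input `{40000, 40000}` stored as `std::uint16_t` and written to `std::uint32_t` output, accumulating in the input type yields `{40000, 14464}` because the second partial sum wraps modulo 65536, while accumulating in the output type yields `{40000, 80000}`. The same effect appears as lost precision for float input with double output. Code that relied on the wider accumulator may need to convert its input to the wider type before scanning.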