Skip to content

Releases: ROCm/hipCUB

hipCUB 2.11.0 for ROCm 5.1.1

08 Apr 20:52
Compare
Choose a tag to compare

hipCUB code for ROCm 5.1.1 did not change. The library was rebuilt for the updated ROCm 5.1.1 stack.

hipCUB 2.11.0 for ROCm 5.1.0

30 Mar 17:28
Compare
Choose a tag to compare

Added

  • Device segmented sort
  • Warp merge sort, WarpMask and thread sort from cub 1.15.0 supported in hipCUB
  • Device three way partition

Changed

  • Device_scan and device_segmented_scan: inclusive_scan now uses the input-type as accumulator-type, exclusive_scan uses initial-value-type.
    • This particularly changes behaviour of small-size input types with large-size output types (e.g. short input, int output).
    • And low-res input with high-res output (e.g. float input, double output)
    • Block merge sort no longer supports non power of two blocksizes

hipCUB-2.10.13 for ROCm 5.0.2

04 Mar 17:53
e68cbbd
Compare
Choose a tag to compare

hipCUB code for ROCm 5.0.2 is unchanged from hipCUB for ROCm 5.0.1. The library was rebuilt for the updated ROCm 5.0.2 stack.

hipCUB-2.10.13 for ROCm 5.0.1

16 Feb 22:14
e68cbbd
Compare
Choose a tag to compare

hipCUB code for ROCm 5.0.1 is unchanged from hipCUB for ROCm 5.0.0. The library was rebuilt for the updated ROCm 5.0.1 stack.

hipCUB-2.10.13 for ROCm 5.0.0

09 Feb 20:28
e68cbbd
Compare
Choose a tag to compare

Fixed

  • Added missing includes to hipcub.hpp

Added

  • Bfloat16 support to test cases (device_reduce & device_radix_sort)
  • Device merge sort
  • Block merge sort
  • API update to CUB 1.14.0

Changed

  • The SetupNVCC.cmake automatic target selector select all of the capabalities of all available card for NVIDIA backend.

hipCUB-2.10.12 for ROCm 4.5.2

10 Dec 19:18
189ae50
Compare
Choose a tag to compare

hipCUB code for ROCm 4.5.2 is unchanged from hipCUB for ROCm 4.5.0. The library was rebuilt for the updated ROCm 4.5.2 stack.

hipCUB-2.10.12 for ROCm 4.5.0

27 Oct 21:23
189ae50
Compare
Choose a tag to compare

Addded

  • Initial HIP on Windows support. See README for instructions on how to build and install.
  • gfx1030 support added.
  • Address Sanitizer build option

Changed

  • Packaging changed to a development package (called hipcub-dev for .deb packages, and hipcub-devel for .rpm packages). As hipCUB is a header-only library, there is no runtime package. To aid in the transition, the development package sets the "provides" field to provide the package hipcub, so that existing packages depending on hipcub can continue to work. This provides feature is introduced as a deprecated feature and will be removed in a future ROCm release.

Fixed

  • BlockRadixRank unit test failure fixed.

hipCUB-2.10.10 for ROCm 4.3.1

27 Aug 17:40
ec415b2
Compare
Choose a tag to compare

No changes made for ROCm 4.3.1.

hipCUB-2.10.10 for ROCm 4.3.0

30 Jul 22:51
ec415b2
Compare
Choose a tag to compare

Added

  • DiscardOutputIterator to backend header

hipCUB-2.10.9 for ROCm 4.2.0

10 May 23:17
Compare
Choose a tag to compare

Added

  • Support for TexObjInputIterator and TexRefInputIterator
  • Support for DevicePartition

Changed

  • Minimum cmake version required is now 3.10.2
  • CUB backend has been updated to 1.11.0

Fixed

  • Benchmark build fixed
  • nvcc build fixed