Releases: ROCm/hipCUB
Releases · ROCm/hipCUB
hipCUB 2.11.0 for ROCm 5.1.1
hipCUB code for ROCm 5.1.1 did not change. The library was rebuilt for the updated ROCm 5.1.1 stack.
hipCUB 2.11.0 for ROCm 5.1.0
Added
- Device segmented sort
- Warp merge sort, WarpMask and thread sort from cub 1.15.0 supported in hipCUB
- Device three way partition
Changed
- Device_scan and device_segmented_scan: inclusive_scan now uses the input-type as accumulator-type, exclusive_scan uses initial-value-type.
- This particularly changes behaviour of small-size input types with large-size output types (e.g. short input, int output).
- And low-res input with high-res output (e.g. float input, double output)
- Block merge sort no longer supports non power of two blocksizes
hipCUB-2.10.13 for ROCm 5.0.2
hipCUB code for ROCm 5.0.2 is unchanged from hipCUB for ROCm 5.0.1. The library was rebuilt for the updated ROCm 5.0.2 stack.
hipCUB-2.10.13 for ROCm 5.0.1
hipCUB code for ROCm 5.0.1 is unchanged from hipCUB for ROCm 5.0.0. The library was rebuilt for the updated ROCm 5.0.1 stack.
hipCUB-2.10.13 for ROCm 5.0.0
Fixed
- Added missing includes to hipcub.hpp
Added
- Bfloat16 support to test cases (device_reduce & device_radix_sort)
- Device merge sort
- Block merge sort
- API update to CUB 1.14.0
Changed
- The SetupNVCC.cmake automatic target selector select all of the capabalities of all available card for NVIDIA backend.
hipCUB-2.10.12 for ROCm 4.5.2
hipCUB code for ROCm 4.5.2 is unchanged from hipCUB for ROCm 4.5.0. The library was rebuilt for the updated ROCm 4.5.2 stack.
hipCUB-2.10.12 for ROCm 4.5.0
Addded
- Initial HIP on Windows support. See README for instructions on how to build and install.
- gfx1030 support added.
- Address Sanitizer build option
Changed
- Packaging changed to a development package (called hipcub-dev for
.deb
packages, and hipcub-devel for.rpm
packages). As hipCUB is a header-only library, there is no runtime package. To aid in the transition, the development package sets the "provides" field to provide the package hipcub, so that existing packages depending on hipcub can continue to work. This provides feature is introduced as a deprecated feature and will be removed in a future ROCm release.
Fixed
- BlockRadixRank unit test failure fixed.
hipCUB-2.10.10 for ROCm 4.3.1
No changes made for ROCm 4.3.1.
hipCUB-2.10.10 for ROCm 4.3.0
Added
- DiscardOutputIterator to backend header
hipCUB-2.10.9 for ROCm 4.2.0
Added
- Support for TexObjInputIterator and TexRefInputIterator
- Support for DevicePartition
Changed
- Minimum cmake version required is now 3.10.2
- CUB backend has been updated to 1.11.0
Fixed
- Benchmark build fixed
- nvcc build fixed