Skip to content

Releases: ROCm/hipCUB

hipCUB 2.13.0 for ROCm 5.4.1

15 Dec 18:39
ce87098
Compare
Choose a tag to compare

hipCUB code for ROCm 5.4.1 did not change. The library was rebuilt for the updated ROCm 5.4.1 stack.

hipCUB 2.13.0 for ROCm 5.4.0

30 Nov 17:34
ce87098
Compare
Choose a tag to compare

Added

  • CMake functionality to improve build parallelism of the test suite that splits compilation units by
    function or by parameters.
  • New overload for BlockAdjacentDifference::SubtractLeftPartialTile that takes a predecessor item.

Changed

  • Improved build parallelism of the test suite by splitting up large compilation units for DeviceRadixSort,
    DeviceSegmentedRadixSort and DeviceSegmentedSort.
  • CUB backend references CUB and thrust version 1.17.1.

hipCUB 2.12.0 for ROCm 5.3.3

17 Nov 19:21
Compare
Choose a tag to compare

hipCUB code for ROCm 5.3.3 did not change. The library was rebuilt for the updated ROCm 5.3.3 stack.

hipCUB 2.12.0 for ROCm 5.3.2

10 Nov 01:05
Compare
Choose a tag to compare

hipCUB code for ROCm 5.3.2 did not change. The library was rebuilt for the updated ROCm 5.3.2 stack.

hipCUB 2.12.0 for ROCm 5.3.1

28 Oct 16:58
Compare
Choose a tag to compare

hipCUB code for ROCm 5.3.1 did not change. The library was rebuilt for the updated ROCm 5.3.1 stack.

hipCUB 2.12.0 for ROCm 5.3.0

30 Sep 19:24
Compare
Choose a tag to compare

Added

  • UniqueByKey device algorithm
  • SubtractLeft, SubtractLeftPartialTile, SubtractRight, SubtractRightPartialTile overloads in BlockAdjacentDifference.
    • The old overloads (FlagHeads, FlagTails, FlagHeadsAndTails) are deprecated.
  • DeviceAdjacentDifference algorithm.
  • Extended benchmark suite of DeviceHistogram, DeviceScan, DevicePartition, DeviceReduce,
    DeviceSegmentedReduce, DeviceSegmentedRadixSort, DeviceRadixSort, DeviceSpmv, DeviceMergeSort,
    DeviceSegmentedSort

Changed

  • Obsolated type traits defined in util_type.hpp. Use the standard library equivalents instead.
  • CUB backend references CUB and thrust version 1.16.0.
  • DeviceRadixSort's num_items parameter's type is now templated instead of being an int.
    • If an integral type with a size at most 4 bytes is passed (i.e. an int), the former logic applies.
    • Otherwise the algorithm uses a larger indexing type that makes it possible to sort input data over 2**32 elements.
  • Improved build parallelism of the test suite by splitting up large compilation units

hipCUB 2.11.1 for ROCm 5.2.3

18 Aug 16:59
Compare
Choose a tag to compare

hipCUB code for ROCm 5.2.3 did not change. The library was rebuilt for the updated ROCm 5.2.3 stack.

hipCUB 2.11.1 for ROCm 5.2.1

21 Jul 20:23
Compare
Choose a tag to compare

hipCUB code for ROCm 5.2.1 did not change. The library was rebuilt for the updated ROCm 5.2.1 stack.

hipCUB 2.11.1 for ROCm 5.2.0

28 Jun 18:43
Compare
Choose a tag to compare

Added

  • Packages for tests and benchmark executable on all supported OSes using CPack.

hipCUB 2.11.0 for ROCm 5.1.3

20 May 17:05
Compare
Choose a tag to compare

hipCUB code for ROCm 5.1.3 did not change. The library was rebuilt for the updated ROCm 5.1.3 stack.