Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Internalize cuda/detail/core/* #3505

Merged
merged 1 commit into from
Jan 30, 2025
Merged

Conversation

bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Jan 23, 2025

This moves all content of thrust/system/cuda/detail/core/util.h into a detail namespace. This is a breaking change.

Does any of the changes here require a deprecation to be backported?

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 23, 2025 11:41
@bernhardmgruber bernhardmgruber added thrust For all items related to Thrust. breaking Breaking change labels Jan 23, 2025
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 26, 2025 20:14
@bernhardmgruber bernhardmgruber force-pushed the ref_core_cub_util branch 2 times, most recently from 3f99dcf to 866286b Compare January 28, 2025 15:23
@bernhardmgruber bernhardmgruber enabled auto-merge (squash) January 28, 2025 17:43
@bernhardmgruber
Copy link
Contributor Author

@gevtushenko does any of the entities in thrust/system/cuda/detail/core/util.h need a deprecation first?

Copy link
Contributor

🟨 CI finished in 4h 01m: Pass: 98%/89 | Total: 2d 06h | Avg: 36m 28s | Max: 1h 05m | Hits: 418%/10928
  • 🟨 cub: Pass: 97%/44 | Total: 1d 09h | Avg: 45m 05s | Max: 1h 05m | Hits: 538%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total:  1d 07h | Avg: 44m 25s | Max:  1h 05m | Hits: 538%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 01s | Max:  1h 02m
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  3h 57m | Avg: 47m 27s | Max: 54m 26s | Hits: 538%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m
      🔍 12.6               Pass:  97%/37  | Total:  1d 02h | Avg: 43m 39s | Max:  1h 02m | Hits: 538%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 48m | Avg: 54m 02s | Max: 54m 56s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 57m | Avg: 47m 27s | Max: 54m 26s | Hits: 538%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m
      🔍 nvcc12.6           Pass:  97%/35  | Total:  1d 01h | Avg: 43m 03s | Max:  1h 02m | Hits: 538%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 02s | Max: 54m 56s
      🔍 nvcc               Pass:  97%/42  | Total:  1d 07h | Avg: 44m 39s | Max:  1h 05m | Hits: 538%/3552  
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  3h 24m | Avg: 51m 07s | Max: 54m 26s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 45m | Avg: 52m 34s | Max: 55m 39s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 39m | Avg: 49m 34s | Max: 51m 13s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 39m | Avg: 49m 42s | Max: 51m 56s
      🔍 Clang18            Pass:  85%/7   | Total:  5h 06m | Avg: 43m 43s | Max:  1h 02m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 45m | Avg: 52m 55s | Max: 54m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 52m 00s | Avg: 52m 00s | Max: 52m 00s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 45m | Avg: 52m 36s | Max: 54m 26s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 11s | Max:  1h 01m
      🟩 GCC11              Pass: 100%/2   | Total:  1h 41m | Avg: 50m 55s | Max: 51m 52s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 30m | Avg: 37m 36s | Max: 57m 13s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 28m | Avg: 33m 37s | Max: 55m 23s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 04m | Avg: 32m 03s | Max: 34m 47s | Hits: 538%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 42s | Max: 37m 23s | Hits: 538%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  94%/17  | Total: 13h 34m | Avg: 47m 54s | Max:  1h 02m
      🟩 GCC                Pass: 100%/21  | Total: 15h 00m | Avg: 42m 53s | Max:  1h 01m
      🟩 MSVC               Pass: 100%/4   | Total:  2h 17m | Avg: 34m 22s | Max: 37m 23s | Hits: 538%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 05m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 45m 01s | Avg: 22m 30s | Max: 23m 27s
      🔍 v100               Pass:  97%/42  | Total:  1d 08h | Avg: 46m 09s | Max:  1h 05m | Hits: 538%/3552  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/37  | Total:  1d 06h | Avg: 49m 47s | Max:  1h 05m | Hits: 538%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 52s | Avg: 21m 52s | Max: 21m 52s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 45s | Avg: 16m 45s | Max: 16m 45s
      🔍 HostLaunch         Pass:  66%/3   | Total: 45m 48s | Avg: 15m 16s | Max: 21m 34s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 19s | Avg: 28m 39s | Max: 29m 46s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 16h 32m | Avg: 49m 36s | Max:  1h 05m | Hits: 538%/2664  
      🔍 20                 Pass:  95%/24  | Total: 16h 31m | Avg: 41m 19s | Max:  1h 05m | Hits: 538%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 01s | Avg: 22m 30s | Max: 23m 27s
      🟩 90a                Pass: 100%/1   | Total: 22m 22s | Avg: 22m 22s | Max: 22m 22s
    
  • 🟩 thrust: Pass: 100%/42 | Total: 20h 09m | Avg: 28m 47s | Max: 57m 11s | Hits: 360%/7376

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 39m 20s | Avg: 19m 40s | Max: 24m 29s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 19h 15m | Avg: 28m 53s | Max: 57m 11s | Hits: 360%/7376  
      🟩 arm64              Pass: 100%/2   | Total: 54m 04s | Avg: 27m 02s | Max: 28m 20s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 24m | Avg: 28m 51s | Max: 33m 44s | Hits: 360%/1844  
      🟩 12.5               Pass: 100%/2   | Total:  1h 53m | Avg: 56m 44s | Max: 57m 11s
      🟩 12.6               Pass: 100%/35  | Total: 15h 51m | Avg: 27m 11s | Max: 34m 54s | Hits: 360%/5532  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 50m 55s | Avg: 25m 27s | Max: 25m 56s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 24m | Avg: 28m 51s | Max: 33m 44s | Hits: 360%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 53m | Avg: 56m 44s | Max: 57m 11s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 15h 00m | Avg: 27m 17s | Max: 34m 54s | Hits: 360%/5532  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 50m 55s | Avg: 25m 27s | Max: 25m 56s
      🟩 nvcc               Pass: 100%/40  | Total: 19h 18m | Avg: 28m 57s | Max: 57m 11s | Hits: 360%/7376  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 56m | Avg: 29m 01s | Max: 31m 14s
      🟩 Clang15            Pass: 100%/2   | Total: 56m 12s | Avg: 28m 06s | Max: 28m 38s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 22s | Max: 31m 02s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 43s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 35m | Avg: 22m 11s | Max: 29m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 04m | Avg: 32m 28s | Max: 34m 54s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 08s | Avg: 31m 08s | Max: 31m 08s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 04s | Max: 33m 44s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 53s | Max: 31m 59s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 49s | Max: 33m 05s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 26s | Max: 34m 23s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 54m | Avg: 21m 46s | Max: 32m 49s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 52m 49s | Avg: 26m 24s | Max: 27m 52s | Hits: 360%/3688  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 03m | Avg: 31m 51s | Max: 33m 25s | Hits: 360%/3688  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 53m | Avg: 56m 44s | Max: 57m 11s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 30m | Avg: 26m 30s | Max: 31m 43s
      🟩 GCC                Pass: 100%/19  | Total:  8h 48m | Avg: 27m 49s | Max: 34m 54s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 56m | Avg: 29m 08s | Max: 33m 25s | Hits: 360%/7376  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 53m | Avg: 56m 44s | Max: 57m 11s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 20h 09m | Avg: 28m 47s | Max: 57m 11s | Hits: 360%/7376  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 19h 09m | Avg: 31m 03s | Max: 57m 11s | Hits: 360%/7376  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 40s | Avg:  7m 50s | Max:  7m 56s
      🟩 TestGPU            Pass: 100%/3   | Total: 44m 30s | Avg: 14m 50s | Max: 15m 29s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 17m 30s | Avg: 17m 30s | Max: 17m 30s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 10h 34m | Avg: 31m 44s | Max: 56m 18s | Hits: 360%/5532  
      🟩 20                 Pass: 100%/20  | Total:  8h 55m | Avg: 26m 45s | Max: 57m 11s | Hits: 360%/1844  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 26s | Avg: 4m 43s | Max: 7m 16s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 26s | Avg:  4m 43s | Max:  7m 16s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 10s | Avg:  2m 10s | Max:  2m 10s
      🟩 Test               Pass: 100%/1   | Total:  7m 16s | Avg:  7m 16s | Max:  7m 16s
    
  • 🟩 python: Pass: 100%/1 | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 43m 47s | Avg: 43m 47s | Max: 43m 47s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
8 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@bernhardmgruber bernhardmgruber changed the title Internalize cuda/detail/core/util.h Internalize cuda/detail/core/* Jan 29, 2025
@bernhardmgruber bernhardmgruber requested a review from a team as a code owner January 29, 2025 17:36
Copy link
Contributor

🟨 CI finished in 2h 57m: Pass: 98%/89 | Total: 1d 08h | Avg: 21m 52s | Max: 58m 40s | Hits: 402%/10936
  • 🟨 cub: Pass: 97%/44 | Total: 23h 51m | Avg: 32m 32s | Max: 58m 40s | Hits: 539%/3552

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/42  | Total: 22h 26m | Avg: 32m 04s | Max: 58m 40s | Hits: 539%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 24m | Avg: 42m 27s | Max: 42m 28s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total:  2h 41m | Avg: 32m 15s | Max: 41m 07s | Hits: 539%/888   
      🟩 12.5               Pass: 100%/2   | Total:  1h 24m | Avg: 42m 00s | Max: 43m 00s
      🔍 12.6               Pass:  97%/37  | Total: 19h 46m | Avg: 32m 04s | Max: 58m 40s | Hits: 538%/2664  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 45m | Avg: 52m 41s | Max: 53m 40s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 41m | Avg: 32m 15s | Max: 41m 07s | Hits: 539%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 24m | Avg: 42m 00s | Max: 43m 00s
      🔍 nvcc12.6           Pass:  97%/35  | Total: 18h 01m | Avg: 30m 53s | Max: 58m 40s | Hits: 538%/2664  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 45m | Avg: 52m 41s | Max: 53m 40s
      🔍 nvcc               Pass:  97%/42  | Total: 22h 06m | Avg: 31m 34s | Max: 58m 40s | Hits: 539%/3552  
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total:  1h 55m | Avg: 28m 58s | Max: 38m 43s
      🟩 Clang15            Pass: 100%/2   | Total: 42m 19s | Avg: 21m 09s | Max: 36m 25s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 16m | Avg: 38m 18s | Max: 42m 45s
      🟩 Clang17            Pass: 100%/2   | Total: 41m 50s | Avg: 20m 55s | Max: 36m 05s
      🔍 Clang18            Pass:  85%/7   | Total:  4h 07m | Avg: 35m 24s | Max: 53m 40s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 18m | Avg: 39m 03s | Max: 42m 46s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 23s | Avg:  5m 23s | Max:  5m 23s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 17m | Avg: 38m 41s | Max: 39m 32s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 10m | Avg: 35m 18s | Max: 35m 43s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 57s | Max: 38m 15s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 58m | Avg: 29m 32s | Max: 58m 40s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 42m | Avg: 27m 50s | Max: 43m 31s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 25m | Avg: 42m 46s | Max: 44m 25s | Hits: 539%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 29m | Avg: 44m 42s | Max: 45m 34s | Hits: 538%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 24m | Avg: 42m 00s | Max: 43m 00s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  94%/17  | Total:  8h 44m | Avg: 30m 51s | Max: 53m 40s
      🟩 GCC                Pass: 100%/21  | Total: 10h 48m | Avg: 30m 52s | Max: 58m 40s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 54m | Avg: 43m 44s | Max: 45m 34s | Hits: 539%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 24m | Avg: 42m 00s | Max: 43m 00s
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 19m 15s
      🔍 v100               Pass:  97%/42  | Total: 23h 28m | Avg: 33m 31s | Max: 58m 40s | Hits: 539%/3552  
    🔍 jobs: HostLaunch 🔍
      🟩 Build              Pass: 100%/37  | Total: 21h 30m | Avg: 34m 52s | Max: 58m 40s | Hits: 539%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 28m 54s | Avg: 28m 54s | Max: 28m 54s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 08s | Avg: 16m 08s | Max: 16m 08s
      🔍 HostLaunch         Pass:  66%/3   | Total: 46m 30s | Avg: 15m 30s | Max: 23m 14s
      🟩 TestGPU            Pass: 100%/2   | Total: 49m 41s | Avg: 24m 50s | Max: 25m 33s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/20  | Total: 12h 35m | Avg: 37m 46s | Max: 53m 40s | Hits: 538%/2664  
      🔍 20                 Pass:  95%/24  | Total: 11h 16m | Avg: 28m 10s | Max: 58m 40s | Hits: 539%/888   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 41s | Avg: 11m 50s | Max: 19m 15s
      🟩 90a                Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s
    
  • 🟩 thrust: Pass: 100%/42 | Total: 7h 36m | Avg: 10m 52s | Max: 45m 21s | Hits: 337%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 51m 28s | Avg: 25m 44s | Max: 45m 21s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total:  7h 26m | Avg: 11m 09s | Max: 45m 21s | Hits: 337%/7384  
      🟩 arm64              Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 53s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 33s | Avg:  9m 30s | Max: 26m 00s | Hits: 346%/1846  
      🟩 12.5               Pass: 100%/2   | Total: 36m 16s | Avg: 18m 08s | Max: 20m 56s
      🟩 12.6               Pass: 100%/35  | Total:  6h 12m | Avg: 10m 39s | Max: 45m 21s | Hits: 334%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  5m 26s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 33s | Avg:  9m 30s | Max: 26m 00s | Hits: 346%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 36m 16s | Avg: 18m 08s | Max: 20m 56s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  6h 02m | Avg: 10m 58s | Max: 45m 21s | Hits: 334%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 38s | Avg:  5m 19s | Max:  5m 26s
      🟩 nvcc               Pass: 100%/40  | Total:  7h 26m | Avg: 11m 09s | Max: 45m 21s | Hits: 337%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 19s | Avg:  5m 19s | Max:  5m 32s
      🟩 Clang15            Pass: 100%/2   | Total: 12m 01s | Avg:  6m 00s | Max:  6m 02s
      🟩 Clang16            Pass: 100%/2   | Total: 12m 05s | Avg:  6m 02s | Max:  6m 25s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 45s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 09m | Avg:  9m 58s | Max: 34m 56s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 04s | Avg:  5m 32s | Max:  5m 39s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 47s | Avg:  5m 47s | Max:  5m 47s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 27s | Avg:  5m 43s | Max:  5m 47s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 49s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  6m 11s
      🟩 GCC12              Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  5m 52s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 36m | Avg: 12m 05s | Max: 45m 21s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 40s | Max: 35m 20s | Hits: 338%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 37s | Max: 38m 01s | Hits: 336%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 36m 16s | Avg: 18m 08s | Max: 20m 56s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 06m | Avg:  7m 25s | Max: 34m 56s
      🟩 GCC                Pass: 100%/19  | Total:  2h 39m | Avg:  8m 23s | Max: 45m 21s
      🟩 MSVC               Pass: 100%/4   | Total:  2h 14m | Avg: 33m 38s | Max: 38m 01s | Hits: 337%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total: 36m 16s | Avg: 18m 08s | Max: 20m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total:  7h 36m | Avg: 10m 52s | Max: 45m 21s | Hits: 337%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 45m | Avg:  9m 19s | Max: 38m 01s | Hits: 337%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 44s | Avg:  7m 52s | Max:  8m 08s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 57s | Max: 45m 21s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 40s | Avg:  4m 40s | Max:  4m 40s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 29m | Avg: 10m 27s | Max: 35m 20s | Hits: 338%/5538  
      🟩 20                 Pass: 100%/20  | Total:  3h 16m | Avg:  9m 48s | Max: 38m 01s | Hits: 335%/1846  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 12m 09s | Avg: 6m 04s | Max: 10m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max: 10m 01s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s
      🟩 Test               Pass: 100%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 54s | Avg: 45m 54s | Max: 45m 54s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
8 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

Copy link
Contributor

🟩 CI finished in 5h 43m: Pass: 100%/89 | Total: 2d 17h | Avg: 43m 55s | Max: 1h 45m | Hits: 199%/10936
  • 🟩 cub: Pass: 100%/44 | Total: 1d 16h | Avg: 55m 30s | Max: 1h 45m | Hits: 257%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 14h | Avg: 55m 17s | Max:  1h 45m | Hits: 257%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 57s | Max:  1h 00m
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 00m | Avg:  1h 00m | Max:  1h 06m | Hits: 257%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
      🟩 12.6               Pass: 100%/37  | Total:  1d 09h | Avg: 54m 05s | Max:  1h 45m | Hits: 256%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 00m | Avg:  1h 00m | Max:  1h 06m | Hits: 257%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1d 07h | Avg: 53m 45s | Max:  1h 45m | Hits: 256%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 00m
      🟩 nvcc               Pass: 100%/42  | Total:  1d 14h | Avg: 55m 17s | Max:  1h 45m | Hits: 257%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 58m | Avg: 59m 38s | Max:  1h 01m
      🟩 Clang15            Pass: 100%/2   | Total:  1h 54m | Avg: 57m 15s | Max: 59m 45s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 55m | Avg: 57m 41s | Max: 58m 47s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 55m | Avg: 57m 38s | Max:  1h 00m
      🟩 Clang18            Pass: 100%/7   | Total:  5h 54m | Avg: 50m 35s | Max:  1h 02m
      🟩 GCC7               Pass: 100%/2   | Total:  1h 52m | Avg: 56m 29s | Max: 56m 57s
      🟩 GCC8               Pass: 100%/1   | Total: 54m 43s | Avg: 54m 43s | Max: 54m 43s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 02s | Max: 59m 32s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 56m | Avg: 58m 15s | Max:  1h 01m
      🟩 GCC11              Pass: 100%/2   | Total:  1h 57m | Avg: 58m 59s | Max: 59m 01s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 41m | Avg: 40m 19s | Max:  1h 01m
      🟩 GCC13              Pass: 100%/8   | Total:  6h 49m | Avg: 51m 09s | Max:  1h 45m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 09m | Hits: 257%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 21m | Avg:  1h 10m | Max:  1h 11m | Hits: 256%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 15h 37m | Avg: 55m 09s | Max:  1h 02m
      🟩 GCC                Pass: 100%/21  | Total: 18h 06m | Avg: 51m 45s | Max:  1h 45m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 37m | Avg:  1h 09m | Max:  1h 11m | Hits: 257%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 12m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 42m 43s | Avg: 21m 21s | Max: 23m 29s
      🟩 v100               Pass: 100%/42  | Total:  1d 15h | Avg: 57m 07s | Max:  1h 45m | Hits: 257%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 12h | Avg: 58m 26s | Max:  1h 12m | Hits: 257%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 46m 11s | Avg: 46m 11s | Max: 46m 11s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 58s | Avg: 15m 58s | Max: 15m 58s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 29m | Avg: 29m 41s | Max: 36m 53s
      🟩 TestGPU            Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 45m
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 42m 43s | Avg: 21m 21s | Max: 23m 29s
      🟩 90a                Pass: 100%/1   | Total: 25m 20s | Avg: 25m 20s | Max: 25m 20s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 20h 14m | Avg:  1h 00m | Max:  1h 12m | Hits: 257%/2664  
      🟩 20                 Pass: 100%/24  | Total: 20h 27m | Avg: 51m 08s | Max:  1h 45m | Hits: 254%/888   
    
  • 🟩 thrust: Pass: 100%/42 | Total: 23h 30m | Avg: 33m 35s | Max: 1h 12m | Hits: 171%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 41m 51s | Avg: 20m 55s | Max: 24m 44s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 22h 31m | Avg: 33m 47s | Max:  1h 12m | Hits: 171%/7384  
      🟩 arm64              Pass: 100%/2   | Total: 59m 20s | Avg: 29m 40s | Max: 31m 29s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 06m | Avg: 37m 18s | Max: 58m 26s | Hits: 169%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 54m | Avg: 57m 12s | Max: 58m 24s
      🟩 12.6               Pass: 100%/35  | Total: 18h 29m | Avg: 31m 42s | Max:  1h 12m | Hits: 172%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 55m 50s | Avg: 27m 55s | Max: 29m 55s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 06m | Avg: 37m 18s | Max: 58m 26s | Hits: 169%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 54m | Avg: 57m 12s | Max: 58m 24s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 17h 33m | Avg: 31m 56s | Max:  1h 12m | Hits: 172%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 55m 50s | Avg: 27m 55s | Max: 29m 55s
      🟩 nvcc               Pass: 100%/40  | Total: 22h 34m | Avg: 33m 52s | Max:  1h 12m | Hits: 171%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 04m | Avg: 31m 11s | Max: 33m 43s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 00m | Avg: 30m 09s | Max: 30m 47s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 04m | Avg: 32m 20s | Max: 33m 34s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 05m | Avg: 32m 50s | Max: 33m 40s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 58m | Avg: 25m 28s | Max: 33m 32s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 05s | Max: 34m 54s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 36s | Avg: 31m 36s | Max: 31m 36s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 07m | Avg: 33m 54s | Max: 36m 42s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 48s | Max: 33m 25s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 30s | Max: 33m 02s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 07m | Avg: 33m 36s | Max: 34m 11s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 09m | Avg: 23m 40s | Max: 38m 50s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 12m | Hits: 173%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m | Hits: 169%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 54m | Avg: 57m 12s | Max: 58m 24s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  8h 13m | Avg: 29m 02s | Max: 33m 43s
      🟩 GCC                Pass: 100%/19  | Total:  9h 10m | Avg: 28m 59s | Max: 38m 50s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 11m | Avg:  1h 02m | Max:  1h 12m | Hits: 171%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 54m | Avg: 57m 12s | Max: 58m 24s
    🟩 gpu
      🟩 v100               Pass: 100%/42  | Total: 23h 30m | Avg: 33m 35s | Max:  1h 12m | Hits: 171%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 22h 16m | Avg: 36m 07s | Max:  1h 12m | Hits: 171%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 14m 20s | Avg:  7m 10s | Max:  7m 10s
      🟩 TestGPU            Pass: 100%/3   | Total: 59m 42s | Avg: 19m 54s | Max: 23m 21s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 19m 14s | Avg: 19m 14s | Max: 19m 14s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 12h 38m | Avg: 37m 55s | Max:  1h 12m | Hits: 172%/5538  
      🟩 20                 Pass: 100%/20  | Total: 10h 10m | Avg: 30m 31s | Max:  1h 04m | Hits: 169%/1846  
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 19s | Avg: 5m 39s | Max: 8m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  8m 55s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s
      🟩 Test               Pass: 100%/1   | Total:  8m 55s | Avg:  8m 55s | Max:  8m 55s
    
  • 🟩 python: Pass: 100%/1 | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 45m 30s | Avg: 45m 30s | Max: 45m 30s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 89)

# Runner
65 linux-amd64-cpu16
11 linux-amd64-gpu-v100-latest-1
8 windows-amd64-cpu16
4 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1

@miscco miscco merged commit a1a73a8 into NVIDIA:main Jan 30, 2025
101 of 105 checks passed
@bernhardmgruber bernhardmgruber deleted the ref_core_cub_util branch January 30, 2025 10:54
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Jan 30, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
breaking Breaking change thrust For all items related to Thrust.
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants