clean up the cudax __launch_transform code and document its purpose and design #3526

Merged: 4 commits, Feb 6, 2025

ericniebler
Collaborator

Description

I find the __launch_transform code confusing. In this PR, I replace the need to static_cast the result of __launch_transform with a new function, __kernel_transform, so cudax::launch will transform each argument with:

__kernel_transform(__launch_transform(arg))

A large comment block in launch_transform.hpp describes the protocol and explains why two separate functions are needed.
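To make the two-step composition concrete, here is a minimal host-only sketch. It is not the cudax implementation: the namespace `sketch`, the unprefixed names `launch_transform`, `kernel_transform` and `launch`, and the types `buffer_launch_wrapper` and `span_arg` are all hypothetical, and a plain function stands in for the kernel. It only shows the shape of the idea: the first step may return a temporary object that lives for the duration of the launch expression, and the second step converts that object into the type the kernel actually receives.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

namespace sketch {

// Hypothetical kernel-side argument: a trivially copyable pointer/size pair.
struct span_arg {
  int* data;
  std::size_t size;
};

// Hypothetical temporary returned by step 1. It lives until the end of the
// launch expression, so it is the place where side effects could be attached.
struct buffer_launch_wrapper {
  std::vector<int>* owner;
};

// Step 1, default: arguments without a customization pass through by value.
template <class T>
auto launch_transform(T&& arg) { return std::forward<T>(arg); }

// Step 1, customization: a vector is wrapped in a launch-lifetime temporary.
inline buffer_launch_wrapper launch_transform(std::vector<int>& v) {
  return buffer_launch_wrapper{&v};
}

// Step 2, default: the step-1 result is already the kernel argument.
template <class T>
auto kernel_transform(T&& arg) { return std::forward<T>(arg); }

// Step 2, customization: the wrapper becomes the type the kernel takes,
// replacing a static_cast at the call site.
inline span_arg kernel_transform(buffer_launch_wrapper w) {
  return span_arg{w.owner->data(), w.owner->size()};
}

// Stand-in for a kernel (runs on the host here): sums the span, adds a bias.
inline int fake_kernel(span_arg s, int bias) {
  assert(s.data != nullptr || s.size == 0);
  int sum = bias;
  for (std::size_t i = 0; i < s.size; ++i) sum += s.data[i];
  return sum;
}

// Every argument goes through kernel_transform(launch_transform(arg)).
template <class Kernel, class... Args>
auto launch(Kernel kernel, Args&&... args) {
  return kernel(kernel_transform(launch_transform(std::forward<Args>(args)))...);
}

}  // namespace sketch
```

With this sketch, calling `sketch::launch(sketch::fake_kernel, v, 10)` on a `std::vector<int> v{1, 2, 3}` routes `v` through both customizations and passes `10` through both defaults unchanged.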

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

ericniebler requested review from a team as code owners on January 25, 2025, 01:20
@pciolkosz
Contributor

I like these changes.
I was wondering if we should simplify the side effects mechanism as well.
The main concern with the current mechanism is that the post-launch side effects happen in the temporary object's destructor. This means we need to be careful with exceptions in anything using the transform mechanism, because there is a chance post-launch will also throw. I can see I already made a mistake in my copy_bytes implementation, where I am throwing from the internal function. If some sticky error is causing the launch to fail and post-launch has a CUDA call, like a memcpy, that call will fail too, and the program will most likely terminate.

I think we should decide whether post-launch side effects should be skipped if the launch fails. With the current approach they run even on failure, but for the use cases we had in mind, skipping them might be better.
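As an illustration of the "skip on failure" option within a destructor-based design, here is a minimal sketch. It is not taken from cudax: `post_launch_guard` and `run_guarded` are hypothetical names, a counter stands in for the real side effect, and the technique shown is the general C++17 `std::uncaught_exceptions()` idiom rather than anything this PR implements. The destructor performs the post-launch action only when it is not running as part of stack unwinding.

```cpp
#include <cassert>
#include <exception>
#include <stdexcept>

namespace sketch {

// Hypothetical launch-lifetime temporary whose destructor performs the
// post-launch side effect (here: incrementing a counter).
class post_launch_guard {
public:
  explicit post_launch_guard(int& counter)
      : counter_(counter), exceptions_on_entry_(std::uncaught_exceptions()) {}

  post_launch_guard(const post_launch_guard&) = delete;
  post_launch_guard& operator=(const post_launch_guard&) = delete;

  ~post_launch_guard() {
    assert(std::uncaught_exceptions() >= exceptions_on_entry_);
    // If more exceptions are in flight than at construction, this object is
    // being destroyed by unwinding: the launch failed, so skip the action.
    if (std::uncaught_exceptions() == exceptions_on_entry_) {
      ++counter_;
    }
  }

private:
  int& counter_;
  int exceptions_on_entry_;
};

// Returns how many times the post-launch action ran.
inline int run_guarded(bool launch_fails) {
  int post_launch_calls = 0;
  try {
    post_launch_guard guard(post_launch_calls);
    if (launch_fails) {
      throw std::runtime_error("launch failed");  // stand-in for a failed launch
    }
  } catch (const std::runtime_error&) {
    // The failure is swallowed here only so the count can be inspected.
  }
  return post_launch_calls;
}

}  // namespace sketch
```

This only addresses skipping. A post-launch action that itself throws from a destructor remains a problem, because destructors are implicitly noexcept.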
I also wanted to ask: since we are migrating to two separate steps (launch_transform and kernel_transform), would it make sense to change them into the following?

  1. Transform + pre-launch side effects function (__pre_launch_transform ?)
  2. Post-launch side effects function (__post_launch ?), name could also indicate on-fail behavior

This way they stay object member functions and we don't run into the exception limitation that destructors have. The first step always happens, and for the second step we can decide whether it should be skipped if the launch failed.
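A rough sketch of what that proposed shape could look like, under stated assumptions: `logged_arg`, `launch` and `run_two_step` are hypothetical, the member names drop the reserved double-underscore prefix, a lambda stands in for the kernel launch, and "skip post-launch on failure" is just one of the two behaviors under discussion.

```cpp
#include <cassert>
#include <stdexcept>
#include <string>
#include <vector>

namespace sketch {

// Hypothetical argument type following the proposed two-step shape.
struct logged_arg {
  std::vector<std::string>* log;
  int value;

  // Step 1: transform plus pre-launch side effects. Always runs.
  int pre_launch_transform() {
    assert(log != nullptr);
    log->push_back("pre");
    return value;
  }

  // Step 2: post-launch side effects. An ordinary member function, so it can
  // report errors by throwing without the constraints of a destructor.
  void post_launch() { log->push_back("post"); }
};

// The launcher decides the on-failure policy. Here: if the launch throws,
// post_launch is never reached.
template <class Kernel>
void launch(Kernel kernel, logged_arg arg) {
  int kernel_arg = arg.pre_launch_transform();
  kernel(kernel_arg);
  arg.post_launch();
}

// Records the order of events for a succeeding or failing launch.
inline std::vector<std::string> run_two_step(bool launch_fails) {
  std::vector<std::string> log;
  try {
    launch(
        [&](int) {
          if (launch_fails) throw std::runtime_error("launch failed");
          log.push_back("kernel");
        },
        logged_arg{&log, 42});
  } catch (const std::runtime_error&) {
    log.push_back("error");
  }
  return log;
}

}  // namespace sketch
```

Running post-launch on failure as well would be a small change in `launch` (for example a try/catch that calls `post_launch` before rethrowing), which is why the naming question about on-fail behavior matters.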

Contributor

🟨 CI finished in 2h 55m: Pass: 98%/157 | Total: 1d 01h | Avg: 9m 53s | Max: 51m 32s | Hits: 529%/23359
  • 🟨 libcudacxx: Pass: 97%/43 | Total: 8h 08m | Avg: 11m 21s | Max: 34m 04s | Hits: 688%/10065

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  97%/41  | Total:  8h 01m | Avg: 11m 43s | Max: 34m 04s | Hits: 688%/10065 
      🟩 arm64              Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 38s
    🔍 ctk: 12.6 🔍
      🟩 12.0               Pass: 100%/5   | Total: 36m 30s | Avg:  7m 18s | Max: 21m 27s | Hits: 689%/2471  
      🟩 12.5               Pass: 100%/2   | Total: 43m 27s | Avg: 21m 43s | Max: 34m 04s
      🔍 12.6               Pass:  97%/36  | Total:  6h 48m | Avg: 11m 20s | Max: 27m 26s | Hits: 688%/7594  
    🔍 cudacxx: nvcc12.6 🔍
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 07m | Avg: 16m 57s | Max: 23m 26s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 36m 30s | Avg:  7m 18s | Max: 21m 27s | Hits: 689%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 43m 27s | Avg: 21m 43s | Max: 34m 04s
      🔍 nvcc12.6           Pass:  96%/32  | Total:  5h 40m | Avg: 10m 38s | Max: 27m 26s | Hits: 688%/7594  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 57s | Max: 23m 26s
      🔍 nvcc               Pass:  97%/39  | Total:  7h 00m | Avg: 10m 46s | Max: 34m 04s | Hits: 688%/10065 
    🔍 cxx: Clang18 🔍
      🟩 Clang14            Pass: 100%/4   | Total: 16m 34s | Avg:  4m 08s | Max:  4m 27s
      🟩 Clang15            Pass: 100%/2   | Total: 46m 29s | Avg: 23m 14s | Max: 23m 43s
      🟩 Clang16            Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 36s
      🟩 Clang17            Pass: 100%/2   | Total: 28m 46s | Avg: 14m 23s | Max: 24m 20s
      🔍 Clang18            Pass:  87%/8   | Total:  1h 41m | Avg: 12m 42s | Max: 23m 26s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 12s | Avg:  3m 36s | Max:  3m 37s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 19s | Avg:  3m 39s | Max:  3m 45s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 33s | Avg:  3m 46s | Max:  3m 49s
      🟩 GCC11              Pass: 100%/2   | Total: 26m 06s | Avg: 13m 03s | Max: 22m 00s
      🟩 GCC12              Pass: 100%/2   | Total:  7m 49s | Avg:  3m 54s | Max:  4m 08s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 28m | Avg: 11m 04s | Max: 24m 22s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 43m 48s | Avg: 21m 54s | Max: 22m 21s | Hits: 689%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 50m 23s | Avg: 25m 11s | Max: 27m 26s | Hits: 688%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 43m 27s | Avg: 21m 43s | Max: 34m 04s
    🔍 cxx_family: Clang 🔍
      🔍 Clang              Pass:  94%/18  | Total:  3h 22m | Avg: 11m 14s | Max: 24m 20s
      🟩 GCC                Pass: 100%/19  | Total:  2h 28m | Avg:  7m 47s | Max: 24m 22s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 34m | Avg: 23m 32s | Max: 27m 26s | Hits: 688%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total: 43m 27s | Avg: 21m 43s | Max: 34m 04s
    🔍 jobs: Test 🔍
      🟩 Build              Pass: 100%/38  | Total:  6h 40m | Avg: 10m 32s | Max: 34m 04s | Hits: 688%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 18s | Avg: 23m 39s | Max: 24m 22s
      🔍 Test               Pass:  50%/2   | Total: 37m 42s | Avg: 18m 51s | Max: 21m 28s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
    🔍 std: 20 🔍
      🟩 17                 Pass: 100%/21  | Total:  4h 01m | Avg: 11m 29s | Max: 24m 22s | Hits: 689%/7433  
      🔍 20                 Pass:  95%/21  | Total:  4h 04m | Avg: 11m 38s | Max: 34m 04s | Hits: 688%/2632  
    🟨 gpu
      🟨 v100               Pass:  97%/43  | Total:  8h 08m | Avg: 11m 21s | Max: 34m 04s | Hits: 688%/10065 
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 14m 04s | Avg: 14m 04s | Max: 14m 04s
      🟩 90a                Pass: 100%/2   | Total: 16m 43s | Avg:  8m 21s | Max: 12m 42s
    
  • 🟥 python: Pass: 0%/1 | Total: 5m 19s | Avg: 5m 19s | Max: 5m 19s

    🟥 cpu
      🟥 amd64              Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 ctk
      🟥 12.6               Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 cudacxx
      🟥 nvcc12.6           Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 cudacxx_family
      🟥 nvcc               Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 cxx
      🟥 GCC13              Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 cxx_family
      🟥 GCC                Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 gpu
      🟥 v100               Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    🟥 jobs
      🟥 Test               Pass:   0%/1   | Total:  5m 19s | Avg:  5m 19s | Max:  5m 19s
    
  • 🟩 cub: Pass: 100%/44 | Total: 8h 05m | Avg: 11m 01s | Max: 51m 32s | Hits: 540%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  7h 55m | Avg: 11m 18s | Max: 51m 32s | Hits: 540%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 15s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 47m 00s | Avg:  9m 24s | Max: 25m 50s | Hits: 540%/888   
      🟩 12.5               Pass: 100%/2   | Total: 18m 58s | Avg:  9m 29s | Max:  9m 53s
      🟩 12.6               Pass: 100%/37  | Total:  6h 59m | Avg: 11m 19s | Max: 51m 32s | Hits: 540%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 40s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 47m 00s | Avg:  9m 24s | Max: 25m 50s | Hits: 540%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 58s | Avg:  9m 29s | Max:  9m 53s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 49m | Avg: 11m 42s | Max: 51m 32s | Hits: 540%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 12s | Avg:  4m 36s | Max:  4m 40s
      🟩 nvcc               Pass: 100%/42  | Total:  7h 55m | Avg: 11m 19s | Max: 51m 32s | Hits: 540%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 29s | Avg:  5m 22s | Max:  5m 40s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 12s | Avg:  5m 36s | Max:  5m 46s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  5m 50s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 50s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 16m | Avg: 10m 55s | Max: 30m 49s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  6m 00s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 27s | Avg:  5m 27s | Max:  5m 27s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 40s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 19s | Avg:  5m 39s | Max:  5m 44s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 00s
      🟩 GCC12              Pass: 100%/4   | Total: 36m 07s | Avg:  9m 01s | Max: 19m 45s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 20m | Avg: 17m 31s | Max: 51m 32s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 59s | Avg: 25m 29s | Max: 25m 50s | Hits: 540%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 55m 38s | Avg: 27m 49s | Max: 29m 26s | Hits: 540%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 58s | Avg:  9m 29s | Max:  9m 53s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 12m | Avg:  7m 46s | Max: 30m 49s
      🟩 GCC                Pass: 100%/21  | Total:  3h 47m | Avg: 10m 49s | Max: 51m 32s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 46m | Avg: 26m 39s | Max: 29m 26s | Hits: 540%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 58s | Avg:  9m 29s | Max:  9m 53s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 19m 45s
      🟩 v100               Pass: 100%/42  | Total:  7h 40m | Avg: 10m 58s | Max: 51m 32s | Hits: 540%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 55m | Avg:  7m 59s | Max: 29m 26s | Hits: 540%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 22m 17s | Avg: 22m 17s | Max: 22m 17s
      🟩 GraphCapture       Pass: 100%/1   | Total: 19m 19s | Avg: 19m 19s | Max: 19m 19s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 15m | Avg: 25m 13s | Max: 30m 49s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 11m | Avg: 35m 58s | Max: 51m 32s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 24m 18s | Avg: 12m 09s | Max: 19m 45s
      🟩 90a                Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 00m | Avg:  9m 02s | Max: 29m 26s | Hits: 540%/2664  
      🟩 20                 Pass: 100%/24  | Total:  5h 04m | Avg: 12m 40s | Max: 51m 32s | Hits: 540%/888   
    
  • 🟩 thrust: Pass: 100%/43 | Total: 6h 51m | Avg: 9m 34s | Max: 35m 53s | Hits: 365%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 39s | Avg:  9m 19s | Max: 12m 32s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 42m | Avg:  9m 48s | Max: 35m 53s | Hits: 365%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 52s | Avg:  4m 56s | Max:  5m 18s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 45m 18s | Avg:  9m 03s | Max: 24m 57s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 30m 09s | Avg: 15m 04s | Max: 15m 25s
      🟩 12.6               Pass: 100%/36  | Total:  5h 36m | Avg:  9m 20s | Max: 35m 53s | Hits: 365%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 56s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 45m 18s | Avg:  9m 03s | Max: 24m 57s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 30m 09s | Avg: 15m 04s | Max: 15m 25s
      🟩 nvcc12.6           Pass: 100%/34  | Total:  5h 26m | Avg:  9m 36s | Max: 35m 53s | Hits: 365%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 44s | Avg:  4m 52s | Max:  4m 56s
      🟩 nvcc               Pass: 100%/41  | Total:  6h 42m | Avg:  9m 48s | Max: 35m 53s | Hits: 365%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 15s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  6m 06s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 50s | Avg:  5m 55s | Max:  5m 57s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 56s | Avg:  5m 58s | Max:  6m 06s
      🟩 Clang18            Pass: 100%/7   | Total: 52m 50s | Avg:  7m 32s | Max: 19m 40s
      🟩 GCC7               Pass: 100%/2   | Total: 10m 07s | Avg:  5m 03s | Max:  5m 14s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 21s | Avg:  5m 21s | Max:  5m 21s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 08s | Avg:  5m 34s | Max:  5m 37s
      🟩 GCC10              Pass: 100%/2   | Total: 10m 41s | Avg:  5m 20s | Max:  5m 30s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 40s | Avg:  5m 50s | Max:  6m 06s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 14s | Avg:  6m 07s | Max:  6m 25s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 04m | Avg:  8m 07s | Max: 15m 05s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 51m 00s | Avg: 25m 30s | Max: 26m 03s | Hits: 365%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 36m | Avg: 32m 00s | Max: 35m 53s | Hits: 365%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 30m 09s | Avg: 15m 04s | Max: 15m 25s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 48m | Avg:  6m 23s | Max: 19m 40s
      🟩 GCC                Pass: 100%/19  | Total:  2h 06m | Avg:  6m 38s | Max: 15m 05s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 27m | Avg: 29m 24s | Max: 35m 53s | Hits: 365%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 09s | Avg: 15m 04s | Max: 15m 25s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  6h 51m | Avg:  9m 34s | Max: 35m 53s | Hits: 365%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 12m | Avg:  8m 26s | Max: 32m 26s | Hits: 365%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 52m 25s | Avg: 17m 28s | Max: 35m 53s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total: 47m 17s | Avg: 15m 45s | Max: 19m 40s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 07m | Avg:  9m 22s | Max: 32m 26s | Hits: 365%/5532  
      🟩 20                 Pass: 100%/21  | Total:  3h 25m | Avg:  9m 47s | Max: 35m 53s | Hits: 365%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 05m | Avg: 6m 17s | Max: 23m 18s | Hits: 288%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 51m | Avg:  6m 58s | Max: 23m 18s | Hits: 288%/522   
      🟩 arm64              Pass: 100%/4   | Total: 14m 23s | Avg:  3m 35s | Max:  3m 47s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 15s | Avg: 10m 15s | Max: 10m 15s | Hits: 288%/261   
      🟩 12.5               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 20s
      🟩 12.6               Pass: 100%/17  | Total:  1h 43m | Avg:  6m 03s | Max: 23m 18s | Hits: 288%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 15s | Avg: 10m 15s | Max: 10m 15s | Hits: 288%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 20s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 43m | Avg:  6m 03s | Max: 23m 18s | Hits: 288%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 05m | Avg:  6m 17s | Max: 23m 18s | Hits: 288%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 07s | Avg:  4m 07s | Max:  4m 07s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 44s | Avg:  6m 41s | Max: 15m 54s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC12              Pass: 100%/2   | Total: 27m 47s | Avg: 13m 53s | Max: 23m 18s
      🟩 GCC13              Pass: 100%/4   | Total: 14m 31s | Avg:  3m 37s | Max:  3m 47s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 15s | Avg: 10m 15s | Max: 10m 15s | Hits: 288%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 43s | Avg:  9m 43s | Max:  9m 43s | Hits: 288%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 20s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 43m 07s | Avg:  5m 23s | Max: 15m 54s
      🟩 GCC                Pass: 100%/8   | Total: 50m 14s | Avg:  6m 16s | Max: 23m 18s
      🟩 MSVC               Pass: 100%/2   | Total: 19m 58s | Avg:  9m 59s | Max: 10m 15s | Hits: 288%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 20s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 05m | Avg:  6m 17s | Max: 23m 18s | Hits: 288%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 26m | Avg:  4m 48s | Max: 10m 15s | Hits: 288%/522   
      🟩 Test               Pass: 100%/2   | Total: 39m 12s | Avg: 19m 36s | Max: 23m 18s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 31s | Avg:  3m 31s | Max:  3m 31s
      🟩 90a                Pass: 100%/1   | Total:  3m 30s | Avg:  3m 30s | Max:  3m 30s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 17m 00s | Avg:  4m 15s | Max:  6m 20s
      🟩 20                 Pass: 100%/16  | Total:  1h 48m | Avg:  6m 48s | Max: 23m 18s | Hits: 288%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 25m 24s | Avg: 6m 21s | Max: 8m 58s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 25m 24s | Avg:  6m 21s | Max:  8m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  6m 20s
      🟩 12.6               Pass: 100%/2   | Total: 13m 59s | Avg:  6m 59s | Max:  8m 58s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 11m 25s | Avg:  5m 42s | Max:  6m 20s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 13m 59s | Avg:  6m 59s | Max:  8m 58s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 25m 24s | Avg:  6m 21s | Max:  8m 58s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  6m 20s | Avg:  6m 20s | Max:  6m 20s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 05s | Avg:  5m 05s | Max:  5m 05s
      🟩 GCC13              Pass: 100%/1   | Total:  8m 58s | Avg:  8m 58s | Max:  8m 58s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  6m 20s
      🟩 GCC                Pass: 100%/2   | Total: 14m 03s | Avg:  7m 01s | Max:  8m 58s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 25m 24s | Avg:  6m 21s | Max:  8m 58s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 25m 24s | Avg:  6m 21s | Max:  8m 58s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 57s | Avg: 4m 58s | Max: 7m 38s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  7m 38s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 19s | Avg:  2m 19s | Max:  2m 19s
      🟩 Test               Pass: 100%/1   | Total:  7m 38s | Avg:  7m 38s | Max:  7m 38s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 157)

# Runner
110 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Contributor

🟩 CI finished in 1h 48m: Pass: 100%/157 | Total: 1d 00h | Avg: 9m 23s | Max: 50m 01s | Hits: 531%/23359
  • 🟩 cub: Pass: 100%/44 | Total: 7h 47m | Avg: 10m 37s | Max: 34m 55s | Hits: 540%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  7h 37m | Avg: 10m 53s | Max: 34m 55s | Hits: 540%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  9m 50s | Avg:  4m 55s | Max:  5m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 46m 53s | Avg:  9m 22s | Max: 24m 55s | Hits: 540%/888   
      🟩 12.5               Pass: 100%/2   | Total: 18m 48s | Avg:  9m 24s | Max:  9m 50s
      🟩 12.6               Pass: 100%/37  | Total:  6h 41m | Avg: 10m 51s | Max: 34m 55s | Hits: 540%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 41s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 53s | Avg:  9m 22s | Max: 24m 55s | Hits: 540%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 18m 48s | Avg:  9m 24s | Max:  9m 50s
      🟩 nvcc12.6           Pass: 100%/35  | Total:  6h 32m | Avg: 11m 13s | Max: 34m 55s | Hits: 540%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 17s | Avg:  4m 38s | Max:  4m 41s
      🟩 nvcc               Pass: 100%/42  | Total:  7h 38m | Avg: 10m 54s | Max: 34m 55s | Hits: 540%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 22m 03s | Avg:  5m 30s | Max:  5m 42s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 26s | Avg:  5m 43s | Max:  5m 46s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 18s | Avg:  5m 39s | Max:  5m 47s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 32s | Avg:  5m 46s | Max:  5m 53s
      🟩 Clang18            Pass: 100%/7   | Total:  1h 28m | Avg: 12m 39s | Max: 34m 55s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 42s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 35s | Avg:  5m 35s | Max:  5m 35s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 03s | Avg:  5m 31s | Max:  5m 33s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 13s | Avg:  5m 36s | Max:  5m 42s
      🟩 GCC11              Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 04s
      🟩 GCC12              Pass: 100%/4   | Total: 35m 27s | Avg:  8m 51s | Max: 19m 21s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 54m | Avg: 14m 18s | Max: 31m 06s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 03s | Avg: 25m 01s | Max: 25m 08s | Hits: 540%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 53m 05s | Avg: 26m 32s | Max: 26m 59s | Hits: 540%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 18m 48s | Avg:  9m 24s | Max:  9m 50s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 24m | Avg:  8m 31s | Max: 34m 55s
      🟩 GCC                Pass: 100%/21  | Total:  3h 20m | Avg:  9m 33s | Max: 31m 06s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 43m | Avg: 25m 47s | Max: 26m 59s | Hits: 540%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total: 18m 48s | Avg:  9m 24s | Max:  9m 50s
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 23m 28s | Avg: 11m 44s | Max: 19m 21s
      🟩 v100               Pass: 100%/42  | Total:  7h 24m | Avg: 10m 34s | Max: 34m 55s | Hits: 540%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  4h 53m | Avg:  7m 56s | Max: 26m 59s | Hits: 540%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 08s | Avg: 24m 08s | Max: 24m 08s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 40s | Avg: 16m 40s | Max: 16m 40s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 07m | Avg: 22m 22s | Max: 27m 36s
      🟩 TestGPU            Pass: 100%/2   | Total:  1h 06m | Avg: 33m 00s | Max: 34m 55s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 23m 28s | Avg: 11m 44s | Max: 19m 21s
      🟩 90a                Pass: 100%/1   | Total:  4m 44s | Avg:  4m 44s | Max:  4m 44s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  2h 56m | Avg:  8m 49s | Max: 26m 06s | Hits: 540%/2664  
      🟩 20                 Pass: 100%/24  | Total:  4h 51m | Avg: 12m 07s | Max: 34m 55s | Hits: 540%/888   
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 6h 35m | Avg: 9m 11s | Max: 25m 18s | Hits: 688%/10065

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 28m | Avg:  9m 28s | Max: 25m 18s | Hits: 688%/10065 
      🟩 arm64              Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 44s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 34m 57s | Avg:  6m 59s | Max: 20m 19s | Hits: 689%/2471  
      🟩 12.5               Pass: 100%/2   | Total: 17m 33s | Avg:  8m 46s | Max:  9m 07s
      🟩 12.6               Pass: 100%/36  | Total:  5h 43m | Avg:  9m 31s | Max: 25m 18s | Hits: 688%/7594  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 05m | Avg: 16m 20s | Max: 19m 26s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 34m 57s | Avg:  6m 59s | Max: 20m 19s | Hits: 689%/2471  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 33s | Avg:  8m 46s | Max:  9m 07s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  4h 37m | Avg:  8m 40s | Max: 25m 18s | Hits: 688%/7594  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 05m | Avg: 16m 20s | Max: 19m 26s
      🟩 nvcc               Pass: 100%/39  | Total:  5h 30m | Avg:  8m 28s | Max: 25m 18s | Hits: 688%/10065 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 33m 06s | Avg:  8m 16s | Max: 20m 55s
      🟩 Clang15            Pass: 100%/2   | Total:  8m 39s | Avg:  4m 19s | Max:  4m 20s
      🟩 Clang16            Pass: 100%/2   | Total:  8m 34s | Avg:  4m 17s | Max:  4m 26s
      🟩 Clang17            Pass: 100%/2   | Total:  8m 15s | Avg:  4m 07s | Max:  4m 11s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 41m | Avg: 12m 40s | Max: 23m 43s
      🟩 GCC7               Pass: 100%/2   | Total:  6m 56s | Avg:  3m 28s | Max:  3m 43s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 18s | Avg:  3m 39s | Max:  3m 49s
      🟩 GCC10              Pass: 100%/2   | Total:  7m 58s | Avg:  3m 59s | Max:  4m 01s
      🟩 GCC11              Pass: 100%/2   | Total:  7m 30s | Avg:  3m 45s | Max:  4m 01s
      🟩 GCC12              Pass: 100%/2   | Total:  7m 46s | Avg:  3m 53s | Max:  4m 02s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 24m | Avg: 10m 36s | Max: 25m 18s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 42m 23s | Avg: 21m 11s | Max: 22m 04s | Hits: 689%/4952  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 49m 27s | Avg: 24m 43s | Max: 24m 51s | Hits: 688%/5113  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 33s | Avg:  8m 46s | Max:  9m 07s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  2h 39m | Avg:  8m 53s | Max: 23m 43s
      🟩 GCC                Pass: 100%/19  | Total:  2h 06m | Avg:  6m 38s | Max: 25m 18s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 31m | Avg: 22m 57s | Max: 24m 51s | Hits: 688%/10065 
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 33s | Avg:  8m 46s | Max:  9m 07s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  6h 35m | Avg:  9m 11s | Max: 25m 18s | Hits: 688%/10065 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  5h 03m | Avg:  7m 58s | Max: 24m 51s | Hits: 688%/10065 
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 40s | Avg: 23m 50s | Max: 25m 18s
      🟩 Test               Pass: 100%/2   | Total: 42m 26s | Avg: 21m 13s | Max: 23m 43s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 21s | Avg: 13m 21s | Max: 13m 21s
      🟩 90a                Pass: 100%/2   | Total: 17m 38s | Avg:  8m 49s | Max: 13m 36s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  2h 58m | Avg:  8m 30s | Max: 25m 18s | Hits: 689%/7433  
      🟩 20                 Pass: 100%/21  | Total:  3h 34m | Avg: 10m 13s | Max: 24m 36s | Hits: 688%/2632  
    
  • 🟩 thrust: Pass: 100%/43 | Total: 6h 47m | Avg: 9m 28s | Max: 36m 24s | Hits: 365%/9220

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 18m 00s | Avg:  9m 00s | Max: 11m 11s
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 37m | Avg:  9m 41s | Max: 36m 24s | Hits: 365%/9220  
      🟩 arm64              Pass: 100%/2   | Total:  9m 57s | Avg:  4m 58s | Max:  5m 17s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 46m 36s | Avg:  9m 19s | Max: 25m 21s | Hits: 365%/1844  
      🟩 12.5               Pass: 100%/2   | Total: 28m 40s | Avg: 14m 20s | Max: 14m 23s
      🟩 12.6               Pass: 100%/36  | Total:  5h 32m | Avg:  9m 13s | Max: 36m 24s | Hits: 365%/7376  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 34s
      🟩 nvcc12.0           Pass: 100%/5   | Total: 46m 36s | Avg:  9m 19s | Max: 25m 21s | Hits: 365%/1844  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 28m 40s | Avg: 14m 20s | Max: 14m 23s
      🟩 nvcc12.6           Pass: 100%/34  | Total:  5h 21m | Avg:  9m 27s | Max: 36m 24s | Hits: 365%/7376  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 34s
      🟩 nvcc               Pass: 100%/41  | Total:  6h 36m | Avg:  9m 40s | Max: 36m 24s | Hits: 365%/9220  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 08s | Avg:  5m 17s | Max:  5m 23s
      🟩 Clang15            Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
      🟩 Clang16            Pass: 100%/2   | Total: 11m 35s | Avg:  5m 47s | Max:  6m 02s
      🟩 Clang17            Pass: 100%/2   | Total: 11m 34s | Avg:  5m 47s | Max:  5m 48s
      🟩 Clang18            Pass: 100%/7   | Total: 56m 14s | Avg:  8m 02s | Max: 21m 27s
      🟩 GCC7               Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  6m 10s
      🟩 GCC8               Pass: 100%/1   | Total:  5m 46s | Avg:  5m 46s | Max:  5m 46s
      🟩 GCC9               Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  6m 08s
      🟩 GCC10              Pass: 100%/2   | Total: 11m 21s | Avg:  5m 40s | Max:  5m 42s
      🟩 GCC11              Pass: 100%/2   | Total: 12m 07s | Avg:  6m 03s | Max:  6m 17s
      🟩 GCC12              Pass: 100%/2   | Total: 12m 06s | Avg:  6m 03s | Max:  6m 14s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 02m | Avg:  7m 45s | Max: 13m 12s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 50m 16s | Avg: 25m 08s | Max: 25m 21s | Hits: 365%/3688  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  1h 30m | Avg: 30m 13s | Max: 36m 24s | Hits: 365%/5532  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 28m 40s | Avg: 14m 20s | Max: 14m 23s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 51m | Avg:  6m 33s | Max: 21m 27s
      🟩 GCC                Pass: 100%/19  | Total:  2h 06m | Avg:  6m 39s | Max: 13m 12s
      🟩 MSVC               Pass: 100%/5   | Total:  2h 20m | Avg: 28m 11s | Max: 36m 24s | Hits: 365%/9220  
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 40s | Avg: 14m 20s | Max: 14m 23s
    🟩 gpu
      🟩 v100               Pass: 100%/43  | Total:  6h 47m | Avg:  9m 28s | Max: 36m 24s | Hits: 365%/9220  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 09m | Avg:  8m 21s | Max: 29m 29s | Hits: 365%/7376  
      🟩 TestCPU            Pass: 100%/3   | Total: 52m 35s | Avg: 17m 31s | Max: 36m 24s | Hits: 365%/1844  
      🟩 TestGPU            Pass: 100%/3   | Total: 45m 50s | Avg: 15m 16s | Max: 21m 27s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 00m | Avg:  9m 02s | Max: 25m 21s | Hits: 365%/5532  
      🟩 20                 Pass: 100%/21  | Total:  3h 28m | Avg:  9m 56s | Max: 36m 24s | Hits: 365%/3688  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 02m | Avg: 6m 07s | Max: 24m 17s | Hits: 388%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 51m | Avg:  6m 57s | Max: 24m 17s | Hits: 388%/522   
      🟩 arm64              Pass: 100%/4   | Total: 11m 03s | Avg:  2m 45s | Max:  2m 47s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  9m 29s | Avg:  9m 29s | Max:  9m 29s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 50s
      🟩 12.6               Pass: 100%/17  | Total:  1h 41m | Avg:  5m 58s | Max: 24m 17s | Hits: 388%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  9m 29s | Avg:  9m 29s | Max:  9m 29s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 50s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 41m | Avg:  5m 58s | Max: 24m 17s | Hits: 388%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 02m | Avg:  6m 07s | Max: 24m 17s | Hits: 388%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 27s | Avg:  3m 27s | Max:  3m 27s
      🟩 Clang18            Pass: 100%/4   | Total: 33m 15s | Avg:  8m 18s | Max: 24m 17s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 11s | Avg:  3m 11s | Max:  3m 11s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 17s | Avg:  3m 17s | Max:  3m 17s
      🟩 GCC12              Pass: 100%/2   | Total: 26m 56s | Avg: 13m 28s | Max: 23m 42s
      🟩 GCC13              Pass: 100%/4   | Total: 11m 13s | Avg:  2m 48s | Max:  2m 55s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 29s | Avg:  9m 29s | Max:  9m 29s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 07s | Avg: 10m 07s | Max: 10m 07s | Hits: 388%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 50s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 46m 52s | Avg:  5m 51s | Max: 24m 17s
      🟩 GCC                Pass: 100%/8   | Total: 44m 37s | Avg:  5m 34s | Max: 23m 42s
      🟩 MSVC               Pass: 100%/2   | Total: 19m 36s | Avg:  9m 48s | Max: 10m 07s | Hits: 388%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 50s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  2h 02m | Avg:  6m 07s | Max: 24m 17s | Hits: 388%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 14m | Avg:  4m 08s | Max: 10m 07s | Hits: 388%/522   
      🟩 Test               Pass: 100%/2   | Total: 47m 59s | Avg: 23m 59s | Max: 24m 17s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 47s | Avg:  2m 47s | Max:  2m 47s
      🟩 90a                Pass: 100%/1   | Total:  2m 55s | Avg:  2m 55s | Max:  2m 55s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 53s | Avg:  3m 28s | Max:  5m 33s
      🟩 20                 Pass: 100%/16  | Total:  1h 48m | Avg:  6m 47s | Max: 24m 17s | Hits: 388%/522   
    
  • 🟩 cccl: Pass: 100%/4 | Total: 21m 03s | Avg: 5m 15s | Max: 5m 55s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 55s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  5m 25s
      🟩 12.6               Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 55s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 10m 26s | Avg:  5m 13s | Max:  5m 25s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 37s | Avg:  5m 18s | Max:  5m 55s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 55s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  5m 01s | Avg:  5m 01s | Max:  5m 01s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s
      🟩 GCC12              Pass: 100%/1   | Total:  5m 25s | Avg:  5m 25s | Max:  5m 25s
      🟩 GCC13              Pass: 100%/1   | Total:  5m 55s | Avg:  5m 55s | Max:  5m 55s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total:  9m 43s | Avg:  4m 51s | Max:  5m 01s
      🟩 GCC                Pass: 100%/2   | Total: 11m 20s | Avg:  5m 40s | Max:  5m 55s
    🟩 gpu
      🟩 v100               Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 55s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 55s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 35s | Avg: 5m 17s | Max: 8m 27s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 35s | Avg:  5m 17s | Max:  8m 27s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 08s | Avg:  2m 08s | Max:  2m 08s
      🟩 Test               Pass: 100%/1   | Total:  8m 27s | Avg:  8m 27s | Max:  8m 27s
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 157)

# Runner
110 linux-amd64-cpu16
21 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

Contributor

github-actions bot commented Feb 6, 2025

🟩 CI finished in 1h 05m: Pass: 100%/155 | Total: 1d 01h | Avg: 9m 50s | Max: 33m 44s | Hits: 87%/241925
  • 🟩 cub: Pass: 100%/44 | Total: 8h 11m | Avg: 11m 09s | Max: 33m 09s | Hits: 92%/52320

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  8h 00m | Avg: 11m 25s | Max: 33m 09s | Hits:  92%/49888 
      🟩 arm64              Pass: 100%/2   | Total: 11m 10s | Avg:  5m 35s | Max:  5m 54s | Hits:  99%/2432  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 48m 53s | Avg:  9m 46s | Max: 26m 31s | Hits:  84%/5914  
      🟩 12.5               Pass: 100%/2   | Total: 24m 57s | Avg: 12m 28s | Max: 12m 55s | Hits:  98%/2250  
      🟩 12.8               Pass: 100%/37  | Total:  6h 57m | Avg: 11m 16s | Max: 33m 09s | Hits:  93%/44156 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  4m 48s | Hits:  99%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 48m 53s | Avg:  9m 46s | Max: 26m 31s | Hits:  84%/5914  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 24m 57s | Avg: 12m 28s | Max: 12m 55s | Hits:  98%/2250  
      🟩 nvcc12.8           Pass: 100%/35  | Total:  6h 47m | Avg: 11m 39s | Max: 33m 09s | Hits:  93%/42052 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  9m 35s | Avg:  4m 47s | Max:  4m 48s | Hits:  99%/2104  
      🟩 nvcc               Pass: 100%/42  | Total:  8h 01m | Avg: 11m 28s | Max: 33m 09s | Hits:  92%/50216 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 23m 25s | Avg:  5m 51s | Max:  6m 25s | Hits:  99%/4872  
      🟩 Clang15            Pass: 100%/2   | Total: 12m 09s | Avg:  6m 04s | Max:  6m 21s | Hits:  99%/2432  
      🟩 Clang16            Pass: 100%/2   | Total: 12m 43s | Avg:  6m 21s | Max:  6m 27s | Hits:  99%/2432  
      🟩 Clang17            Pass: 100%/2   | Total: 12m 36s | Avg:  6m 18s | Max:  6m 21s | Hits:  99%/2432  
      🟩 Clang18            Pass: 100%/7   | Total:  1h 11m | Avg: 10m 16s | Max: 23m 03s | Hits:  99%/8184  
      🟩 GCC7               Pass: 100%/2   | Total: 11m 36s | Avg:  5m 48s | Max:  6m 00s | Hits:  99%/2436  
      🟩 GCC8               Pass: 100%/1   | Total:  6m 06s | Avg:  6m 06s | Max:  6m 06s | Hits:  99%/1218  
      🟩 GCC9               Pass: 100%/2   | Total: 12m 58s | Avg:  6m 29s | Max:  6m 53s | Hits:  99%/2436  
      🟩 GCC10              Pass: 100%/2   | Total: 12m 44s | Avg:  6m 22s | Max:  6m 25s | Hits:  99%/2436  
      🟩 GCC11              Pass: 100%/2   | Total: 13m 01s | Avg:  6m 30s | Max:  6m 44s | Hits:  99%/2432  
      🟩 GCC12              Pass: 100%/2   | Total: 13m 15s | Avg:  6m 37s | Max:  6m 52s | Hits:  99%/2432  
      🟩 GCC13              Pass: 100%/10  | Total:  2h 28m | Avg: 14m 51s | Max: 26m 02s | Hits:  99%/12160 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 54m 15s | Avg: 27m 07s | Max: 27m 44s | Hits:  15%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  1h 00m | Avg: 30m 28s | Max: 33m 09s | Hits:  15%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 24m 57s | Avg: 12m 28s | Max: 12m 55s | Hits:  98%/2250  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  2h 12m | Avg:  7m 48s | Max: 23m 03s | Hits:  99%/20352 
      🟩 GCC                Pass: 100%/21  | Total:  3h 38m | Avg: 10m 23s | Max: 26m 02s | Hits:  99%/25550 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 55m | Avg: 28m 48s | Max: 33m 09s | Hits:  15%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total: 24m 57s | Avg: 12m 28s | Max: 12m 55s | Hits:  98%/2250  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 30m 51s | Avg: 15m 25s | Max: 26m 02s | Hits:  99%/2432  
      🟩 rtx2080            Pass: 100%/34  | Total:  5h 12m | Avg:  9m 10s | Max: 33m 09s | Hits:  90%/40160 
      🟩 rtxa6000           Pass: 100%/8   | Total:  2h 28m | Avg: 18m 32s | Max: 25m 54s | Hits:  99%/9728  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 30m | Avg:  8m 55s | Max: 33m 09s | Hits:  91%/43808 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 23m 03s | Avg: 23m 03s | Max: 23m 03s | Hits:  99%/1216  
      🟩 GraphCapture       Pass: 100%/1   | Total: 18m 22s | Avg: 18m 22s | Max: 18m 22s | Hits:  99%/1216  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 14m | Avg: 24m 59s | Max: 26m 02s | Hits:  99%/3648  
      🟩 TestGPU            Pass: 100%/2   | Total: 44m 36s | Avg: 22m 18s | Max: 23m 17s | Hits:  99%/2432  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 30m 51s | Avg: 15m 25s | Max: 26m 02s | Hits:  99%/2432  
      🟩 90;90a;100         Pass: 100%/1   | Total:  7m 45s | Avg:  7m 45s | Max:  7m 45s | Hits:  99%/1216  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 13m | Avg:  9m 39s | Max: 27m 48s | Hits:  88%/23559 
      🟩 20                 Pass: 100%/24  | Total:  4h 58m | Avg: 12m 25s | Max: 33m 09s | Hits:  96%/28761 
    
  • 🟩 thrust: Pass: 100%/43 | Total: 6h 32m | Avg: 9m 07s | Max: 33m 44s | Hits: 96%/79625

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 17m 50s | Avg:  8m 55s | Max: 11m 55s | Hits:  99%/3706  
    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  6h 22m | Avg:  9m 19s | Max: 33m 44s | Hits:  96%/75920 
      🟩 arm64              Pass: 100%/2   | Total: 10m 13s | Avg:  5m 06s | Max:  5m 27s | Hits:  99%/3705  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total: 43m 36s | Avg:  8m 43s | Max: 22m 53s | Hits:  93%/9256  
      🟩 12.5               Pass: 100%/2   | Total: 28m 04s | Avg: 14m 02s | Max: 14m 04s | Hits:  99%/3704  
      🟩 12.8               Pass: 100%/36  | Total:  5h 20m | Avg:  8m 54s | Max: 33m 44s | Hits:  96%/66665 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 40s | Hits: 100%/3704  
      🟩 nvcc12.0           Pass: 100%/5   | Total: 43m 36s | Avg:  8m 43s | Max: 22m 53s | Hits:  93%/9256  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 28m 04s | Avg: 14m 02s | Max: 14m 04s | Hits:  99%/3704  
      🟩 nvcc12.8           Pass: 100%/34  | Total:  5h 10m | Avg:  9m 07s | Max: 33m 44s | Hits:  96%/62961 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 33s | Avg:  5m 16s | Max:  5m 40s | Hits: 100%/3704  
      🟩 nvcc               Pass: 100%/41  | Total:  6h 21m | Avg:  9m 18s | Max: 33m 44s | Hits:  96%/75921 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 21m 34s | Avg:  5m 23s | Max:  6m 01s | Hits: 100%/7408  
      🟩 Clang15            Pass: 100%/2   | Total: 11m 11s | Avg:  5m 35s | Max:  5m 38s | Hits: 100%/3704  
      🟩 Clang16            Pass: 100%/2   | Total: 11m 53s | Avg:  5m 56s | Max:  5m 58s | Hits: 100%/3704  
      🟩 Clang17            Pass: 100%/2   | Total: 11m 38s | Avg:  5m 49s | Max:  6m 02s | Hits: 100%/3704  
      🟩 Clang18            Pass: 100%/7   | Total: 46m 14s | Avg:  6m 36s | Max: 10m 58s | Hits: 100%/12964 
      🟩 GCC7               Pass: 100%/2   | Total: 11m 23s | Avg:  5m 41s | Max:  5m 59s | Hits:  99%/3706  
      🟩 GCC8               Pass: 100%/1   | Total:  5m 57s | Avg:  5m 57s | Max:  5m 57s | Hits:  99%/1853  
      🟩 GCC9               Pass: 100%/2   | Total: 10m 50s | Avg:  5m 25s | Max:  5m 35s | Hits:  99%/3706  
      🟩 GCC10              Pass: 100%/2   | Total: 11m 47s | Avg:  5m 53s | Max:  5m 56s | Hits:  99%/3706  
      🟩 GCC11              Pass: 100%/2   | Total: 12m 19s | Avg:  6m 09s | Max:  6m 18s | Hits:  99%/3706  
      🟩 GCC12              Pass: 100%/2   | Total: 11m 42s | Avg:  5m 51s | Max:  5m 54s | Hits:  99%/3706  
      🟩 GCC13              Pass: 100%/8   | Total:  1h 03m | Avg:  7m 55s | Max: 12m 01s | Hits:  99%/14824 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 48m 04s | Avg: 24m 02s | Max: 25m 11s | Hits:  69%/3692  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  1h 26m | Avg: 28m 45s | Max: 33m 44s | Hits:  69%/5538  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 28m 04s | Avg: 14m 02s | Max: 14m 04s | Hits:  99%/3704  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  1h 42m | Avg:  6m 01s | Max: 10m 58s | Hits: 100%/31484 
      🟩 GCC                Pass: 100%/19  | Total:  2h 07m | Avg:  6m 42s | Max: 12m 01s | Hits:  99%/35207 
      🟩 MSVC               Pass: 100%/5   | Total:  2h 14m | Avg: 26m 51s | Max: 33m 44s | Hits:  69%/9230  
      🟩 NVHPC              Pass: 100%/2   | Total: 28m 04s | Avg: 14m 02s | Max: 14m 04s | Hits:  99%/3704  
    🟩 gpu
      🟩 rtx2080            Pass: 100%/33  | Total:  4h 22m | Avg:  7m 56s | Max: 26m 22s | Hits:  97%/61112 
      🟩 rtx4090            Pass: 100%/10  | Total:  2h 10m | Avg: 13m 00s | Max: 33m 44s | Hits:  93%/18513 
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  5h 06m | Avg:  8m 17s | Max: 26m 22s | Hits:  96%/68516 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 37s | Avg: 16m 52s | Max: 33m 44s | Hits:  89%/5551  
      🟩 TestGPU            Pass: 100%/3   | Total: 34m 54s | Avg: 11m 38s | Max: 12m 01s | Hits:  99%/5558  
    🟩 sm
      🟩 90;90a;100         Pass: 100%/1   | Total:  6m 53s | Avg:  6m 53s | Max:  6m 53s | Hits:  99%/1853  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total:  3h 01m | Avg:  9m 03s | Max: 26m 22s | Hits:  95%/37031 
      🟩 20                 Pass: 100%/21  | Total:  3h 13m | Avg:  9m 12s | Max: 33m 44s | Hits:  97%/38888 
    
  • 🟩 libcudacxx: Pass: 100%/41 | Total: 7h 41m | Avg: 11m 14s | Max: 29m 55s | Hits: 78%/99620

    🟩 cpu
      🟩 amd64              Pass: 100%/39  | Total:  6h 58m | Avg: 10m 43s | Max: 29m 55s | Hits:  80%/93995 
      🟩 arm64              Pass: 100%/2   | Total: 43m 03s | Avg: 21m 31s | Max: 22m 10s | Hits:  31%/5625  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 07m | Avg: 13m 29s | Max: 21m 55s | Hits:  71%/13598 
      🟩 12.5               Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 37s | Hits:  98%/5569  
      🟩 12.8               Pass: 100%/34  | Total:  6h 16m | Avg: 11m 04s | Max: 29m 55s | Hits:  77%/80453 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 39m 18s | Avg: 19m 39s | Max: 20m 21s | Hits:  26%/5589  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 07m | Avg: 13m 29s | Max: 21m 55s | Hits:  71%/13598 
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 37s | Hits:  98%/5569  
      🟩 nvcc12.8           Pass: 100%/32  | Total:  5h 37m | Avg: 10m 32s | Max: 29m 55s | Hits:  81%/74864 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 18s | Avg: 19m 39s | Max: 20m 21s | Hits:  26%/5589  
      🟩 nvcc               Pass: 100%/39  | Total:  7h 01m | Avg: 10m 48s | Max: 29m 55s | Hits:  81%/94031 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 47m 08s | Avg: 11m 47s | Max: 19m 46s | Hits:  66%/11142 
      🟩 Clang15            Pass: 100%/2   | Total:  9m 02s | Avg:  4m 31s | Max:  4m 35s | Hits:  99%/5581  
      🟩 Clang16            Pass: 100%/2   | Total:  8m 58s | Avg:  4m 29s | Max:  4m 32s | Hits:  99%/5581  
      🟩 Clang17            Pass: 100%/2   | Total:  8m 44s | Avg:  4m 22s | Max:  4m 22s | Hits:  99%/5581  
      🟩 Clang18            Pass: 100%/6   | Total:  1h 35m | Avg: 15m 50s | Max: 21m 01s | Hits:  43%/13982 
      🟩 GCC7               Pass: 100%/2   | Total: 26m 49s | Avg: 13m 24s | Max: 23m 09s | Hits:  66%/5518  
      🟩 GCC8               Pass: 100%/1   | Total:  3m 40s | Avg:  3m 40s | Max:  3m 40s | Hits:  99%/2769  
      🟩 GCC9               Pass: 100%/2   | Total:  7m 07s | Avg:  3m 33s | Max:  3m 42s | Hits:  99%/5530  
      🟩 GCC10              Pass: 100%/2   | Total: 24m 53s | Avg: 12m 26s | Max: 21m 01s | Hits:  66%/5587  
      🟩 GCC11              Pass: 100%/2   | Total: 28m 23s | Avg: 14m 11s | Max: 24m 06s | Hits:  66%/5583  
      🟩 GCC12              Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 08s | Hits:  99%/5583  
      🟩 GCC13              Pass: 100%/8   | Total:  1h 16m | Avg:  9m 34s | Max: 22m 10s | Hits:  82%/11341 
      🟩 MSVC14.29          Pass: 100%/2   | Total: 46m 46s | Avg: 23m 23s | Max: 24m 51s | Hits:  93%/5056  
      🟩 MSVC14.42          Pass: 100%/2   | Total: 52m 53s | Avg: 26m 26s | Max: 29m 55s | Hits:  63%/5217  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 37s | Hits:  98%/5569  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/16  | Total:  2h 48m | Avg: 10m 33s | Max: 21m 01s | Hits:  71%/41867 
      🟩 GCC                Pass: 100%/19  | Total:  2h 55m | Avg:  9m 14s | Max: 24m 06s | Hits:  81%/41911 
      🟩 MSVC               Pass: 100%/4   | Total:  1h 39m | Avg: 24m 54s | Max: 29m 55s | Hits:  77%/10273 
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 10s | Avg:  8m 35s | Max:  8m 37s | Hits:  98%/5569  
    🟩 gpu
      🟩 rtx2080            Pass: 100%/41  | Total:  7h 41m | Avg: 11m 14s | Max: 29m 55s | Hits:  78%/99620 
    🟩 jobs
      🟩 Build              Pass: 100%/36  | Total:  6h 50m | Avg: 11m 23s | Max: 29m 55s | Hits:  78%/99580 
      🟩 NVRTC              Pass: 100%/2   | Total: 30m 29s | Avg: 15m 14s | Max: 15m 49s | Hits:  90%/40    
      🟩 Test               Pass: 100%/2   | Total: 18m 17s | Avg:  9m 08s | Max:  9m 10s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 05s | Avg:  2m 05s | Max:  2m 05s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 30m 29s | Avg: 15m 14s | Max: 15m 49s | Hits:  90%/40    
      🟩 90;90a;100         Pass: 100%/1   | Total:  4m 11s | Avg:  4m 11s | Max:  4m 11s | Hits:  99%/2905  
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  4h 02m | Avg: 11m 31s | Max: 24m 51s | Hits:  81%/54648 
      🟩 20                 Pass: 100%/19  | Total:  3h 36m | Avg: 11m 24s | Max: 29m 55s | Hits:  73%/44972 
    
  • 🟩 cudax: Pass: 100%/20 | Total: 2h 01m | Avg: 6m 03s | Max: 14m 40s | Hits: 90%/10080

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 45m | Avg:  6m 34s | Max: 14m 40s | Hits:  89%/7868  
      🟩 arm64              Pass: 100%/4   | Total: 15m 58s | Avg:  3m 59s | Max:  4m 07s | Hits:  91%/2212  
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s | Hits:  59%/261   
      🟩 12.5               Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max:  7m 22s | Hits:  84%/706   
      🟩 12.8               Pass: 100%/17  | Total:  1h 36m | Avg:  5m 41s | Max: 14m 40s | Hits:  91%/9113  
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s | Hits:  59%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max:  7m 22s | Hits:  84%/706   
      🟩 nvcc12.8           Pass: 100%/17  | Total:  1h 36m | Avg:  5m 41s | Max: 14m 40s | Hits:  91%/9113  
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  2h 01m | Avg:  6m 03s | Max: 14m 40s | Hits:  90%/10080 
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  4m 20s | Avg:  4m 20s | Max:  4m 20s | Hits:  91%/555   
      🟩 Clang15            Pass: 100%/1   | Total:  4m 54s | Avg:  4m 54s | Max:  4m 54s | Hits:  91%/553   
      🟩 Clang16            Pass: 100%/1   | Total:  4m 36s | Avg:  4m 36s | Max:  4m 36s | Hits:  91%/553   
      🟩 Clang17            Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s | Hits:  91%/553   
      🟩 Clang18            Pass: 100%/4   | Total: 27m 24s | Avg:  6m 51s | Max: 14m 40s | Hits:  93%/2212  
      🟩 GCC10              Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s | Hits:  91%/555   
      🟩 GCC11              Pass: 100%/1   | Total:  4m 32s | Avg:  4m 32s | Max:  4m 32s | Hits:  91%/553   
      🟩 GCC12              Pass: 100%/2   | Total: 17m 08s | Avg:  8m 34s | Max: 12m 29s | Hits:  95%/1106  
      🟩 GCC13              Pass: 100%/4   | Total: 15m 47s | Avg:  3m 56s | Max:  4m 07s | Hits:  91%/2212  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s | Hits:  59%/261   
      🟩 MSVC14.42          Pass: 100%/1   | Total:  9m 25s | Avg:  9m 25s | Max:  9m 25s | Hits:  59%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max:  7m 22s | Hits:  84%/706   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 45m 30s | Avg:  5m 41s | Max: 14m 40s | Hits:  92%/4426  
      🟩 GCC                Pass: 100%/8   | Total: 41m 49s | Avg:  5m 13s | Max: 12m 29s | Hits:  92%/4426  
      🟩 MSVC               Pass: 100%/2   | Total: 19m 26s | Avg:  9m 43s | Max: 10m 01s | Hits:  59%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 14m 27s | Avg:  7m 13s | Max:  7m 22s | Hits:  84%/706   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/20  | Total:  2h 01m | Avg:  6m 03s | Max: 14m 40s | Hits:  90%/10080 
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 34m | Avg:  5m 13s | Max: 10m 01s | Hits:  88%/8974  
      🟩 Test               Pass: 100%/2   | Total: 27m 09s | Avg: 13m 34s | Max: 14m 40s | Hits:  99%/1106  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s | Hits:  91%/553   
      🟩 90a                Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s | Hits:  91%/553   
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 18m 51s | Avg:  4m 42s | Max:  7m 05s | Hits:  90%/2012  
      🟩 20                 Pass: 100%/16  | Total:  1h 42m | Avg:  6m 23s | Max: 14m 40s | Hits:  90%/8068  
    
  • 🟩 cccl: Pass: 100%/4 | Total: 19m 59s | Avg: 4m 59s | Max: 8m 11s

    🟩 cpu
      🟩 amd64              Pass: 100%/4   | Total: 19m 59s | Avg:  4m 59s | Max:  8m 11s
    🟩 ctk
      🟩 12.0               Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  8m 11s
      🟩 12.8               Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 14s
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/2   | Total: 11m 39s | Avg:  5m 49s | Max:  8m 11s
      🟩 nvcc12.8           Pass: 100%/2   | Total:  8m 20s | Avg:  4m 10s | Max:  4m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/4   | Total: 19m 59s | Avg:  4m 59s | Max:  8m 11s
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  8m 11s | Avg:  8m 11s | Max:  8m 11s
      🟩 Clang18            Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/2   | Total: 12m 17s | Avg:  6m 08s | Max:  8m 11s
      🟩 GCC                Pass: 100%/2   | Total:  7m 42s | Avg:  3m 51s | Max:  4m 14s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 19m 59s | Avg:  4m 59s | Max:  8m 11s
    🟩 jobs
      🟩 Infra              Pass: 100%/4   | Total: 19m 59s | Avg:  4m 59s | Max:  8m 11s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 55s | Avg: 5m 27s | Max: 8m 34s | Hits: 98%/280

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 10m 55s | Avg:  5m 27s | Max:  8m 34s | Hits:  98%/280   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s | Hits:  98%/140   
      🟩 Test               Pass: 100%/1   | Total:  8m 34s | Avg:  8m 34s | Max:  8m 34s | Hits:  98%/140   
    
  • 🟩 python: Pass: 100%/1 | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 29m 32s | Avg: 29m 32s | Max: 29m 32s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 155)

# Runner
108 linux-amd64-cpu16
15 windows-amd64-cpu16
12 linux-amd64-gpu-rtx2080-latest-1
10 linux-arm64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
1 linux-amd64-gpu-h100-latest-1

@miscco miscco merged commit 1faabf3 into NVIDIA:main Feb 6, 2025
167 of 170 checks passed
Projects
Status: Done
3 participants