PTX: Add multimem instructions #3603

Merged: 4 commits merged into NVIDIA:main from ptx_multimem on Jan 30, 2025

bernhardmgruber

No description provided.

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) January 30, 2025 11:07

🟩 CI finished in 1h 30m: Pass: 100%/152 | Total: 2d 20h | Avg: 27m 04s | Max: 1h 15m | Hits: 418%/21603
  • 🟩 cub: Pass: 100%/44 | Total: 1d 10h | Avg: 47m 20s | Max: 1h 15m | Hits: 256%/3552

    🟩 cpu
      🟩 amd64              Pass: 100%/42  | Total:  1d 09h | Avg: 47m 13s | Max:  1h 15m | Hits: 256%/3552  
      🟩 arm64              Pass: 100%/2   | Total:  1h 39m | Avg: 49m 36s | Max: 49m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 25m | Avg: 53m 01s | Max:  1h 05m | Hits: 335%/888   
      🟩 12.5               Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
      🟩 12.6               Pass: 100%/37  | Total:  1d 04h | Avg: 45m 47s | Max:  1h 15m | Hits: 229%/2664  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 49m | Avg: 54m 31s | Max: 55m 15s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 25m | Avg: 53m 01s | Max:  1h 05m | Hits: 335%/888   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
      🟩 nvcc12.6           Pass: 100%/35  | Total:  1d 02h | Avg: 45m 17s | Max:  1h 15m | Hits: 229%/2664  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 49m | Avg: 54m 31s | Max: 55m 15s
      🟩 nvcc               Pass: 100%/42  | Total:  1d 08h | Avg: 46m 59s | Max:  1h 15m | Hits: 256%/3552  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 18m | Avg: 49m 31s | Max: 51m 40s
      🟩 Clang15            Pass: 100%/2   | Total:  1h 39m | Avg: 49m 52s | Max: 52m 37s
      🟩 Clang16            Pass: 100%/2   | Total:  1h 43m | Avg: 51m 35s | Max: 54m 49s
      🟩 Clang17            Pass: 100%/2   | Total:  1h 42m | Avg: 51m 09s | Max: 53m 45s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 13m | Avg: 44m 49s | Max: 55m 16s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 39m | Avg: 49m 51s | Max: 52m 22s
      🟩 GCC8               Pass: 100%/1   | Total: 47m 44s | Avg: 47m 44s | Max: 47m 44s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 39m | Avg: 49m 41s | Max: 50m 28s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 36m | Avg: 48m 17s | Max: 49m 53s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 44m | Avg: 52m 13s | Max: 52m 47s
      🟩 GCC12              Pass: 100%/4   | Total:  2h 37m | Avg: 39m 24s | Max: 55m 33s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 13m | Avg: 31m 42s | Max: 58m 15s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 18m | Avg:  1h 09m | Max:  1h 13m | Hits: 293%/1776  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 15m | Hits: 219%/1776  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 13h 37m | Avg: 48m 03s | Max: 55m 16s
      🟩 GCC                Pass: 100%/21  | Total: 14h 19m | Avg: 40m 54s | Max: 58m 15s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 43m | Avg:  1h 10m | Max:  1h 15m | Hits: 256%/3552  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 52m 50s | Avg: 26m 25s | Max: 29m 44s
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 03m | Avg: 30m 25s | Max: 58m 15s
      🟩 v100               Pass: 100%/34  | Total:  1d 05h | Avg: 52m 32s | Max:  1h 15m | Hits: 256%/3552  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 08h | Avg: 51m 58s | Max:  1h 15m | Hits: 256%/3552  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 55s | Avg: 20m 55s | Max: 20m 55s
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 11s | Avg: 16m 11s | Max: 16m 11s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 22m | Avg: 27m 20s | Max: 29m 44s
      🟩 TestGPU            Pass: 100%/2   | Total: 40m 30s | Avg: 20m 15s | Max: 21m 16s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 52m 50s | Avg: 26m 25s | Max: 29m 44s
      🟩 90a                Pass: 100%/1   | Total: 17m 27s | Avg: 17m 27s | Max: 17m 27s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 17h 47m | Avg: 53m 23s | Max:  1h 15m | Hits: 267%/2664  
      🟩 20                 Pass: 100%/24  | Total: 16h 54m | Avg: 42m 17s | Max:  1h 09m | Hits: 223%/888   
    
  • 🟩 libcudacxx: Pass: 100%/43 | Total: 10h 24m | Avg: 14m 30s | Max: 34m 05s | Hits: 637%/10145

    🟩 cpu
      🟩 amd64              Pass: 100%/41  | Total:  9h 57m | Avg: 14m 34s | Max: 34m 05s | Hits: 637%/10145 
      🟩 arm64              Pass: 100%/2   | Total: 26m 15s | Avg: 13m 07s | Max: 20m 23s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 08m | Avg: 13m 43s | Max: 23m 51s | Hits: 620%/2491  
      🟩 12.5               Pass: 100%/2   | Total: 30m 04s | Avg: 15m 02s | Max: 21m 31s
      🟩 12.6               Pass: 100%/36  | Total:  8h 45m | Avg: 14m 35s | Max: 34m 05s | Hits: 643%/7654  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 12m | Avg: 18m 05s | Max: 26m 34s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 08m | Avg: 13m 43s | Max: 23m 51s | Hits: 620%/2491  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 30m 04s | Avg: 15m 02s | Max: 21m 31s
      🟩 nvcc12.6           Pass: 100%/32  | Total:  7h 33m | Avg: 14m 09s | Max: 34m 05s | Hits: 643%/7654  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 12m | Avg: 18m 05s | Max: 26m 34s
      🟩 nvcc               Pass: 100%/39  | Total:  9h 11m | Avg: 14m 08s | Max: 34m 05s | Hits: 637%/10145 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 04m | Avg: 16m 00s | Max: 20m 53s
      🟩 Clang15            Pass: 100%/2   | Total: 25m 17s | Avg: 12m 38s | Max: 20m 29s
      🟩 Clang16            Pass: 100%/2   | Total: 25m 22s | Avg: 12m 41s | Max: 20m 51s
      🟩 Clang17            Pass: 100%/2   | Total: 32m 16s | Avg: 16m 08s | Max: 22m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 52m | Avg: 14m 02s | Max: 26m 34s
      🟩 GCC7               Pass: 100%/2   | Total:  7m 32s | Avg:  3m 46s | Max:  4m 00s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC9               Pass: 100%/2   | Total: 34m 16s | Avg: 17m 08s | Max: 17m 27s
      🟩 GCC10              Pass: 100%/2   | Total: 24m 31s | Avg: 12m 15s | Max: 20m 18s
      🟩 GCC11              Pass: 100%/2   | Total: 35m 40s | Avg: 17m 50s | Max: 17m 58s
      🟩 GCC12              Pass: 100%/2   | Total: 42m 19s | Avg: 21m 09s | Max: 22m 13s
      🟩 GCC13              Pass: 100%/8   | Total:  1h 16m | Avg:  9m 34s | Max: 20m 23s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 49m 02s | Avg: 24m 31s | Max: 25m 11s | Hits: 625%/4992  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 01m | Avg: 30m 33s | Max: 34m 05s | Hits: 649%/5153  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 30m 04s | Avg: 15m 02s | Max: 21m 31s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/18  | Total:  4h 19m | Avg: 14m 24s | Max: 26m 34s
      🟩 GCC                Pass: 100%/19  | Total:  3h 44m | Avg: 11m 49s | Max: 22m 13s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 50m | Avg: 27m 32s | Max: 34m 05s | Hits: 637%/10145 
      🟩 NVHPC              Pass: 100%/2   | Total: 30m 04s | Avg: 15m 02s | Max: 21m 31s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/6   | Total:  1h 00m | Avg: 10m 04s | Max: 18m 52s
      🟩 v100               Pass: 100%/37  | Total:  9h 23m | Avg: 15m 14s | Max: 34m 05s | Hits: 637%/10145 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total:  9h 30m | Avg: 15m 00s | Max: 34m 05s | Hits: 637%/10145 
      🟩 NVRTC              Pass: 100%/2   | Total: 34m 32s | Avg: 17m 16s | Max: 18m 52s
      🟩 Test               Pass: 100%/2   | Total: 17m 07s | Avg:  8m 33s | Max:  8m 57s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 55s | Avg:  1m 55s | Max:  1m 55s
    🟩 sm
      🟩 75                 Pass: 100%/2   | Total: 34m 32s | Avg: 17m 16s | Max: 18m 52s
      🟩 90                 Pass: 100%/1   | Total: 13m 22s | Avg: 13m 22s | Max: 13m 22s
      🟩 90a                Pass: 100%/2   | Total: 17m 41s | Avg:  8m 50s | Max: 14m 06s
    🟩 std
      🟩 17                 Pass: 100%/21  | Total:  6h 02m | Avg: 17m 16s | Max: 27m 01s | Hits: 645%/7493  
      🟩 20                 Pass: 100%/21  | Total:  4h 19m | Avg: 12m 21s | Max: 34m 05s | Hits: 616%/2652  
    
  • 🟩 thrust: Pass: 100%/42 | Total: 21h 08m | Avg: 30m 11s | Max: 1h 10m | Hits: 195%/7384

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 36m 49s | Avg: 18m 24s | Max: 26m 06s
    🟩 cpu
      🟩 amd64              Pass: 100%/40  | Total: 20h 24m | Avg: 30m 36s | Max:  1h 10m | Hits: 195%/7384  
      🟩 arm64              Pass: 100%/2   | Total: 43m 50s | Avg: 21m 55s | Max: 24m 05s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 37m | Avg: 31m 32s | Max: 50m 11s | Hits: 217%/1846  
      🟩 12.5               Pass: 100%/2   | Total:  1h 42m | Avg: 51m 23s | Max: 53m 10s
      🟩 12.6               Pass: 100%/35  | Total: 16h 47m | Avg: 28m 47s | Max:  1h 10m | Hits: 188%/5538  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 45m 50s | Avg: 22m 55s | Max: 23m 51s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 37m | Avg: 31m 32s | Max: 50m 11s | Hits: 217%/1846  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 42m | Avg: 51m 23s | Max: 53m 10s
      🟩 nvcc12.6           Pass: 100%/33  | Total: 16h 01m | Avg: 29m 09s | Max:  1h 10m | Hits: 188%/5538  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 45m 50s | Avg: 22m 55s | Max: 23m 51s
      🟩 nvcc               Pass: 100%/40  | Total: 20h 22m | Avg: 30m 33s | Max:  1h 10m | Hits: 195%/7384  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 45m | Avg: 26m 25s | Max: 28m 55s
      🟩 Clang15            Pass: 100%/2   | Total: 58m 13s | Avg: 29m 06s | Max: 29m 59s
      🟩 Clang16            Pass: 100%/2   | Total: 57m 21s | Avg: 28m 40s | Max: 30m 43s
      🟩 Clang17            Pass: 100%/2   | Total: 56m 17s | Avg: 28m 08s | Max: 29m 54s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 23m | Avg: 20m 34s | Max: 30m 02s
      🟩 GCC7               Pass: 100%/2   | Total: 54m 08s | Avg: 27m 04s | Max: 27m 06s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 18s | Avg: 31m 18s | Max: 31m 18s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 03m | Avg: 31m 56s | Max: 34m 40s
      🟩 GCC10              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 59s | Max: 34m 34s
      🟩 GCC11              Pass: 100%/2   | Total:  1h 04m | Avg: 32m 21s | Max: 33m 29s
      🟩 GCC12              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 56s | Max: 36m 33s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 45m | Avg: 20m 38s | Max: 34m 42s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 45m | Avg: 52m 30s | Max: 54m 50s | Hits: 207%/3692  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 10m | Hits: 184%/3692  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 23s | Max: 53m 10s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 01m | Avg: 24m 47s | Max: 30m 43s
      🟩 GCC                Pass: 100%/19  | Total:  8h 31m | Avg: 26m 53s | Max: 36m 33s
      🟩 MSVC               Pass: 100%/4   | Total:  3h 52m | Avg: 58m 14s | Max:  1h 10m | Hits: 195%/7384  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 42m | Avg: 51m 23s | Max: 53m 10s
    🟩 gpu
      🟩 rtx4090            Pass: 100%/8   | Total:  2h 19m | Avg: 17m 25s | Max: 34m 42s
      🟩 v100               Pass: 100%/34  | Total: 18h 48m | Avg: 33m 12s | Max:  1h 10m | Hits: 195%/7384  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total: 20h 19m | Avg: 32m 57s | Max:  1h 10m | Hits: 195%/7384  
      🟩 TestCPU            Pass: 100%/2   | Total: 15m 33s | Avg:  7m 46s | Max:  8m 10s
      🟩 TestGPU            Pass: 100%/3   | Total: 33m 01s | Avg: 11m 00s | Max: 11m 46s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 17m 15s | Avg: 17m 15s | Max: 17m 15s
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 25m | Avg: 34m 17s | Max: 57m 44s | Hits: 201%/5538  
      🟩 20                 Pass: 100%/20  | Total:  9h 05m | Avg: 27m 17s | Max:  1h 10m | Hits: 180%/1846  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 41m | Avg: 5m 04s | Max: 15m 57s | Hits: 386%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 31m | Avg:  5m 42s | Max: 15m 57s | Hits: 386%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 19s | Avg:  2m 34s | Max:  2m 38s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total:  8m 40s | Avg:  8m 40s | Max:  8m 40s | Hits: 388%/261   
      🟩 12.5               Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 24s
      🟩 12.6               Pass: 100%/17  | Total:  1h 22m | Avg:  4m 51s | Max: 15m 57s | Hits: 385%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total:  8m 40s | Avg:  8m 40s | Max:  8m 40s | Hits: 388%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 24s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 22m | Avg:  4m 51s | Max: 15m 57s | Hits: 385%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 41m | Avg:  5m 04s | Max: 15m 57s | Hits: 386%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 07s | Avg:  3m 07s | Max:  3m 07s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 15s | Avg:  3m 15s | Max:  3m 15s
      🟩 Clang18            Pass: 100%/4   | Total: 24m 18s | Avg:  6m 04s | Max: 15m 57s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 09s | Avg:  3m 09s | Max:  3m 09s
      🟩 GCC11              Pass: 100%/1   | Total:  2m 57s | Avg:  2m 57s | Max:  2m 57s
      🟩 GCC12              Pass: 100%/2   | Total: 16m 44s | Avg:  8m 22s | Max: 13m 35s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 45s | Avg:  2m 41s | Max:  2m 53s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 40s | Avg:  8m 40s | Max:  8m 40s | Hits: 388%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 29s | Avg: 11m 29s | Max: 11m 29s | Hits: 385%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 24s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 37m 26s | Avg:  4m 40s | Max: 15m 57s
      🟩 GCC                Pass: 100%/8   | Total: 33m 35s | Avg:  4m 11s | Max: 13m 35s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 09s | Avg: 10m 04s | Max: 11m 29s | Hits: 386%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 10m 21s | Avg:  5m 10s | Max:  5m 24s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/4   | Total: 35m 50s | Avg:  8m 57s | Max: 15m 57s
      🟩 v100               Pass: 100%/16  | Total:  1h 05m | Avg:  4m 06s | Max: 11m 29s | Hits: 386%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 11m | Avg:  3m 59s | Max: 11m 29s | Hits: 386%/522   
      🟩 Test               Pass: 100%/2   | Total: 29m 32s | Avg: 14m 46s | Max: 15m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 53s | Avg:  2m 53s | Max:  2m 53s
      🟩 90a                Pass: 100%/1   | Total:  2m 45s | Avg:  2m 45s | Max:  2m 45s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 12m 55s | Avg:  3m 13s | Max:  4m 57s
      🟩 20                 Pass: 100%/16  | Total:  1h 28m | Avg:  5m 32s | Max: 15m 57s | Hits: 386%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 00s | Avg: 5m 30s | Max: 8m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  8m 51s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
      🟩 Test               Pass: 100%/1   | Total:  8m 51s | Avg:  8m 51s | Max:  8m 51s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 37s | Avg: 27m 37s | Max: 27m 37s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 152)

# Runner
110 linux-amd64-cpu16
14 windows-amd64-cpu16
10 linux-arm64-cpu16
8 linux-amd64-gpu-rtx2080-latest-1
6 linux-amd64-gpu-rtxa6000-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
1 linux-amd64-gpu-h100-latest-1

@bernhardmgruber bernhardmgruber merged commit afa2ca2 into NVIDIA:main Jan 30, 2025
164 of 168 checks passed

Git push to origin failed for branch/2.8.x with exitcode 128

@bernhardmgruber bernhardmgruber deleted the ptx_multimem branch January 30, 2025 12:21
Projects
Status: Done
3 participants