Skip to content

Conversation

@bernhardmgruber
Copy link
Contributor

@bernhardmgruber bernhardmgruber commented Mar 10, 2025

This PR:

  • Allows write access to the future value (up for debate, I don't necessarily need it, but it would allow an indirect offset_iterator (with a FutureValue as offset) to be moved. See also: Add thrust::offset_iterator #4073
  • Make getting the value out a const operation
  • Adds a deduction guide
  • Adds tests

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner March 10, 2025 16:53
@github-project-automation github-project-automation bot moved this to Todo in CCCL Mar 10, 2025
@bernhardmgruber bernhardmgruber marked this pull request as draft March 10, 2025 16:53
@cccl-authenticator-app cccl-authenticator-app bot moved this from Todo to In Review in CCCL Mar 10, 2025
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Review to In Progress in CCCL Mar 10, 2025
@bernhardmgruber bernhardmgruber marked this pull request as ready for review March 10, 2025 16:57
@cccl-authenticator-app cccl-authenticator-app bot moved this from In Progress to In Review in CCCL Mar 10, 2025
@NVIDIA NVIDIA deleted a comment from copy-pr-bot bot Mar 10, 2025
@github-actions
Copy link
Contributor

🟩 CI finished in 3h 23m: Pass: 100%/93 | Total: 2d 17h | Avg: 42m 28s | Max: 1h 26m | Hits: 60%/133890
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 28s | Max: 1h 26m | Hits: 35%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 08s | Max:  1h 26m | Hits:  36%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m | Hits:  22%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 17m | Avg:  1h 03m | Max:  1h 09m | Hits:  20%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits:  18%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 54m 42s | Max:  1h 26m | Hits:  38%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  21%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 17m | Avg:  1h 03m | Max:  1h 09m | Hits:  20%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits:  18%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 14s | Max:  1h 26m | Hits:  38%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 03m | Hits:  21%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 10s | Max:  1h 26m | Hits:  35%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 05m | Avg:  1h 01m | Max:  1h 03m | Hits:  22%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 05m | Hits:  22%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 05m | Hits:  22%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 00m | Hits:  22%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 55m | Avg: 50m 47s | Max:  1h 03m | Hits:  45%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 03m | Hits:  22%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  22%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m | Hits:  22%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  22%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  22%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 03m | Hits:  22%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 58m | Avg: 38m 00s | Max:  1h 09m | Hits:  64%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 31m | Avg:  1h 15m | Max:  1h 21m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 42m | Avg:  1h 21m | Max:  1h 26m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits:  18%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 15m | Avg: 57m 22s | Max:  1h 05m | Hits:  31%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 26m | Avg: 50m 18s | Max:  1h 09m | Hits:  43%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 14m | Avg:  1h 18m | Max:  1h 26m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 25m | Avg:  1h 12m | Max:  1h 15m | Hits:  18%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 15m | Avg: 25m 08s | Max: 28m 58s | Hits:  73%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 05m | Max:  1h 26m | Hits:  21%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 11m | Avg: 31m 29s | Max:  1h 02m | Hits:  80%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 03m | Max:  1h 26m | Hits:  21%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 25s | Avg: 21m 25s | Max: 21m 25s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 13s | Avg: 17m 13s | Max: 17m 13s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 56s | Max: 25m 02s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 20s | Max: 21m 24s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 15m | Avg: 25m 08s | Max: 28m 58s | Hits:  73%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 09m | Avg:  1h 09m | Max:  1h 09m | Hits:  22%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 31m | Avg:  1h 04m | Max:  1h 21m | Hits:  20%/23591 
      🟩 20                 Pass: 100%/25  | Total: 20h 50m | Avg: 50m 01s | Max:  1h 26m | Hits:  46%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 22h 10m | Avg: 29m 33s | Max: 58m 35s | Hits: 77%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 35m 48s | Avg: 17m 54s | Max: 24m 37s | Hits:  88%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 21h 18m | Avg: 29m 43s | Max: 58m 35s | Hits:  77%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 52m 00s | Avg: 26m 00s | Max: 27m 44s | Hits:  76%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 54m | Avg: 34m 48s | Max: 57m 47s | Hits:  72%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 55m 11s | Hits:  63%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 17h 28m | Avg: 27m 36s | Max: 58m 35s | Hits:  78%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 47m 40s | Avg: 23m 50s | Max: 25m 08s | Hits:  76%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 54m | Avg: 34m 48s | Max: 57m 47s | Hits:  72%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 55m 11s | Hits:  63%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 41m | Avg: 27m 48s | Max: 58m 35s | Hits:  78%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 47m 40s | Avg: 23m 50s | Max: 25m 08s | Hits:  76%/3554  
      🟩 nvcc               Pass: 100%/43  | Total: 21h 22m | Avg: 29m 49s | Max: 58m 35s | Hits:  77%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 48m | Avg: 27m 12s | Max: 27m 55s | Hits:  77%/7108  
      🟩 Clang15            Pass: 100%/2   | Total: 59m 19s | Avg: 29m 39s | Max: 29m 57s | Hits:  76%/3554  
      🟩 Clang16            Pass: 100%/2   | Total: 57m 43s | Avg: 28m 51s | Max: 30m 24s | Hits:  76%/3554  
      🟩 Clang17            Pass: 100%/2   | Total: 57m 49s | Avg: 28m 54s | Max: 29m 10s | Hits:  76%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 27m | Avg: 21m 06s | Max: 30m 08s | Hits:  83%/12439 
      🟩 GCC7               Pass: 100%/2   | Total: 58m 12s | Avg: 29m 06s | Max: 30m 49s | Hits:  76%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 27m 58s | Avg: 27m 58s | Max: 27m 58s | Hits:  76%/1778  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 27s | Max: 33m 11s | Hits:  76%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 03m | Avg: 31m 39s | Max: 32m 55s | Hits:  76%/3556  
      🟩 GCC11              Pass: 100%/2   | Total: 59m 09s | Avg: 29m 34s | Max: 31m 13s | Hits:  76%/3556  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 00s | Max: 31m 56s | Hits:  76%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 22m | Avg: 20m 14s | Max: 32m 25s | Hits:  86%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 51m | Avg: 55m 38s | Max: 57m 47s | Hits:  54%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 26m | Avg: 48m 46s | Max: 58m 35s | Hits:  60%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 55m 11s | Hits:  63%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 11m | Avg: 25m 22s | Max: 30m 24s | Hits:  79%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  8h 53m | Avg: 25m 25s | Max: 33m 11s | Hits:  81%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 17m | Avg: 51m 31s | Max: 58m 35s | Hits:  58%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 43s | Max: 55m 11s | Hits:  63%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 16m 42s | Hits:  88%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 18h 03m | Avg: 32m 50s | Max: 58m 35s | Hits:  74%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 38m | Avg: 21m 53s | Max: 55m 06s | Hits:  85%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 38m | Avg: 32m 35s | Max: 58m 35s | Hits:  73%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 05s | Avg: 16m 01s | Max: 32m 39s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 09s | Avg: 11m 02s | Max: 11m 32s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 57s | Avg: 13m 58s | Max: 16m 42s | Hits:  88%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 32m 25s | Avg: 32m 25s | Max: 32m 25s | Hits:  76%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 35m | Avg: 34m 45s | Max: 58m 35s | Hits:  73%/35531 
      🟩 20                 Pass: 100%/23  | Total:  9h 59m | Avg: 26m 03s | Max: 55m 06s | Hits:  80%/40869 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 57s | Avg: 8m 28s | Max: 14m 21s | Hits: 96%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 57s | Avg:  8m 28s | Max: 14m 21s | Hits:  96%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 36s | Avg:  2m 36s | Max:  2m 36s | Hits:  94%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 21s | Avg: 14m 21s | Max: 14m 21s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 00m | Avg: 1h 00m | Max: 1h 00m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@elstehle
Copy link
Contributor

  • Allows write access to the future value (up for debate, I don't necessarily need it, but it would allow an indirect offset_iterator (with a FutureValue as offset) to be moved. See also: [Add thrust::offset_iterator Add thrust::offset_iterator #4073]

I am not in favor of allowing write access to FutureValue. FutureValue was introduced at the interface of CUB algorithms, like DeviceScan, to allow users to pass the initial value used for scan as a device-accessible iterator. Considering that use case, I'm afraid that if we introduce write access, writes to a FutureValue are prone to crossing the host/device boundary and many users ending up writing on the host to what is just a wrapper to a device-accessible value.

Copy link
Contributor

@elstehle elstehle left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd vote against introducing write access. Other changes LGTM.

bernhardmgruber and others added 3 commits March 11, 2025 10:59
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 40m: Pass: 100%/93 | Total: 2d 17h | Avg: 42m 31s | Max: 1h 22m | Hits: 60%/133890
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 46s | Max: 1h 22m | Hits: 35%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 30s | Max:  1h 22m | Hits:  36%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 03m | Hits:  22%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 09m | Hits:  20%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  18%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 55m 02s | Max:  1h 22m | Hits:  38%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  21%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 14m | Avg:  1h 02m | Max:  1h 09m | Hits:  20%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  18%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 36s | Max:  1h 22m | Hits:  38%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  21%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 28s | Max:  1h 22m | Hits:  35%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 04m | Avg:  1h 01m | Max:  1h 03m | Hits:  22%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 02m | Hits:  22%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 02m | Hits:  22%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 06m | Hits:  22%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 52m | Avg: 50m 22s | Max:  1h 04m | Hits:  45%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 05m | Hits:  22%/2440  
      🟩 GCC8               Pass: 100%/1   | Total: 59m 39s | Avg: 59m 39s | Max: 59m 39s | Hits:  22%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 05m | Hits:  22%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 08m | Hits:  22%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  22%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m | Hits:  22%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 07m | Avg: 38m 52s | Max:  1h 12m | Hits:  64%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 27m | Avg:  1h 13m | Max:  1h 18m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 38m | Avg:  1h 19m | Max:  1h 22m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  18%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 11m | Avg: 57m 07s | Max:  1h 06m | Hits:  31%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 50m | Avg: 51m 22s | Max:  1h 12m | Hits:  43%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 05m | Avg:  1h 16m | Max:  1h 22m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  18%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 35s | Max: 28m 50s | Hits:  73%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 05m | Max:  1h 22m | Hits:  21%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 19m | Avg: 32m 22s | Max:  1h 02m | Hits:  80%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 22m | Hits:  21%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 24m 57s | Avg: 24m 57s | Max: 24m 57s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 16m 44s | Avg: 16m 44s | Max: 16m 44s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 11m | Avg: 23m 52s | Max: 24m 18s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 07m | Avg: 22m 21s | Max: 25m 51s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 35s | Max: 28m 50s | Hits:  73%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 12m | Avg:  1h 12m | Max:  1h 12m | Hits:  22%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 39m | Avg:  1h 04m | Max:  1h 18m | Hits:  20%/23591 
      🟩 20                 Pass: 100%/25  | Total: 20h 55m | Avg: 50m 12s | Max:  1h 22m | Hits:  46%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 21h 56m | Avg: 29m 15s | Max: 1h 00m | Hits: 77%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 36m 27s | Avg: 18m 13s | Max: 25m 10s | Hits:  88%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 21h 03m | Avg: 29m 23s | Max:  1h 00m | Hits:  77%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 52m 34s | Avg: 26m 17s | Max: 28m 08s | Hits:  76%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 46m | Avg: 33m 21s | Max: 53m 13s | Hits:  72%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 52m 10s | Hits:  63%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 17h 28m | Avg: 27m 35s | Max:  1h 00m | Hits:  78%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 47m 54s | Avg: 23m 57s | Max: 24m 04s | Hits:  76%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 46m | Avg: 33m 21s | Max: 53m 13s | Hits:  72%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 52m 10s | Hits:  63%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 40m | Avg: 27m 47s | Max:  1h 00m | Hits:  78%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 47m 54s | Avg: 23m 57s | Max: 24m 04s | Hits:  76%/3554  
      🟩 nvcc               Pass: 100%/43  | Total: 21h 08m | Avg: 29m 29s | Max:  1h 00m | Hits:  77%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 49m | Avg: 27m 15s | Max: 29m 17s | Hits:  77%/7108  
      🟩 Clang15            Pass: 100%/2   | Total: 55m 03s | Avg: 27m 31s | Max: 27m 36s | Hits:  76%/3554  
      🟩 Clang16            Pass: 100%/2   | Total: 55m 36s | Avg: 27m 48s | Max: 27m 51s | Hits:  76%/3554  
      🟩 Clang17            Pass: 100%/2   | Total: 58m 17s | Avg: 29m 08s | Max: 29m 37s | Hits:  76%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 25m | Avg: 20m 43s | Max: 28m 53s | Hits:  83%/12439 
      🟩 GCC7               Pass: 100%/2   | Total:  1h 02m | Avg: 31m 09s | Max: 31m 16s | Hits:  76%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 28m 52s | Avg: 28m 52s | Max: 28m 52s | Hits:  76%/1778  
      🟩 GCC9               Pass: 100%/2   | Total:  1h 00m | Avg: 30m 06s | Max: 30m 32s | Hits:  76%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 00m | Avg: 30m 21s | Max: 31m 21s | Hits:  76%/3556  
      🟩 GCC11              Pass: 100%/2   | Total: 56m 25s | Avg: 28m 12s | Max: 28m 25s | Hits:  76%/3556  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 55s | Max: 32m 13s | Hits:  76%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 23m | Avg: 20m 20s | Max: 31m 09s | Hits:  86%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 48m | Avg: 54m 28s | Max: 55m 43s | Hits:  54%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 29m | Avg: 49m 43s | Max:  1h 00m | Hits:  60%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 52m 10s | Hits:  63%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 03m | Avg: 24m 53s | Max: 29m 37s | Hits:  79%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  8h 53m | Avg: 25m 25s | Max: 32m 13s | Hits:  81%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 18m | Avg: 51m 37s | Max:  1h 00m | Hits:  58%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 41m | Avg: 50m 37s | Max: 52m 10s | Hits:  63%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 16m 20s | Hits:  88%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 17h 42m | Avg: 32m 11s | Max: 55m 43s | Hits:  74%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 45m | Avg: 22m 35s | Max:  1h 00m | Hits:  85%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 21m | Avg: 32m 08s | Max:  1h 00m | Hits:  73%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 26s | Avg: 16m 48s | Max: 35m 17s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 40s | Avg: 11m 10s | Max: 11m 35s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 55s | Avg: 13m 57s | Max: 16m 20s | Hits:  88%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 31m 09s | Avg: 31m 09s | Max: 31m 09s | Hits:  76%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 12m | Avg: 33m 37s | Max: 55m 43s | Hits:  73%/35531 
      🟩 20                 Pass: 100%/23  | Total: 10h 07m | Avg: 26m 24s | Max:  1h 00m | Hits:  80%/40869 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 17m 55s | Avg: 8m 57s | Max: 15m 09s | Hits: 96%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 17m 55s | Avg:  8m 57s | Max: 15m 09s | Hits:  96%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 46s | Avg:  2m 46s | Max:  2m 46s | Hits:  94%/160   
      🟩 Test               Pass: 100%/1   | Total: 15m 09s | Avg: 15m 09s | Max: 15m 09s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 06m | Avg: 1h 06m | Max: 1h 06m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 06m | Avg:  1h 06m | Max:  1h 06m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber enabled auto-merge (squash) March 11, 2025 12:23
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 35m: Pass: 100%/93 | Total: 2d 18h | Avg: 42m 47s | Max: 1h 27m | Hits: 60%/133890
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 57m 01s | Max: 1h 27m | Hits: 35%/53614

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 44s | Max:  1h 27m | Hits:  35%/51178 
      🟩 arm64              Pass: 100%/2   | Total:  2h 06m | Avg:  1h 03m | Max:  1h 04m | Hits:  22%/2436  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 09m | Hits:  19%/5922  
      🟩 12.5               Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  15%/2254  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 55m 14s | Max:  1h 27m | Hits:  38%/45438 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 11m | Hits:  21%/2104  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 18m | Avg:  1h 03m | Max:  1h 09m | Hits:  19%/5922  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  15%/2254  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 34s | Max:  1h 27m | Hits:  39%/43334 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 11m | Hits:  21%/2104  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 32s | Max:  1h 27m | Hits:  35%/51510 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 13m | Avg:  1h 03m | Max:  1h 06m | Hits:  22%/4880  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m | Hits:  22%/2436  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 05m | Hits:  22%/2436  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 02m | Hits:  22%/2436  
      🟩 Clang18            Pass: 100%/7   | Total:  6h 06m | Avg: 52m 24s | Max:  1h 11m | Hits:  45%/8194  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 01m | Hits:  20%/2440  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m | Hits:  22%/1220  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 07m | Hits:  20%/2440  
      🟩 GCC10              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  22%/2440  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 13m | Avg:  1h 06m | Max:  1h 12m | Hits:  22%/2436  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  22%/2436  
      🟩 GCC13              Pass: 100%/11  | Total:  7h 01m | Avg: 38m 16s | Max:  1h 08m | Hits:  64%/13398 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 18m | Hits:  12%/2084  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 43m | Avg:  1h 21m | Max:  1h 27m | Hits:  12%/2084  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  15%/2254  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 30m | Avg: 58m 15s | Max:  1h 11m | Hits:  31%/20382 
      🟩 GCC                Pass: 100%/22  | Total: 18h 35m | Avg: 50m 42s | Max:  1h 12m | Hits:  43%/26810 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 12m | Avg:  1h 18m | Max:  1h 27m | Hits:  12%/4168  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 28m | Avg:  1h 14m | Max:  1h 15m | Hits:  15%/2254  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 32s | Max: 27m 14s | Hits:  73%/3654  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 13h | Avg:  1h 05m | Max:  1h 27m | Hits:  20%/40216 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 15m | Avg: 31m 56s | Max:  1h 03m | Hits:  80%/9744  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 04m | Max:  1h 27m | Hits:  20%/43870 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 12s | Avg: 21m 12s | Max: 21m 12s | Hits:  99%/1218  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 26s | Avg: 17m 26s | Max: 17m 26s | Hits:  99%/1218  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 33s | Max: 23m 51s | Hits:  99%/3654  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 06m | Avg: 22m 15s | Max: 23m 15s | Hits:  99%/3654  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 32s | Max: 27m 14s | Hits:  73%/3654  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 08m | Avg:  1h 08m | Max:  1h 08m | Hits:  22%/1218  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 42m | Avg:  1h 05m | Max:  1h 18m | Hits:  20%/23591 
      🟩 20                 Pass: 100%/25  | Total: 21h 03m | Avg: 50m 33s | Max:  1h 27m | Hits:  46%/30023 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 22h 13m | Avg: 29m 38s | Max: 1h 00m | Hits: 77%/79956

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 37m 17s | Avg: 18m 38s | Max: 25m 59s | Hits:  88%/3556  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 21h 21m | Avg: 29m 47s | Max:  1h 00m | Hits:  77%/76401 
      🟩 arm64              Pass: 100%/2   | Total: 52m 50s | Avg: 26m 25s | Max: 28m 19s | Hits:  76%/3555  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 55m | Avg: 35m 11s | Max:  1h 00m | Hits:  72%/8881  
      🟩 12.5               Pass: 100%/2   | Total:  1h 34m | Avg: 47m 12s | Max: 48m 22s | Hits:  63%/3554  
      🟩 12.8               Pass: 100%/38  | Total: 17h 43m | Avg: 27m 59s | Max: 57m 48s | Hits:  78%/67521 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 49m 50s | Avg: 24m 55s | Max: 26m 14s | Hits:  76%/3554  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 55m | Avg: 35m 11s | Max:  1h 00m | Hits:  72%/8881  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 34m | Avg: 47m 12s | Max: 48m 22s | Hits:  63%/3554  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 16h 53m | Avg: 28m 09s | Max: 57m 48s | Hits:  78%/63967 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 49m 50s | Avg: 24m 55s | Max: 26m 14s | Hits:  76%/3554  
      🟩 nvcc               Pass: 100%/43  | Total: 21h 24m | Avg: 29m 51s | Max:  1h 00m | Hits:  77%/76402 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 58m | Avg: 29m 31s | Max: 32m 09s | Hits:  77%/7108  
      🟩 Clang15            Pass: 100%/2   | Total: 59m 20s | Avg: 29m 40s | Max: 31m 42s | Hits:  76%/3554  
      🟩 Clang16            Pass: 100%/2   | Total: 57m 19s | Avg: 28m 39s | Max: 30m 28s | Hits:  76%/3554  
      🟩 Clang17            Pass: 100%/2   | Total: 56m 04s | Avg: 28m 02s | Max: 28m 16s | Hits:  76%/3554  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 29m | Avg: 21m 24s | Max: 30m 12s | Hits:  83%/12439 
      🟩 GCC7               Pass: 100%/2   | Total: 57m 47s | Avg: 28m 53s | Max: 29m 35s | Hits:  76%/3556  
      🟩 GCC8               Pass: 100%/1   | Total: 29m 06s | Avg: 29m 06s | Max: 29m 06s | Hits:  76%/1778  
      🟩 GCC9               Pass: 100%/2   | Total: 58m 58s | Avg: 29m 29s | Max: 30m 37s | Hits:  76%/3556  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 01m | Avg: 30m 59s | Max: 31m 47s | Hits:  76%/3556  
      🟩 GCC11              Pass: 100%/2   | Total: 58m 08s | Avg: 29m 04s | Max: 30m 38s | Hits:  76%/3556  
      🟩 GCC12              Pass: 100%/2   | Total:  1h 05m | Avg: 32m 47s | Max: 34m 07s | Hits:  76%/3556  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 27m | Avg: 20m 43s | Max: 33m 33s | Hits:  86%/17780 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 55m | Avg: 57m 49s | Max:  1h 00m | Hits:  55%/3542  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 24m | Avg: 48m 08s | Max: 57m 48s | Hits:  60%/5313  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 12s | Max: 48m 22s | Hits:  63%/3554  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  7h 20m | Avg: 25m 55s | Max: 32m 09s | Hits:  79%/30209 
      🟩 GCC                Pass: 100%/21  | Total:  8h 58m | Avg: 25m 39s | Max: 34m 07s | Hits:  81%/37338 
      🟩 MSVC               Pass: 100%/5   | Total:  4h 20m | Avg: 52m 00s | Max:  1h 00m | Hits:  58%/8855  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 34m | Avg: 47m 12s | Max: 48m 22s | Hits:  63%/3554  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 28m 07s | Avg: 14m 03s | Max: 16m 31s | Hits:  88%/3556  
      🟩 rtx2080            Pass: 100%/33  | Total: 18h 02m | Avg: 32m 48s | Max:  1h 00m | Hits:  74%/58637 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 42m | Avg: 22m 17s | Max: 57m 48s | Hits:  85%/17763 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 20h 40m | Avg: 32m 38s | Max:  1h 00m | Hits:  73%/67519 
      🟩 TestCPU            Pass: 100%/3   | Total: 48m 40s | Avg: 16m 13s | Max: 33m 21s | Hits:  90%/5326  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 44s | Avg: 11m 11s | Max: 11m 36s | Hits:  99%/7111  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 28m 07s | Avg: 14m 03s | Max: 16m 31s | Hits:  88%/3556  
      🟩 90;90a;100         Pass: 100%/1   | Total: 33m 33s | Avg: 33m 33s | Max: 33m 33s | Hits:  76%/1778  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 11h 25m | Avg: 34m 16s | Max:  1h 00m | Hits:  73%/35531 
      🟩 20                 Pass: 100%/23  | Total: 10h 11m | Avg: 26m 34s | Max: 57m 48s | Hits:  80%/40869 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 16m 58s | Avg: 8m 29s | Max: 14m 31s | Hits: 96%/320

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 16m 58s | Avg:  8m 29s | Max: 14m 31s | Hits:  96%/320   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 27s | Avg:  2m 27s | Max:  2m 27s | Hits:  94%/160   
      🟩 Test               Pass: 100%/1   | Total: 14m 31s | Avg: 14m 31s | Max: 14m 31s | Hits:  98%/160   
    
  • 🟩 python: Pass: 100%/1 | Total: 1h 01m | Avg: 1h 01m | Max: 1h 01m

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit facefba into NVIDIA:main Mar 11, 2025
106 of 108 checks passed
@github-project-automation github-project-automation bot moved this from In Review to Done in CCCL Mar 11, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Mar 12, 2025
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
bernhardmgruber added a commit to bernhardmgruber/cccl that referenced this pull request Mar 13, 2025
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
Co-authored-by: Michael Schellenberger Costa <miscco@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

3 participants