Skip to content

Conversation

@bernhardmgruber
Copy link
Contributor

No description provided.

@bernhardmgruber bernhardmgruber requested a review from a team as a code owner February 25, 2025 16:55
@bernhardmgruber bernhardmgruber added cub For all items related to CUB breaking Breaking change labels Feb 25, 2025
@bernhardmgruber bernhardmgruber enabled auto-merge (squash) February 25, 2025 17:23
@github-actions
Copy link
Contributor

🟩 CI finished in 1h 50m: Pass: 100%/93 | Total: 2d 16h | Avg: 41m 33s | Max: 1h 24m | Hits: 58%/133929
  • 🟩 cub: Pass: 100%/45 | Total: 1d 18h | Avg: 56m 24s | Max: 1h 24m | Hits: 31%/53485

    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total:  1d 16h | Avg: 56m 07s | Max:  1h 24m | Hits:  31%/51055 
      🟩 arm64              Pass: 100%/2   | Total:  2h 04m | Avg:  1h 02m | Max:  1h 02m | Hits:  16%/2430  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  5h 22m | Avg:  1h 04m | Max:  1h 08m | Hits:  15%/5908  
      🟩 12.5               Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  12%/2248  
      🟩 12.8               Pass: 100%/38  | Total:  1d 10h | Avg: 54m 38s | Max:  1h 24m | Hits:  34%/45329 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  15%/2100  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  5h 22m | Avg:  1h 04m | Max:  1h 08m | Hits:  15%/5908  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  12%/2248  
      🟩 nvcc12.8           Pass: 100%/36  | Total:  1d 08h | Avg: 54m 05s | Max:  1h 24m | Hits:  34%/43229 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 07m | Hits:  15%/2100  
      🟩 nvcc               Pass: 100%/43  | Total:  1d 16h | Avg: 56m 01s | Max:  1h 24m | Hits:  31%/51385 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  4h 05m | Avg:  1h 01m | Max:  1h 03m | Hits:  17%/4868  
      🟩 Clang15            Pass: 100%/2   | Total:  2h 10m | Avg:  1h 05m | Max:  1h 06m | Hits:  16%/2430  
      🟩 Clang16            Pass: 100%/2   | Total:  2h 03m | Avg:  1h 01m | Max:  1h 03m | Hits:  16%/2430  
      🟩 Clang17            Pass: 100%/2   | Total:  2h 02m | Avg:  1h 01m | Max:  1h 01m | Hits:  16%/2430  
      🟩 Clang18            Pass: 100%/7   | Total:  5h 56m | Avg: 50m 51s | Max:  1h 07m | Hits:  41%/8175  
      🟩 GCC7               Pass: 100%/2   | Total:  2h 11m | Avg:  1h 05m | Max:  1h 06m | Hits:  16%/2434  
      🟩 GCC8               Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:  16%/1217  
      🟩 GCC9               Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  16%/2434  
      🟩 GCC10              Pass: 100%/2   | Total:  1h 58m | Avg: 59m 23s | Max: 59m 30s | Hits:  16%/2434  
      🟩 GCC11              Pass: 100%/2   | Total:  2h 09m | Avg:  1h 04m | Max:  1h 06m | Hits:  16%/2430  
      🟩 GCC12              Pass: 100%/2   | Total:  2h 05m | Avg:  1h 02m | Max:  1h 02m | Hits:  16%/2430  
      🟩 GCC13              Pass: 100%/11  | Total:  6h 59m | Avg: 38m 09s | Max:  1h 14m | Hits:  61%/13365 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 22m | Avg:  1h 11m | Max:  1h 14m | Hits:  12%/2080  
      🟩 MSVC14.42          Pass: 100%/2   | Total:  2h 40m | Avg:  1h 20m | Max:  1h 24m | Hits:  12%/2080  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  12%/2248  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total: 16h 17m | Avg: 57m 31s | Max:  1h 07m | Hits:  26%/20333 
      🟩 GCC                Pass: 100%/22  | Total: 18h 37m | Avg: 50m 46s | Max:  1h 14m | Hits:  39%/26744 
      🟩 MSVC               Pass: 100%/4   | Total:  5h 03m | Avg:  1h 15m | Max:  1h 24m | Hits:  12%/4160  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  12%/2248  
    🟩 gpu
      🟩 h100               Pass: 100%/3   | Total:  1h 13m | Avg: 24m 33s | Max: 29m 21s | Hits:  71%/3645  
      🟩 rtx2080            Pass: 100%/34  | Total:  1d 12h | Avg:  1h 05m | Max:  1h 24m | Hits:  15%/40120 
      🟩 rtxa6000           Pass: 100%/8   | Total:  4h 14m | Avg: 31m 45s | Max:  1h 06m | Hits:  78%/9720  
    🟩 jobs
      🟩 Build              Pass: 100%/37  | Total:  1d 15h | Avg:  1h 03m | Max:  1h 24m | Hits:  15%/43765 
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 21m 09s | Avg: 21m 09s | Max: 21m 09s | Hits:  99%/1215  
      🟩 GraphCapture       Pass: 100%/1   | Total: 17m 03s | Avg: 17m 03s | Max: 17m 03s | Hits:  99%/1215  
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 09m | Avg: 23m 14s | Max: 23m 40s | Hits:  99%/3645  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 04m | Avg: 21m 21s | Max: 21m 34s | Hits:  99%/3645  
    🟩 sm
      🟩 90                 Pass: 100%/3   | Total:  1h 13m | Avg: 24m 33s | Max: 29m 21s | Hits:  71%/3645  
      🟩 90;90a;100         Pass: 100%/1   | Total:  1h 14m | Avg:  1h 14m | Max:  1h 14m | Hits:  16%/1215  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 21h 38m | Avg:  1h 04m | Max:  1h 24m | Hits:  15%/23535 
      🟩 20                 Pass: 100%/25  | Total: 20h 39m | Avg: 49m 34s | Max:  1h 16m | Hits:  43%/29950 
    
  • 🟩 thrust: Pass: 100%/45 | Total: 21h 00m | Avg: 28m 00s | Max: 53m 34s | Hits: 75%/80136

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 34m 19s | Avg: 17m 09s | Max: 23m 11s | Hits:  87%/3564  
    🟩 cpu
      🟩 amd64              Pass: 100%/43  | Total: 20h 07m | Avg: 28m 05s | Max: 53m 34s | Hits:  76%/76573 
      🟩 arm64              Pass: 100%/2   | Total: 52m 35s | Avg: 26m 17s | Max: 27m 53s | Hits:  75%/3563  
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  2h 43m | Avg: 32m 37s | Max: 50m 13s | Hits:  71%/8901  
      🟩 12.5               Pass: 100%/2   | Total:  1h 34m | Avg: 47m 10s | Max: 47m 40s | Hits:  58%/3562  
      🟩 12.8               Pass: 100%/38  | Total: 16h 42m | Avg: 26m 23s | Max: 53m 34s | Hits:  77%/67673 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 46m 28s | Avg: 23m 14s | Max: 24m 06s | Hits:  75%/3562  
      🟩 nvcc12.0           Pass: 100%/5   | Total:  2h 43m | Avg: 32m 37s | Max: 50m 13s | Hits:  71%/8901  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 34m | Avg: 47m 10s | Max: 47m 40s | Hits:  58%/3562  
      🟩 nvcc12.8           Pass: 100%/36  | Total: 15h 56m | Avg: 26m 34s | Max: 53m 34s | Hits:  77%/64111 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 46m 28s | Avg: 23m 14s | Max: 24m 06s | Hits:  75%/3562  
      🟩 nvcc               Pass: 100%/43  | Total: 20h 13m | Avg: 28m 13s | Max: 53m 34s | Hits:  76%/76574 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  1h 50m | Avg: 27m 34s | Max: 29m 21s | Hits:  75%/7124  
      🟩 Clang15            Pass: 100%/2   | Total: 53m 22s | Avg: 26m 41s | Max: 26m 47s | Hits:  75%/3562  
      🟩 Clang16            Pass: 100%/2   | Total: 56m 09s | Avg: 28m 04s | Max: 29m 31s | Hits:  75%/3562  
      🟩 Clang17            Pass: 100%/2   | Total: 55m 52s | Avg: 27m 56s | Max: 29m 17s | Hits:  75%/3562  
      🟩 Clang18            Pass: 100%/7   | Total:  2h 22m | Avg: 20m 19s | Max: 27m 57s | Hits:  82%/12467 
      🟩 GCC7               Pass: 100%/2   | Total: 55m 24s | Avg: 27m 42s | Max: 28m 28s | Hits:  75%/3564  
      🟩 GCC8               Pass: 100%/1   | Total: 27m 24s | Avg: 27m 24s | Max: 27m 24s | Hits:  75%/1782  
      🟩 GCC9               Pass: 100%/2   | Total: 55m 59s | Avg: 27m 59s | Max: 28m 20s | Hits:  75%/3564  
      🟩 GCC10              Pass: 100%/2   | Total: 55m 08s | Avg: 27m 34s | Max: 27m 39s | Hits:  75%/3564  
      🟩 GCC11              Pass: 100%/2   | Total: 56m 29s | Avg: 28m 14s | Max: 28m 43s | Hits:  75%/3564  
      🟩 GCC12              Pass: 100%/2   | Total: 58m 12s | Avg: 29m 06s | Max: 29m 40s | Hits:  75%/3564  
      🟩 GCC13              Pass: 100%/10  | Total:  3h 23m | Avg: 20m 23s | Max: 34m 02s | Hits:  85%/17820 
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 37m | Avg: 48m 52s | Max: 50m 13s | Hits:  54%/3550  
      🟩 MSVC14.42          Pass: 100%/3   | Total:  2h 17m | Avg: 45m 55s | Max: 53m 34s | Hits:  60%/5325  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 10s | Max: 47m 40s | Hits:  58%/3562  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/17  | Total:  6h 58m | Avg: 24m 35s | Max: 29m 31s | Hits:  78%/30277 
      🟩 GCC                Pass: 100%/21  | Total:  8h 32m | Avg: 24m 24s | Max: 34m 02s | Hits:  80%/37422 
      🟩 MSVC               Pass: 100%/5   | Total:  3h 55m | Avg: 47m 06s | Max: 53m 34s | Hits:  58%/8875  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 34m | Avg: 47m 10s | Max: 47m 40s | Hits:  58%/3562  
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 16m 28s | Hits:  87%/3564  
      🟩 rtx2080            Pass: 100%/33  | Total: 16h 57m | Avg: 30m 50s | Max: 53m 34s | Hits:  72%/58769 
      🟩 rtx4090            Pass: 100%/10  | Total:  3h 34m | Avg: 21m 28s | Max: 52m 03s | Hits:  85%/17803 
    🟩 jobs
      🟩 Build              Pass: 100%/38  | Total: 19h 28m | Avg: 30m 44s | Max: 53m 34s | Hits:  72%/67671 
      🟩 TestCPU            Pass: 100%/3   | Total: 47m 32s | Avg: 15m 50s | Max: 32m 10s | Hits:  90%/5338  
      🟩 TestGPU            Pass: 100%/4   | Total: 44m 38s | Avg: 11m 09s | Max: 11m 51s | Hits:  99%/7127  
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 27m 56s | Avg: 13m 58s | Max: 16m 28s | Hits:  87%/3564  
      🟩 90;90a;100         Pass: 100%/1   | Total: 34m 02s | Avg: 34m 02s | Max: 34m 02s | Hits:  75%/1782  
    🟩 std
      🟩 17                 Pass: 100%/20  | Total: 10h 35m | Avg: 31m 45s | Max: 53m 34s | Hits:  71%/35611 
      🟩 20                 Pass: 100%/23  | Total:  9h 50m | Avg: 25m 41s | Max: 52m 03s | Hits:  78%/40961 
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 15m 25s | Avg: 7m 42s | Max: 12m 45s | Hits: 97%/308

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 ctk
      🟩 12.8               Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 gpu
      🟩 rtx2080            Pass: 100%/2   | Total: 15m 25s | Avg:  7m 42s | Max: 12m 45s | Hits:  97%/308   
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s | Hits:  95%/154   
      🟩 Test               Pass: 100%/1   | Total: 12m 45s | Avg: 12m 45s | Max: 12m 45s | Hits:  98%/154   
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 ctk
      🟩 12.8               Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 cudacxx
      🟩 nvcc12.8           Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 gpu
      🟩 rtx2080            Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 51s | Avg: 50m 51s | Max: 50m 51s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 93)

# Runner
66 linux-amd64-cpu16
9 windows-amd64-cpu16
6 linux-amd64-gpu-rtxa6000-latest-1
4 linux-arm64-cpu16
3 linux-amd64-gpu-h100-latest-1
3 linux-amd64-gpu-rtx4090-latest-1
2 linux-amd64-gpu-rtx2080-latest-1

@bernhardmgruber bernhardmgruber merged commit 512f674 into NVIDIA:main Feb 25, 2025
107 of 110 checks passed
@bernhardmgruber bernhardmgruber deleted the drop_scaling branch February 25, 2025 20:25
davebayer pushed a commit to davebayer/cccl that referenced this pull request Apr 7, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

breaking Breaking change cub For all items related to CUB

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

3 participants