Skip to content

Conversation

@fbusato
Copy link
Contributor

@fbusato fbusato commented Jan 1, 2025

Fixes #2845, #2391

Description

ceil_div returns the resulting type of the operation and has been optimized for CUDA

Features

  • Add concept-like macro
  • Use common type for the output
  • Handle enumerator for a, b, and both
  • Document performance considerations

#### DO NOT MERGE

  • require C++17
  • breaking change in the API

@fbusato fbusato requested review from a team as code owners January 1, 2025 01:44
@fbusato fbusato requested review from alliepiper and wmaxey January 1, 2025 01:44
Copy link
Contributor

@miscco miscco left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I find the previous implementation much simpler, please keep signed and unsigned separate

@fbusato
Copy link
Contributor Author

fbusato commented Jan 2, 2025

I find the previous implementation much simpler, please keep signed and unsigned separate

I don't quite agree...With two separate functions we need to duplicate 15 lines of code, which is not great.

template <class _Tp,
          class _Up,
          _CUDA_VSTD::enable_if_t<_CCCL_TRAIT(_CUDA_VSTD::is_integral, _Tp), int> = 0,
          _CUDA_VSTD::enable_if_t<_CCCL_TRAIT(_CUDA_VSTD::is_integral, _Up), int> = 0>
_CCCL_NODISCARD _LIBCUDACXX_HIDE_FROM_ABI _CCCL_CONSTEXPR_CXX14 decltype(_Tp{} / _Up{})
ceil_div(const _Tp __a, const _Up __b) noexcept
{
  _CCCL_ASSERT(__b > _Up{0}, "cuda::ceil_div: b must be positive");
  using _Common  = decltype(_Tp{} / _Up{});
  using _UCommon = _CUDA_VSTD::make_unsigned_t<_Common>;
  if constexpr (_CUDA_VSTD::is_signed_v<_Tp>)
  {
    _CCCL_ASSERT(__a >= _Tp{0}, "cuda::ceil_div: a must be non negative");
  }
  auto __a1 = static_cast<_UCommon>(__a);
  auto __b1 = static_cast<_UCommon>(__b);

@github-actions
Copy link
Contributor

github-actions bot commented Jan 2, 2025

🟨 CI finished in 2h 02m: Pass: 79%/170 | Total: 3d 02h | Avg: 26m 16s | Max: 1h 23m | Hits: 36%/17647
  • 🟨 libcudacxx: Pass: 72%/48 | Total: 15h 01m | Avg: 18m 46s | Max: 1h 23m | Hits: 30%/7578

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  71%/46  | Total: 14h 17m | Avg: 18m 38s | Max:  1h 23m | Hits:  30%/7578  
      🟩 arm64              Pass: 100%/2   | Total: 43m 46s | Avg: 21m 53s | Max: 22m 50s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 24s | Max: 22m 43s
      🔍 nvcc               Pass:  70%/44  | Total: 13h 51m | Avg: 18m 53s | Max:  1h 23m | Hits:  30%/7578  
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 58m 15s | Avg:  8m 19s | Max: 17m 41s
      🟩 12.5               Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
      🟨 12.6               Pass:  84%/39  | Total: 13h 13m | Avg: 20m 20s | Max:  1h 23m | Hits:  30%/7578  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 24s | Max: 22m 43s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 58m 15s | Avg:  8m 19s | Max: 17m 41s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
      🟨 nvcc12.6           Pass:  82%/35  | Total: 12h 03m | Avg: 20m 40s | Max:  1h 23m | Hits:  30%/7578  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 26m 32s | Avg:  6m 38s | Max: 17m 41s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 45s | Avg:  4m 45s | Max:  4m 45s
      🟩 Clang11            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 03s | Avg:  4m 03s | Max:  4m 03s
      🟩 Clang13            Pass: 100%/1   | Total: 21m 21s | Avg: 21m 21s | Max: 21m 21s
      🟩 Clang14            Pass: 100%/1   | Total: 22m 42s | Avg: 22m 42s | Max: 22m 42s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
      🟩 Clang16            Pass: 100%/1   | Total: 21m 34s | Avg: 21m 34s | Max: 21m 34s
      🟩 Clang17            Pass: 100%/1   | Total: 23m 37s | Avg: 23m 37s | Max: 23m 37s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 44m | Avg: 20m 31s | Max: 44m 04s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 39s | Avg:  1m 49s | Max:  1m 50s
      🟥 GCC7               Pass:   0%/2   | Total: 20m 01s | Avg: 10m 00s | Max: 17m 55s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟨 GCC9               Pass:  33%/3   | Total: 40m 54s | Avg: 13m 38s | Max: 22m 57s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 18s | Avg:  4m 18s | Max:  4m 18s
      🟩 GCC11              Pass: 100%/1   | Total: 23m 35s | Avg: 23m 35s | Max: 23m 35s
      🟩 GCC12              Pass: 100%/1   | Total: 22m 31s | Avg: 22m 31s | Max: 22m 31s
      🟨 GCC13              Pass:  80%/10  | Total:  4h 44m | Avg: 28m 28s | Max:  1h 23m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 22m 52s | Avg: 22m 52s | Max: 22m 52s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 16m 54s | Avg: 16m 54s | Max: 16m 54s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 37m 57s | Avg: 37m 57s | Max: 37m 57s | Hits:  30%/2477  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 13m | Avg: 36m 30s | Max: 37m 15s | Hits:  29%/5101  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  4h 57m | Avg: 14m 52s | Max: 44m 04s
      🟨 GCC                Pass:  61%/21  | Total:  6h 43m | Avg: 19m 12s | Max:  1h 23m
      🟩 Intel              Pass: 100%/1   | Total: 22m 52s | Avg: 22m 52s | Max: 22m 52s
      🟨 MSVC               Pass:  75%/4   | Total:  2h 07m | Avg: 31m 58s | Max: 37m 57s | Hits:  30%/7578  
      🟩 NVHPC              Pass: 100%/2   | Total: 49m 28s | Avg: 24m 44s | Max: 34m 40s
    🟨 jobs
      🟨 Build              Pass:  73%/41  | Total: 10h 46m | Avg: 15m 46s | Max: 37m 57s | Hits:  30%/7578  
      🟨 NVRTC              Pass:  50%/4   | Total:  2h 05m | Avg: 31m 23s | Max: 38m 59s
      🟩 Test               Pass: 100%/2   | Total:  2h 07m | Avg:  1h 03m | Max:  1h 23m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 54s | Avg:  1m 54s | Max:  1m 54s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 42m 32s | Avg:  7m 05s | Max: 32m 46s
      🟥 14                 Pass:   0%/5   | Total:  1h 05m | Avg: 13m 00s | Max: 23m 40s
      🟨 17                 Pass:  84%/13  | Total:  4h 29m | Avg: 20m 46s | Max: 37m 57s | Hits:  30%/4954  
      🟩 20                 Pass: 100%/23  | Total:  8h 41m | Avg: 22m 41s | Max:  1h 23m | Hits:  29%/2624  
    🟨 gpu
      🟨 v100               Pass:  72%/48  | Total: 15h 01m | Avg: 18m 46s | Max:  1h 23m | Hits:  30%/7578  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 11s | Avg: 13m 11s | Max: 13m 11s
      🟩 90a                Pass: 100%/2   | Total: 20m 56s | Avg: 10m 28s | Max: 13m 09s
    
  • 🟨 cub: Pass: 76%/47 | Total: 1d 07h | Avg: 40m 03s | Max: 1h 13m | Hits: 27%/2349

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/45  | Total:  1d 05h | Avg: 39m 07s | Max:  1h 13m | Hits:  27%/2349  
      🟩 arm64              Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 06m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🔍 nvcc               Pass:  75%/45  | Total:  1d 05h | Avg: 39m 08s | Max:  1h 13m | Hits:  27%/2349  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 50m 10s | Avg: 25m 05s | Max: 27m 26s
      🔍 v100               Pass:  75%/45  | Total:  1d 06h | Avg: 40m 43s | Max:  1h 13m | Hits:  27%/2349  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total:  1d 04h | Avg: 42m 24s | Max:  1h 13m | Hits:  27%/2349  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 45m 00s | Avg: 45m 00s | Max: 45m 00s
      🟩 GraphCapture       Pass: 100%/1   | Total: 20m 38s | Avg: 20m 38s | Max: 20m 38s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 02m | Avg: 20m 54s | Max: 22m 44s
      🟩 TestGPU            Pass: 100%/2   | Total: 57m 57s | Avg: 28m 58s | Max: 29m 03s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 33m 37s | Avg:  4m 48s | Max: 18m 21s
      🟩 12.5               Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
      🟨 12.6               Pass:  89%/38  | Total:  1d 04h | Avg: 44m 48s | Max:  1h 06m | Hits:  27%/2349  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 01m | Avg:  1h 00m | Max:  1h 04m
      🟥 nvcc11.1           Pass:   0%/7   | Total: 33m 37s | Avg:  4m 48s | Max: 18m 21s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
      🟨 nvcc12.6           Pass:  88%/36  | Total:  1d 02h | Avg: 43m 55s | Max:  1h 06m | Hits:  27%/2349  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 14s | Avg:  3m 03s | Max:  3m 14s
      🟩 Clang10            Pass: 100%/1   | Total: 58m 01s | Avg: 58m 01s | Max: 58m 01s
      🟩 Clang11            Pass: 100%/1   | Total: 59m 56s | Avg: 59m 56s | Max: 59m 56s
      🟩 Clang12            Pass: 100%/1   | Total: 52m 27s | Avg: 52m 27s | Max: 52m 27s
      🟩 Clang13            Pass: 100%/1   | Total: 53m 29s | Avg: 53m 29s | Max: 53m 29s
      🟩 Clang14            Pass: 100%/1   | Total: 53m 47s | Avg: 53m 47s | Max: 53m 47s
      🟩 Clang15            Pass: 100%/1   | Total: 57m 41s | Avg: 57m 41s | Max: 57m 41s
      🟩 Clang16            Pass: 100%/1   | Total: 53m 41s | Avg: 53m 41s | Max: 53m 41s
      🟩 Clang17            Pass: 100%/1   | Total: 55m 59s | Avg: 55m 59s | Max: 55m 59s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 35m | Avg: 47m 52s | Max:  1h 04m
      🟥 GCC6               Pass:   0%/2   | Total:  4m 06s | Avg:  2m 03s | Max:  2m 04s
      🟥 GCC7               Pass:   0%/2   | Total:  5m 53s | Avg:  2m 56s | Max:  2m 57s
      🟩 GCC8               Pass: 100%/1   | Total: 58m 49s | Avg: 58m 49s | Max: 58m 49s
      🟨 GCC9               Pass:  33%/3   | Total: 59m 09s | Avg: 19m 43s | Max: 53m 48s
      🟩 GCC10              Pass: 100%/1   | Total: 56m 43s | Avg: 56m 43s | Max: 56m 43s
      🟩 GCC11              Pass: 100%/1   | Total: 58m 25s | Avg: 58m 25s | Max: 58m 25s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 45m | Avg: 35m 01s | Max: 54m 53s
      🟩 GCC13              Pass: 100%/8   | Total:  5h 23m | Avg: 40m 23s | Max:  1h 06m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟥 MSVC14.16          Pass:   0%/1   | Total: 18m 21s | Avg: 18m 21s | Max: 18m 21s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m | Hits:  28%/783   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 08m | Avg:  1h 04m | Max:  1h 04m | Hits:  27%/1566  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 12m | Avg: 41m 42s | Max:  1h 04m
      🟨 GCC                Pass:  71%/21  | Total: 11h 11m | Avg: 31m 57s | Max:  1h 06m
      🟩 Intel              Pass: 100%/1   | Total:  1h 01m | Avg:  1h 01m | Max:  1h 01m
      🟨 MSVC               Pass:  75%/4   | Total:  3h 31m | Avg: 52m 52s | Max:  1h 04m | Hits:  27%/2349  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 26m | Avg:  1h 13m | Max:  1h 13m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 13m 41s | Avg:  2m 44s | Max:  3m 14s
      🟥 14                 Pass:   0%/4   | Total: 26m 33s | Avg:  6m 38s | Max: 18m 21s
      🟨 17                 Pass:  83%/12  | Total: 10h 16m | Avg: 51m 21s | Max:  1h 13m | Hits:  28%/1566  
      🟩 20                 Pass: 100%/26  | Total: 20h 26m | Avg: 47m 09s | Max:  1h 12m | Hits:  27%/783   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 50m 10s | Avg: 25m 05s | Max: 27m 26s
      🟩 90a                Pass: 100%/1   | Total: 27m 02s | Avg: 27m 02s | Max: 27m 02s
    
  • 🟨 thrust: Pass: 76%/46 | Total: 1d 00h | Avg: 32m 06s | Max: 1h 15m | Hits: 43%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/44  | Total: 23h 21m | Avg: 31m 50s | Max:  1h 15m | Hits:  43%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 59s | Max: 41m 06s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 38m 30s
      🔍 nvcc               Pass:  75%/44  | Total: 23h 27m | Avg: 31m 58s | Max:  1h 15m | Hits:  43%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total: 22h 23m | Avg: 33m 35s | Max:  1h 15m | Hits:  24%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 38m 05s | Avg: 12m 41s | Max: 21m 56s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 35m | Avg: 31m 42s | Max: 54m 30s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 36m 15s | Avg:  5m 10s | Max: 25m 00s
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
      🟨 12.6               Pass:  89%/37  | Total: 21h 37m | Avg: 35m 03s | Max:  1h 12m | Hits:  43%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 09m | Avg: 34m 56s | Max: 38m 30s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 36m 15s | Avg:  5m 10s | Max: 25m 00s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
      🟨 nvcc12.6           Pass:  88%/35  | Total: 20h 27m | Avg: 35m 04s | Max:  1h 12m | Hits:  43%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 43s | Avg:  2m 10s | Max:  2m 26s
      🟩 Clang10            Pass: 100%/1   | Total: 39m 37s | Avg: 39m 37s | Max: 39m 37s
      🟩 Clang11            Pass: 100%/1   | Total: 36m 48s | Avg: 36m 48s | Max: 36m 48s
      🟩 Clang12            Pass: 100%/1   | Total: 36m 20s | Avg: 36m 20s | Max: 36m 20s
      🟩 Clang13            Pass: 100%/1   | Total: 34m 48s | Avg: 34m 48s | Max: 34m 48s
      🟩 Clang14            Pass: 100%/1   | Total: 41m 57s | Avg: 41m 57s | Max: 41m 57s
      🟩 Clang15            Pass: 100%/1   | Total: 43m 30s | Avg: 43m 30s | Max: 43m 30s
      🟩 Clang16            Pass: 100%/1   | Total: 40m 17s | Avg: 40m 17s | Max: 40m 17s
      🟩 Clang17            Pass: 100%/1   | Total: 40m 59s | Avg: 40m 59s | Max: 40m 59s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 32m | Avg: 30m 23s | Max: 40m 27s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 42s | Avg:  1m 51s | Max:  1m 54s
      🟥 GCC7               Pass:   0%/2   | Total:  4m 20s | Avg:  2m 10s | Max:  2m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 41m 50s | Avg: 41m 50s | Max: 41m 50s
      🟨 GCC9               Pass:  33%/3   | Total: 44m 23s | Avg: 14m 47s | Max: 40m 50s
      🟩 GCC10              Pass: 100%/1   | Total: 43m 10s | Avg: 43m 10s | Max: 43m 10s
      🟩 GCC11              Pass: 100%/1   | Total: 41m 24s | Avg: 41m 24s | Max: 41m 24s
      🟩 GCC12              Pass: 100%/1   | Total: 46m 46s | Avg: 46m 46s | Max: 46m 46s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 22m | Avg: 32m 49s | Max: 54m 30s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 25m 00s | Avg: 25m 00s | Max: 25m 00s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 07m | Avg:  1h 07m | Max:  1h 07m | Hits:  24%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 46m | Avg: 55m 38s | Max:  1h 12m | Hits:  49%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 55m | Avg: 28m 11s | Max: 43m 30s
      🟨 GCC                Pass:  68%/19  | Total:  8h 08m | Avg: 25m 41s | Max: 54m 30s
      🟩 Intel              Pass: 100%/1   | Total: 50m 01s | Avg: 50m 01s | Max: 50m 01s
      🟨 MSVC               Pass:  80%/5   | Total:  4h 19m | Avg: 51m 54s | Max:  1h 12m | Hits:  43%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 15m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 10m 05s | Avg:  2m 01s | Max:  2m 17s
      🟥 14                 Pass:   0%/4   | Total: 31m 23s | Avg:  7m 50s | Max: 25m 00s
      🟨 17                 Pass:  83%/12  | Total:  8h 10m | Avg: 40m 50s | Max:  1h 12m | Hits:  24%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 54m | Avg: 38m 54s | Max:  1h 15m | Hits:  62%/3704  
    🟨 gpu
      🟨 v100               Pass:  76%/46  | Total:  1d 00h | Avg: 32m 06s | Max:  1h 15m | Hits:  43%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 50m 54s | Avg: 25m 27s | Max: 35m 34s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 26m 46s | Avg: 26m 46s | Max: 26m 46s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 46m | Avg: 6m 25s | Max: 34m 19s | Hits: 90%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 32m | Avg:  6m 57s | Max: 34m 19s | Hits:  90%/312   
      🟩 arm64              Pass: 100%/4   | Total: 13m 55s | Avg:  3m 28s | Max:  3m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 16m 58s | Avg:  5m 39s | Max:  9m 53s | Hits:  91%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
      🟩 12.6               Pass: 100%/21  | Total:  2h 17m | Avg:  6m 33s | Max: 34m 19s | Hits:  90%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 16m 58s | Avg:  5m 39s | Max:  9m 53s | Hits:  91%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  2h 17m | Avg:  6m 33s | Max: 34m 19s | Hits:  90%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 46m | Avg:  6m 25s | Max: 34m 19s | Hits:  90%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 09s | Avg:  4m 09s | Max:  4m 09s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 49s | Avg:  3m 49s | Max:  3m 49s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 54s | Avg:  3m 54s | Max:  3m 54s
      🟩 Clang18            Pass: 100%/4   | Total: 34m 48s | Avg:  8m 42s | Max: 23m 59s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 23s | Avg:  3m 23s | Max:  3m 23s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 GCC12              Pass: 100%/2   | Total: 38m 27s | Avg: 19m 13s | Max: 34m 19s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 50s | Avg:  3m 27s | Max:  3m 41s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 53s | Avg:  9m 53s | Max:  9m 53s | Hits:  91%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 25s | Avg: 11m 25s | Max: 11m 25s | Hits:  90%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 10m | Avg:  5m 24s | Max: 23m 59s
      🟩 GCC                Pass: 100%/9   | Total:  1h 03m | Avg:  7m 01s | Max: 34m 19s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 18s | Avg: 10m 39s | Max: 11m 25s | Hits:  90%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 04s | Avg:  6m 02s | Max:  6m 15s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 46m | Avg:  6m 25s | Max: 34m 19s | Hits:  90%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 48m | Avg:  4m 31s | Max: 11m 25s | Hits:  90%/312   
      🟩 Test               Pass: 100%/2   | Total: 58m 18s | Avg: 29m 09s | Max: 34m 19s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
      🟩 90a                Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 22m 54s | Avg:  3m 49s | Max:  5m 49s
      🟩 20                 Pass: 100%/20  | Total:  2h 23m | Avg:  7m 11s | Max: 34m 19s | Hits:  90%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 55s | Avg: 4m 57s | Max: 7m 34s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 55s | Avg:  4m 57s | Max:  7m 34s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 21s | Avg:  2m 21s | Max:  2m 21s
      🟩 Test               Pass: 100%/1   | Total:  7m 34s | Avg:  7m 34s | Max:  7m 34s
    
  • 🟩 python: Pass: 100%/1 | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 27m 52s | Avg: 27m 52s | Max: 27m 52s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato self-assigned this Jan 2, 2025
@github-actions
Copy link
Contributor

github-actions bot commented Jan 8, 2025

🟨 CI finished in 2h 27m: Pass: 79%/170 | Total: 2d 18h | Avg: 23m 36s | Max: 1h 24m | Hits: 15%/17650
  • 🟨 libcudacxx: Pass: 72%/48 | Total: 8h 52m | Avg: 11m 05s | Max: 44m 54s | Hits: 11%/7590

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  71%/46  | Total:  8h 45m | Avg: 11m 24s | Max: 44m 54s | Hits:  11%/7590  
      🟩 arm64              Pass: 100%/2   | Total:  7m 00s | Avg:  3m 30s | Max:  3m 36s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 06m | Avg: 16m 30s | Max: 21m 29s
      🔍 nvcc               Pass:  70%/44  | Total:  7h 46m | Avg: 10m 35s | Max: 44m 54s | Hits:  11%/7590  
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 36m 01s | Avg:  5m 08s | Max: 21m 41s
      🟩 12.5               Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
      🟨 12.6               Pass:  84%/39  | Total:  7h 10m | Avg: 11m 01s | Max: 44m 54s | Hits:  11%/7590  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 06m | Avg: 16m 30s | Max: 21m 29s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 36m 01s | Avg:  5m 08s | Max: 21m 41s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
      🟨 nvcc12.6           Pass:  82%/35  | Total:  6h 04m | Avg: 10m 24s | Max: 44m 54s | Hits:  11%/7590  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 54s | Avg:  3m 13s | Max:  5m 04s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 42s | Avg:  4m 42s | Max:  4m 42s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang12            Pass: 100%/1   | Total:  4m 16s | Avg:  4m 16s | Max:  4m 16s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 05s | Avg:  4m 05s | Max:  4m 05s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 37s | Avg:  4m 37s | Max:  4m 37s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 31s | Avg:  4m 31s | Max:  4m 31s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 36m | Avg: 12m 02s | Max: 21m 29s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 42s | Avg:  1m 51s | Max:  2m 00s
      🟥 GCC7               Pass:   0%/2   | Total:  5m 27s | Avg:  2m 43s | Max:  3m 21s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟨 GCC9               Pass:  33%/3   | Total:  8m 41s | Avg:  2m 53s | Max:  3m 40s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 39s | Avg:  3m 39s | Max:  3m 39s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 50s | Avg:  3m 50s | Max:  3m 50s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟨 GCC13              Pass:  80%/10  | Total:  2h 02m | Avg: 12m 16s | Max: 26m 14s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 25m 33s | Avg: 25m 33s | Max: 25m 33s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 21m 41s | Avg: 21m 41s | Max: 21m 41s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 38m 36s | Avg: 38m 36s | Max: 38m 36s | Hits:  11%/2481  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  1h 20m | Avg: 40m 22s | Max: 44m 54s | Hits:  11%/5109  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  2h 23m | Avg:  7m 11s | Max: 21m 29s
      🟨 GCC                Pass:  61%/21  | Total:  2h 35m | Avg:  7m 24s | Max: 26m 14s
      🟩 Intel              Pass: 100%/1   | Total: 25m 33s | Avg: 25m 33s | Max: 25m 33s
      🟨 MSVC               Pass:  75%/4   | Total:  2h 21m | Avg: 35m 15s | Max: 44m 54s | Hits:  11%/7590  
      🟩 NVHPC              Pass: 100%/2   | Total:  1h 06m | Avg: 33m 03s | Max: 33m 39s
    🟨 jobs
      🟨 Build              Pass:  73%/41  | Total:  6h 46m | Avg:  9m 55s | Max: 44m 54s | Hits:  11%/7590  
      🟨 NVRTC              Pass:  50%/4   | Total:  1h 28m | Avg: 22m 11s | Max: 26m 14s
      🟩 Test               Pass: 100%/2   | Total: 34m 28s | Avg: 17m 14s | Max: 17m 53s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 58s | Avg:  1m 58s | Max:  1m 58s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 30m 42s | Avg:  5m 07s | Max: 20m 35s
      🟥 14                 Pass:   0%/5   | Total: 53m 16s | Avg: 10m 39s | Max: 21m 41s
      🟨 17                 Pass:  84%/13  | Total:  3h 25m | Avg: 15m 48s | Max: 38m 36s | Hits:  11%/4962  
      🟩 20                 Pass: 100%/23  | Total:  4h 00m | Avg: 10m 27s | Max: 44m 54s | Hits:  10%/2628  
    🟨 gpu
      🟨 v100               Pass:  72%/48  | Total:  8h 52m | Avg: 11m 05s | Max: 44m 54s | Hits:  11%/7590  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 12m 44s | Avg: 12m 44s | Max: 12m 44s
      🟩 90a                Pass: 100%/2   | Total: 17m 17s | Avg:  8m 38s | Max: 13m 22s
    
  • 🟨 cub: Pass: 76%/47 | Total: 1d 06h | Avg: 39m 34s | Max: 1h 11m | Hits: 2%/2340

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/45  | Total:  1d 05h | Avg: 38m 41s | Max:  1h 11m | Hits:   2%/2340  
      🟩 arm64              Pass: 100%/2   | Total:  1h 59m | Avg: 59m 33s | Max:  1h 02m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🔍 nvcc               Pass:  75%/45  | Total:  1d 04h | Avg: 38m 39s | Max:  1h 11m | Hits:   2%/2340  
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 41m 01s | Avg: 20m 30s | Max: 24m 53s
      🔍 v100               Pass:  75%/45  | Total:  1d 06h | Avg: 40m 25s | Max:  1h 11m | Hits:   2%/2340  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total:  1d 04h | Avg: 43m 05s | Max:  1h 11m | Hits:   2%/2340  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 27s | Avg: 18m 27s | Max: 18m 27s
      🟩 GraphCapture       Pass: 100%/1   | Total: 14m 27s | Avg: 14m 27s | Max: 14m 27s
      🟩 HostLaunch         Pass: 100%/3   | Total: 52m 58s | Avg: 17m 39s | Max: 18m 38s
      🟩 TestGPU            Pass: 100%/2   | Total: 50m 28s | Avg: 25m 14s | Max: 25m 45s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 34m 02s | Avg:  4m 51s | Max: 19m 19s
      🟩 12.5               Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
      🟨 12.6               Pass:  89%/38  | Total:  1d 04h | Avg: 44m 21s | Max:  1h 11m | Hits:   2%/2340  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 01m
      🟥 nvcc11.1           Pass:   0%/7   | Total: 34m 02s | Avg:  4m 51s | Max: 19m 19s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
      🟨 nvcc12.6           Pass:  88%/36  | Total:  1d 02h | Avg: 43m 28s | Max:  1h 11m | Hits:   2%/2340  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 20s | Avg:  3m 05s | Max:  3m 20s
      🟩 Clang10            Pass: 100%/1   | Total: 57m 21s | Avg: 57m 21s | Max: 57m 21s
      🟩 Clang11            Pass: 100%/1   | Total: 54m 25s | Avg: 54m 25s | Max: 54m 25s
      🟩 Clang12            Pass: 100%/1   | Total: 55m 48s | Avg: 55m 48s | Max: 55m 48s
      🟩 Clang13            Pass: 100%/1   | Total: 54m 40s | Avg: 54m 40s | Max: 54m 40s
      🟩 Clang14            Pass: 100%/1   | Total: 55m 17s | Avg: 55m 17s | Max: 55m 17s
      🟩 Clang15            Pass: 100%/1   | Total: 59m 36s | Avg: 59m 36s | Max: 59m 36s
      🟩 Clang16            Pass: 100%/1   | Total: 57m 02s | Avg: 57m 02s | Max: 57m 02s
      🟩 Clang17            Pass: 100%/1   | Total: 59m 11s | Avg: 59m 11s | Max: 59m 11s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 36m | Avg: 48m 06s | Max:  1h 01m
      🟥 GCC6               Pass:   0%/2   | Total:  3m 47s | Avg:  1m 53s | Max:  1m 56s
      🟥 GCC7               Pass:   0%/2   | Total:  6m 13s | Avg:  3m 06s | Max:  3m 13s
      🟩 GCC8               Pass: 100%/1   | Total: 55m 20s | Avg: 55m 20s | Max: 55m 20s
      🟨 GCC9               Pass:  33%/3   | Total:  1h 03m | Avg: 21m 10s | Max: 58m 18s
      🟩 GCC10              Pass: 100%/1   | Total: 57m 23s | Avg: 57m 23s | Max: 57m 23s
      🟩 GCC11              Pass: 100%/1   | Total: 57m 35s | Avg: 57m 35s | Max: 57m 35s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 45m | Avg: 35m 15s | Max:  1h 04m
      🟩 GCC13              Pass: 100%/8   | Total:  4h 43m | Avg: 35m 29s | Max:  1h 04m
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟥 MSVC14.16          Pass:   0%/1   | Total: 19m 19s | Avg: 19m 19s | Max: 19m 19s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 03m | Avg:  1h 03m | Max:  1h 03m | Hits:   2%/780   
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m | Hits:   2%/1560  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 22m | Avg: 42m 13s | Max:  1h 01m
      🟨 GCC                Pass:  71%/21  | Total: 10h 33m | Avg: 30m 09s | Max:  1h 04m
      🟩 Intel              Pass: 100%/1   | Total:  1h 00m | Avg:  1h 00m | Max:  1h 00m
      🟨 MSVC               Pass:  75%/4   | Total:  3h 43m | Avg: 55m 50s | Max:  1h 11m | Hits:   2%/2340  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 20m | Avg:  1h 10m | Max:  1h 11m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 13m 59s | Avg:  2m 47s | Max:  3m 16s
      🟥 14                 Pass:   0%/4   | Total: 27m 30s | Avg:  6m 52s | Max: 19m 19s
      🟨 17                 Pass:  83%/12  | Total: 10h 11m | Avg: 50m 58s | Max:  1h 11m | Hits:   2%/1560  
      🟩 20                 Pass: 100%/26  | Total: 20h 06m | Avg: 46m 24s | Max:  1h 11m | Hits:   2%/780   
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 41m 01s | Avg: 20m 30s | Max: 24m 53s
      🟩 90a                Pass: 100%/1   | Total: 26m 42s | Avg: 26m 42s | Max: 26m 42s
    
  • 🟨 thrust: Pass: 76%/46 | Total: 1d 00h | Avg: 31m 19s | Max: 1h 24m | Hits: 25%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  75%/44  | Total: 22h 45m | Avg: 31m 01s | Max:  1h 24m | Hits:  25%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 15m | Avg: 37m 47s | Max: 40m 20s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 37m 55s
      🔍 nvcc               Pass:  75%/44  | Total: 22h 46m | Avg: 31m 03s | Max:  1h 24m | Hits:  25%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  72%/40  | Total: 22h 45m | Avg: 34m 08s | Max:  1h 24m | Hits:   0%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 37m 34s | Avg: 12m 31s | Max: 22m 27s | Hits:  99%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 37m 54s | Avg: 12m 38s | Max: 13m 21s
    🟨 ctk
      🟥 11.1               Pass:   0%/7   | Total: 35m 21s | Avg:  5m 03s | Max: 24m 16s
      🟩 12.5               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
      🟨 12.6               Pass:  89%/37  | Total: 21h 01m | Avg: 34m 05s | Max:  1h 24m | Hits:  25%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 14m | Avg: 37m 19s | Max: 37m 55s
      🟥 nvcc11.1           Pass:   0%/7   | Total: 35m 21s | Avg:  5m 03s | Max: 24m 16s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
      🟨 nvcc12.6           Pass:  88%/35  | Total: 19h 46m | Avg: 33m 53s | Max:  1h 24m | Hits:  25%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 20s | Avg:  2m 05s | Max:  2m 16s
      🟩 Clang10            Pass: 100%/1   | Total: 39m 02s | Avg: 39m 02s | Max: 39m 02s
      🟩 Clang11            Pass: 100%/1   | Total: 40m 12s | Avg: 40m 12s | Max: 40m 12s
      🟩 Clang12            Pass: 100%/1   | Total: 38m 47s | Avg: 38m 47s | Max: 38m 47s
      🟩 Clang13            Pass: 100%/1   | Total: 39m 00s | Avg: 39m 00s | Max: 39m 00s
      🟩 Clang14            Pass: 100%/1   | Total: 41m 32s | Avg: 41m 32s | Max: 41m 32s
      🟩 Clang15            Pass: 100%/1   | Total: 39m 02s | Avg: 39m 02s | Max: 39m 02s
      🟩 Clang16            Pass: 100%/1   | Total: 42m 09s | Avg: 42m 09s | Max: 42m 09s
      🟩 Clang17            Pass: 100%/1   | Total: 39m 00s | Avg: 39m 00s | Max: 39m 00s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 28m | Avg: 29m 50s | Max: 40m 19s
      🟥 GCC6               Pass:   0%/2   | Total:  3m 35s | Avg:  1m 47s | Max:  1m 59s
      🟥 GCC7               Pass:   0%/2   | Total:  4m 11s | Avg:  2m 05s | Max:  2m 15s
      🟩 GCC8               Pass: 100%/1   | Total: 35m 56s | Avg: 35m 56s | Max: 35m 56s
      🟨 GCC9               Pass:  33%/3   | Total: 43m 37s | Avg: 14m 32s | Max: 40m 01s
      🟩 GCC10              Pass: 100%/1   | Total: 40m 38s | Avg: 40m 38s | Max: 40m 38s
      🟩 GCC11              Pass: 100%/1   | Total: 44m 57s | Avg: 44m 57s | Max: 44m 57s
      🟩 GCC12              Pass: 100%/1   | Total: 42m 45s | Avg: 42m 45s | Max: 42m 45s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 41m | Avg: 27m 39s | Max: 44m 21s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total: 51m 09s | Avg: 51m 09s | Max: 51m 09s
      🟥 MSVC14.16          Pass:   0%/1   | Total: 24m 16s | Avg: 24m 16s | Max: 24m 16s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 11m | Avg:  1h 11m | Max:  1h 11m | Hits:   0%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 56m | Avg: 58m 55s | Max:  1h 24m | Hits:  33%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 55m | Avg: 28m 12s | Max: 42m 09s
      🟨 GCC                Pass:  68%/19  | Total:  7h 16m | Avg: 22m 59s | Max: 44m 57s
      🟩 Intel              Pass: 100%/1   | Total: 51m 09s | Avg: 51m 09s | Max: 51m 09s
      🟨 MSVC               Pass:  80%/5   | Total:  4h 32m | Avg: 54m 29s | Max:  1h 24m | Hits:  25%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 14m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total:  9m 56s | Avg:  1m 59s | Max:  2m 10s
      🟥 14                 Pass:   0%/4   | Total: 30m 23s | Avg:  7m 35s | Max: 24m 16s
      🟨 17                 Pass:  83%/12  | Total:  8h 22m | Avg: 41m 51s | Max:  1h 11m | Hits:   0%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 11m | Avg: 37m 02s | Max:  1h 24m | Hits:  49%/3704  
    🟨 gpu
      🟨 v100               Pass:  76%/46  | Total:  1d 00h | Avg: 31m 19s | Max:  1h 24m | Hits:  25%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 46m 21s | Avg: 23m 10s | Max: 34m 39s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 29m 40s | Avg: 29m 40s | Max: 29m 40s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 25m | Avg: 5m 34s | Max: 19m 40s | Hits: 9%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 11m | Avg:  5m 57s | Max: 19m 40s | Hits:   9%/312   
      🟩 arm64              Pass: 100%/4   | Total: 14m 02s | Avg:  3m 30s | Max:  3m 40s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 52s | Avg:  6m 17s | Max: 11m 49s | Hits:   9%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
      🟩 12.6               Pass: 100%/21  | Total:  1h 53m | Avg:  5m 24s | Max: 19m 40s | Hits:   9%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 52s | Avg:  6m 17s | Max: 11m 49s | Hits:   9%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 53m | Avg:  5m 24s | Max: 19m 40s | Hits:   9%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 25m | Avg:  5m 34s | Max: 19m 40s | Hits:   9%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 23s | Avg:  4m 23s | Max:  4m 23s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 42s | Avg:  3m 42s | Max:  3m 42s
      🟩 Clang13            Pass: 100%/1   | Total:  4m 02s | Avg:  4m 02s | Max:  4m 02s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 56s | Avg:  3m 56s | Max:  3m 56s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 19s | Avg:  7m 34s | Max: 19m 40s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 28s | Avg:  3m 28s | Max:  3m 28s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC12              Pass: 100%/2   | Total: 19m 10s | Avg:  9m 35s | Max: 15m 10s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 27s | Avg:  3m 21s | Max:  3m 40s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 49s | Avg: 11m 49s | Max: 11m 49s | Hits:   9%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 23s | Avg: 11m 23s | Max: 11m 23s | Hits:   9%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 05m | Avg:  5m 02s | Max: 19m 40s
      🟩 GCC                Pass: 100%/9   | Total: 43m 51s | Avg:  4m 52s | Max: 15m 10s
      🟩 MSVC               Pass: 100%/2   | Total: 23m 12s | Avg: 11m 36s | Max: 11m 49s | Hits:   9%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 31s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 25m | Avg:  5m 34s | Max: 19m 40s | Hits:   9%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 50m | Avg:  4m 35s | Max: 11m 49s | Hits:   9%/312   
      🟩 Test               Pass: 100%/2   | Total: 34m 50s | Avg: 17m 25s | Max: 19m 40s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 12s | Avg:  3m 12s | Max:  3m 12s
      🟩 90a                Pass: 100%/1   | Total:  3m 03s | Avg:  3m 03s | Max:  3m 03s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 23m 18s | Avg:  3m 53s | Max:  6m 02s
      🟩 20                 Pass: 100%/20  | Total:  2h 01m | Avg:  6m 05s | Max: 19m 40s | Hits:   9%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 9m 54s | Avg: 4m 57s | Max: 7m 42s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  9m 54s | Avg:  4m 57s | Max:  7m 42s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 12s | Avg:  2m 12s | Max:  2m 12s
      🟩 Test               Pass: 100%/1   | Total:  7m 42s | Avg:  7m 42s | Max:  7m 42s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 170)

# Runner
125 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato requested a review from miscco January 9, 2025 01:01
@github-actions
Copy link
Contributor

github-actions bot commented Jan 9, 2025

🟨 CI finished in 2h 28m: Pass: 78%/164 | Total: 2d 15h | Avg: 23m 08s | Max: 1h 16m | Hits: 419%/15310
  • 🟨 cub: Pass: 71%/45 | Total: 1d 05h | Avg: 38m 54s | Max: 1h 11m

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  69%/43  | Total:  1d 03h | Avg: 37m 55s | Max:  1h 11m
      🟩 arm64              Pass: 100%/2   | Total:  2h 00m | Avg:  1h 00m | Max:  1h 04m
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 57m | Avg: 58m 41s | Max: 58m 52s
      🔍 nvcc               Pass:  69%/43  | Total:  1d 03h | Avg: 37m 59s | Max:  1h 11m
    🔍 gpu: v100 🔍
      🟩 h100               Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 24m 38s
      🔍 v100               Pass:  69%/43  | Total:  1d 04h | Avg: 39m 46s | Max:  1h 11m
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  65%/38  | Total:  1d 02h | Avg: 42m 16s | Max:  1h 11m
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 18m 59s | Avg: 18m 59s | Max: 18m 59s
      🟩 GraphCapture       Pass: 100%/1   | Total: 15m 59s | Avg: 15m 59s | Max: 15m 59s
      🟩 HostLaunch         Pass: 100%/3   | Total: 56m 59s | Avg: 18m 59s | Max: 22m 09s
      🟩 TestGPU            Pass: 100%/2   | Total: 52m 27s | Avg: 26m 13s | Max: 29m 15s
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 15m 52s | Avg:  2m 38s | Max:  2m 52s
      🟩 12.5               Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
      🟨 12.6               Pass:  81%/37  | Total:  1d 02h | Avg: 43m 01s | Max:  1h 08m
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 57m | Avg: 58m 41s | Max: 58m 52s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 15m 52s | Avg:  2m 38s | Max:  2m 52s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
      🟨 nvcc12.6           Pass:  80%/35  | Total:  1d 00h | Avg: 42m 07s | Max:  1h 08m
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 09s | Avg:  3m 02s | Max:  3m 23s
      🟩 Clang10            Pass: 100%/1   | Total: 55m 19s | Avg: 55m 19s | Max: 55m 19s
      🟩 Clang11            Pass: 100%/1   | Total: 57m 36s | Avg: 57m 36s | Max: 57m 36s
      🟩 Clang12            Pass: 100%/1   | Total: 53m 50s | Avg: 53m 50s | Max: 53m 50s
      🟩 Clang13            Pass: 100%/1   | Total: 56m 35s | Avg: 56m 35s | Max: 56m 35s
      🟩 Clang14            Pass: 100%/1   | Total: 52m 50s | Avg: 52m 50s | Max: 52m 50s
      🟩 Clang15            Pass: 100%/1   | Total: 59m 25s | Avg: 59m 25s | Max: 59m 25s
      🟩 Clang16            Pass: 100%/1   | Total: 55m 43s | Avg: 55m 43s | Max: 55m 43s
      🟩 Clang17            Pass: 100%/1   | Total: 52m 53s | Avg: 52m 53s | Max: 52m 53s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 39m | Avg: 48m 28s | Max: 59m 16s
      🟥 GCC7               Pass:   0%/4   | Total: 11m 23s | Avg:  2m 50s | Max:  3m 11s
      🟩 GCC8               Pass: 100%/1   | Total: 54m 14s | Avg: 54m 14s | Max: 54m 14s
      🟨 GCC9               Pass:  33%/3   | Total: 58m 22s | Avg: 19m 27s | Max: 53m 11s
      🟩 GCC10              Pass: 100%/1   | Total: 55m 47s | Avg: 55m 47s | Max: 55m 47s
      🟩 GCC11              Pass: 100%/1   | Total: 58m 19s | Avg: 58m 19s | Max: 58m 19s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 37m | Avg: 32m 33s | Max: 56m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  4h 37m | Avg: 34m 37s | Max:  1h 04m
      🟥 MSVC14.29          Pass:   0%/1   | Total:  1h 04m | Avg:  1h 04m | Max:  1h 04m
      🟥 MSVC14.39          Pass:   0%/2   | Total:  2h 14m | Avg:  1h 07m | Max:  1h 08m
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total: 13h 15m | Avg: 41m 52s | Max: 59m 25s
      🟨 GCC                Pass:  71%/21  | Total: 10h 12m | Avg: 29m 10s | Max:  1h 04m
      🟥 MSVC               Pass:   0%/3   | Total:  3h 19m | Avg:  1h 06m | Max:  1h 08m
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 23m | Avg:  1h 11m | Max:  1h 11m
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 40m 41s | Avg: 20m 20s | Max: 24m 38s
      🟩 90a                Pass: 100%/1   | Total: 24m 10s | Avg: 24m 10s | Max: 24m 10s
    🟨 std
      🟥 11                 Pass:   0%/5   | Total: 14m 06s | Avg:  2m 49s | Max:  3m 16s
      🟥 14                 Pass:   0%/2   | Total:  6m 34s | Avg:  3m 17s | Max:  3m 23s
      🟨 17                 Pass:  58%/12  | Total:  9h 08m | Avg: 45m 40s | Max:  1h 11m
      🟨 20                 Pass:  96%/26  | Total: 19h 42m | Avg: 45m 27s | Max:  1h 11m
    
  • 🟨 libcudacxx: Pass: 73%/46 | Total: 8h 29m | Avg: 11m 04s | Max: 32m 21s | Hits: 657%/7590

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  72%/44  | Total:  8h 22m | Avg: 11m 24s | Max: 32m 21s | Hits: 657%/7590  
      🟩 arm64              Pass: 100%/2   | Total:  7m 06s | Avg:  3m 33s | Max:  3m 41s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 05m | Avg: 16m 26s | Max: 20m 20s
      🔍 nvcc               Pass:  71%/42  | Total:  7h 23m | Avg: 10m 33s | Max: 32m 21s | Hits: 657%/7590  
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 26m 14s | Avg:  4m 22s | Max: 14m 40s
      🟩 12.5               Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
      🟨 12.6               Pass:  84%/38  | Total:  7h 24m | Avg: 11m 41s | Max: 32m 21s | Hits: 657%/7590  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 05m | Avg: 16m 26s | Max: 20m 20s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 26m 14s | Avg:  4m 22s | Max: 14m 40s
      🟩 nvcc12.5           Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
      🟨 nvcc12.6           Pass:  82%/34  | Total:  6h 18m | Avg: 11m 07s | Max: 32m 21s | Hits: 657%/7590  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total: 12m 06s | Avg:  3m 01s | Max:  4m 23s
      🟩 Clang10            Pass: 100%/1   | Total:  5m 17s | Avg:  5m 17s | Max:  5m 17s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 52s | Avg:  3m 52s | Max:  3m 52s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 59s | Avg:  3m 59s | Max:  3m 59s
      🟩 Clang13            Pass: 100%/1   | Total: 20m 31s | Avg: 20m 31s | Max: 20m 31s
      🟩 Clang14            Pass: 100%/1   | Total:  3m 55s | Avg:  3m 55s | Max:  3m 55s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 34s | Avg:  4m 34s | Max:  4m 34s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 30s | Avg:  4m 30s | Max:  4m 30s
      🟩 Clang17            Pass: 100%/1   | Total:  4m 14s | Avg:  4m 14s | Max:  4m 14s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 46m | Avg: 13m 19s | Max: 20m 20s
      🟥 GCC7               Pass:   0%/4   | Total: 21m 54s | Avg:  5m 28s | Max: 14m 40s
      🟩 GCC8               Pass: 100%/1   | Total:  3m 34s | Avg:  3m 34s | Max:  3m 34s
      🟨 GCC9               Pass:  33%/3   | Total:  8m 29s | Avg:  2m 49s | Max:  4m 00s
      🟩 GCC10              Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
      🟩 GCC11              Pass: 100%/1   | Total: 20m 39s | Avg: 20m 39s | Max: 20m 39s
      🟩 GCC12              Pass: 100%/1   | Total:  3m 41s | Avg:  3m 41s | Max:  3m 41s
      🟨 GCC13              Pass:  80%/10  | Total:  2h 35m | Avg: 15m 33s | Max: 26m 31s
      🟩 MSVC14.29          Pass: 100%/1   | Total: 26m 43s | Avg: 26m 43s | Max: 26m 43s | Hits: 681%/2481  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 56m 18s | Avg: 28m 09s | Max: 32m 21s | Hits: 646%/5109  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
    🟨 cxx_family
      🟨 Clang              Pass:  80%/20  | Total:  2h 49m | Avg:  8m 28s | Max: 20m 31s
      🟨 GCC                Pass:  61%/21  | Total:  3h 37m | Avg: 10m 22s | Max: 26m 31s
      🟩 MSVC               Pass: 100%/3   | Total:  1h 23m | Avg: 27m 40s | Max: 32m 21s | Hits: 657%/7590  
      🟩 NVHPC              Pass: 100%/2   | Total: 38m 46s | Avg: 19m 23s | Max: 30m 26s
    🟨 jobs
      🟨 Build              Pass:  74%/39  | Total:  6h 08m | Avg:  9m 27s | Max: 32m 21s | Hits: 657%/7590  
      🟨 NVRTC              Pass:  50%/4   | Total:  1h 34m | Avg: 23m 34s | Max: 25m 56s
      🟩 Test               Pass: 100%/2   | Total: 44m 05s | Avg: 22m 02s | Max: 26m 31s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 53s | Avg:  1m 53s | Max:  1m 53s
    🟨 std
      🟥 11                 Pass:   0%/6   | Total: 31m 24s | Avg:  5m 14s | Max: 22m 13s
      🟥 14                 Pass:   0%/3   | Total: 30m 01s | Avg: 10m 00s | Max: 21m 59s
      🟨 17                 Pass:  76%/13  | Total:  2h 57m | Avg: 13m 39s | Max: 30m 26s | Hits: 682%/4962  
      🟩 20                 Pass: 100%/23  | Total:  4h 28m | Avg: 11m 39s | Max: 32m 21s | Hits: 612%/2628  
    🟨 gpu
      🟨 v100               Pass:  73%/46  | Total:  8h 29m | Avg: 11m 04s | Max: 32m 21s | Hits: 657%/7590  
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 43s | Avg: 13m 43s | Max: 13m 43s
      🟩 90a                Pass: 100%/2   | Total: 16m 18s | Avg:  8m 09s | Max: 12m 26s
    
  • 🟨 thrust: Pass: 77%/44 | Total: 22h 30m | Avg: 30m 41s | Max: 1h 16m | Hits: 168%/7408

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  76%/42  | Total: 21h 10m | Avg: 30m 15s | Max:  1h 16m | Hits: 168%/7408  
      🟩 arm64              Pass: 100%/2   | Total:  1h 19m | Avg: 39m 48s | Max: 41m 55s
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 21s
      🔍 nvcc               Pass:  76%/42  | Total: 21h 25m | Avg: 30m 35s | Max:  1h 16m | Hits: 168%/7408  
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  73%/38  | Total: 20h 57m | Avg: 33m 04s | Max:  1h 16m | Hits: 102%/5556  
      🟩 TestCPU            Pass: 100%/3   | Total: 52m 23s | Avg: 17m 27s | Max: 37m 18s | Hits: 365%/1852  
      🟩 TestGPU            Pass: 100%/3   | Total: 41m 04s | Avg: 13m 41s | Max: 15m 06s
    🟨 ctk
      🟥 11.1               Pass:   0%/6   | Total: 10m 42s | Avg:  1m 47s | Max:  2m 01s
      🟩 12.5               Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
      🟨 12.6               Pass:  88%/36  | Total: 19h 55m | Avg: 33m 12s | Max:  1h 13m | Hits: 168%/7408  
    🟨 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 05m | Avg: 32m 41s | Max: 33m 21s
      🟥 nvcc11.1           Pass:   0%/6   | Total: 10m 42s | Avg:  1m 47s | Max:  2m 01s
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
      🟨 nvcc12.6           Pass:  88%/34  | Total: 18h 50m | Avg: 33m 14s | Max:  1h 13m | Hits: 168%/7408  
    🟨 cxx
      🟥 Clang9             Pass:   0%/4   | Total:  8m 42s | Avg:  2m 10s | Max:  2m 25s
      🟩 Clang10            Pass: 100%/1   | Total: 38m 35s | Avg: 38m 35s | Max: 38m 35s
      🟩 Clang11            Pass: 100%/1   | Total: 40m 18s | Avg: 40m 18s | Max: 40m 18s
      🟩 Clang12            Pass: 100%/1   | Total: 40m 09s | Avg: 40m 09s | Max: 40m 09s
      🟩 Clang13            Pass: 100%/1   | Total: 38m 09s | Avg: 38m 09s | Max: 38m 09s
      🟩 Clang14            Pass: 100%/1   | Total: 36m 11s | Avg: 36m 11s | Max: 36m 11s
      🟩 Clang15            Pass: 100%/1   | Total: 36m 51s | Avg: 36m 51s | Max: 36m 51s
      🟩 Clang16            Pass: 100%/1   | Total: 38m 49s | Avg: 38m 49s | Max: 38m 49s
      🟩 Clang17            Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
      🟩 Clang18            Pass: 100%/7   | Total:  3h 22m | Avg: 28m 56s | Max: 39m 58s
      🟥 GCC7               Pass:   0%/4   | Total:  7m 38s | Avg:  1m 54s | Max:  2m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 39m 55s | Avg: 39m 55s | Max: 39m 55s
      🟨 GCC9               Pass:  33%/3   | Total: 45m 51s | Avg: 15m 17s | Max: 42m 35s
      🟩 GCC10              Pass: 100%/1   | Total: 41m 45s | Avg: 41m 45s | Max: 41m 45s
      🟩 GCC11              Pass: 100%/1   | Total: 40m 32s | Avg: 40m 32s | Max: 40m 32s
      🟩 GCC12              Pass: 100%/1   | Total: 44m 28s | Avg: 44m 28s | Max: 44m 28s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 36m | Avg: 27m 01s | Max: 42m 43s
      🟩 MSVC14.29          Pass: 100%/1   | Total:  1h 13m | Avg:  1h 13m | Max:  1h 13m | Hits: 104%/1852  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 55m | Avg: 58m 36s | Max:  1h 11m | Hits: 189%/5556  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
    🟨 cxx_family
      🟨 Clang              Pass:  78%/19  | Total:  8h 40m | Avg: 27m 22s | Max: 40m 18s
      🟨 GCC                Pass:  68%/19  | Total:  7h 16m | Avg: 22m 57s | Max: 44m 28s
      🟩 MSVC               Pass: 100%/4   | Total:  4h 09m | Avg:  1h 02m | Max:  1h 13m | Hits: 168%/7408  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 24m | Avg:  1h 12m | Max:  1h 16m
    🟨 std
      🟥 11                 Pass:   0%/5   | Total:  9m 51s | Avg:  1m 58s | Max:  2m 16s
      🟥 14                 Pass:   0%/2   | Total:  4m 32s | Avg:  2m 16s | Max:  2m 25s
      🟨 17                 Pass:  75%/12  | Total:  7h 27m | Avg: 37m 19s | Max:  1h 13m | Hits: 102%/3704  
      🟩 20                 Pass: 100%/23  | Total: 14h 02m | Avg: 36m 37s | Max:  1h 16m | Hits: 233%/3704  
    🟨 gpu
      🟨 v100               Pass:  77%/44  | Total: 22h 30m | Avg: 30m 41s | Max:  1h 16m | Hits: 168%/7408  
    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 45m 49s | Avg: 22m 54s | Max: 33m 56s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 24m 16s | Avg: 24m 16s | Max: 24m 16s
    
  • 🟩 cudax: Pass: 100%/26 | Total: 2h 30m | Avg: 5m 46s | Max: 19m 04s | Hits: 574%/312

    🟩 cpu
      🟩 amd64              Pass: 100%/22  | Total:  2h 15m | Avg:  6m 10s | Max: 19m 04s | Hits: 574%/312   
      🟩 arm64              Pass: 100%/4   | Total: 14m 20s | Avg:  3m 35s | Max:  3m 43s
    🟩 ctk
      🟩 12.0               Pass: 100%/3   | Total: 18m 56s | Avg:  6m 18s | Max: 11m 34s | Hits: 574%/156   
      🟩 12.5               Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
      🟩 12.6               Pass: 100%/21  | Total:  1h 58m | Avg:  5m 39s | Max: 19m 04s | Hits: 574%/156   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/3   | Total: 18m 56s | Avg:  6m 18s | Max: 11m 34s | Hits: 574%/156   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
      🟩 nvcc12.6           Pass: 100%/21  | Total:  1h 58m | Avg:  5m 39s | Max: 19m 04s | Hits: 574%/156   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/26  | Total:  2h 30m | Avg:  5m 46s | Max: 19m 04s | Hits: 574%/312   
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  3m 38s | Avg:  3m 38s | Max:  3m 38s
      🟩 Clang10            Pass: 100%/1   | Total:  4m 22s | Avg:  4m 22s | Max:  4m 22s
      🟩 Clang11            Pass: 100%/1   | Total:  3m 35s | Avg:  3m 35s | Max:  3m 35s
      🟩 Clang12            Pass: 100%/1   | Total:  3m 45s | Avg:  3m 45s | Max:  3m 45s
      🟩 Clang13            Pass: 100%/1   | Total:  3m 53s | Avg:  3m 53s | Max:  3m 53s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 Clang15            Pass: 100%/1   | Total:  4m 12s | Avg:  4m 12s | Max:  4m 12s
      🟩 Clang16            Pass: 100%/1   | Total:  4m 10s | Avg:  4m 10s | Max:  4m 10s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 47s | Avg:  3m 47s | Max:  3m 47s
      🟩 Clang18            Pass: 100%/4   | Total: 30m 09s | Avg:  7m 32s | Max: 19m 04s
      🟩 GCC9               Pass: 100%/1   | Total:  3m 44s | Avg:  3m 44s | Max:  3m 44s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 51s | Avg:  3m 51s | Max:  3m 51s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 37s | Avg:  3m 37s | Max:  3m 37s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 41s | Avg: 11m 20s | Max: 18m 37s
      🟩 GCC13              Pass: 100%/4   | Total: 13m 56s | Avg:  3m 29s | Max:  3m 43s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 34s | Avg: 11m 34s | Max: 11m 34s | Hits: 574%/156   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 12m 50s | Avg: 12m 50s | Max: 12m 50s | Hits: 574%/156   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/13  | Total:  1h 05m | Avg:  5m 02s | Max: 19m 04s
      🟩 GCC                Pass: 100%/9   | Total: 47m 49s | Avg:  5m 18s | Max: 18m 37s
      🟩 MSVC               Pass: 100%/2   | Total: 24m 24s | Avg: 12m 12s | Max: 12m 50s | Hits: 574%/312   
      🟩 NVHPC              Pass: 100%/2   | Total: 12m 33s | Avg:  6m 16s | Max:  6m 21s
    🟩 gpu
      🟩 v100               Pass: 100%/26  | Total:  2h 30m | Avg:  5m 46s | Max: 19m 04s | Hits: 574%/312   
    🟩 jobs
      🟩 Build              Pass: 100%/24  | Total:  1h 52m | Avg:  4m 41s | Max: 12m 50s | Hits: 574%/312   
      🟩 Test               Pass: 100%/2   | Total: 37m 41s | Avg: 18m 50s | Max: 19m 04s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  3m 13s | Avg:  3m 13s | Max:  3m 13s
      🟩 90a                Pass: 100%/1   | Total:  3m 20s | Avg:  3m 20s | Max:  3m 20s
    🟩 std
      🟩 17                 Pass: 100%/6   | Total: 23m 52s | Avg:  3m 58s | Max:  6m 12s
      🟩 20                 Pass: 100%/20  | Total:  2h 06m | Avg:  6m 19s | Max: 19m 04s | Hits: 574%/312   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 8m 50s | Avg: 4m 25s | Max: 6m 33s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  6m 33s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 17s | Avg:  2m 17s | Max:  2m 17s
      🟩 Test               Pass: 100%/1   | Total:  6m 33s | Avg:  6m 33s | Max:  6m 33s
    
  • 🟩 python: Pass: 100%/1 | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 26m 14s | Avg: 26m 14s | Max: 26m 14s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 164)

# Runner
122 linux-amd64-cpu16
19 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato added the 3.0 Targeted for 3.0 release label Jan 16, 2025
@fbusato fbusato changed the title [DO NOT MERGE] ceil_div return common type and optmize ceil_div return common type and optmize Jan 21, 2025
@github-actions
Copy link
Contributor

🟩 CI finished in 5h 36m: Pass: 100%/135 | Total: 2d 18h | Avg: 29m 35s | Max: 1h 44m | Hits: 360%/23291
  • 🟩 cub: Pass: 100%/38 | Total: 1d 08h | Avg: 51m 39s | Max: 1h 44m | Hits: 48%/3528

    🟩 cpu
      🟩 amd64              Pass: 100%/36  | Total:  1d 06h | Avg: 51m 32s | Max:  1h 44m | Hits:  48%/3528  
      🟩 arm64              Pass: 100%/2   | Total:  1h 47m | Avg: 53m 47s | Max: 53m 49s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  4h 42m | Avg: 56m 34s | Max:  1h 06m | Hits:  72%/882   
      🟩 12.5               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
      🟩 12.6               Pass: 100%/31  | Total:  1d 01h | Avg: 49m 50s | Max:  1h 44m | Hits:  40%/2646  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  1h 46m | Avg: 53m 17s | Max: 53m 58s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  4h 42m | Avg: 56m 34s | Max:  1h 06m | Hits:  72%/882   
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
      🟩 nvcc12.6           Pass: 100%/29  | Total: 23h 58m | Avg: 49m 36s | Max:  1h 44m | Hits:  40%/2646  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  1h 46m | Avg: 53m 17s | Max: 53m 58s
      🟩 nvcc               Pass: 100%/36  | Total:  1d 06h | Avg: 51m 34s | Max:  1h 44m | Hits:  48%/3528  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  3h 34m | Avg: 53m 38s | Max: 54m 28s
      🟩 Clang15            Pass: 100%/1   | Total: 50m 59s | Avg: 50m 59s | Max: 50m 59s
      🟩 Clang16            Pass: 100%/1   | Total: 54m 35s | Avg: 54m 35s | Max: 54m 35s
      🟩 Clang17            Pass: 100%/1   | Total: 52m 33s | Avg: 52m 33s | Max: 52m 33s
      🟩 Clang18            Pass: 100%/7   | Total:  5h 11m | Avg: 44m 25s | Max: 53m 58s
      🟩 GCC7               Pass: 100%/2   | Total:  1h 40m | Avg: 50m 17s | Max: 50m 54s
      🟩 GCC8               Pass: 100%/1   | Total: 51m 12s | Avg: 51m 12s | Max: 51m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 50m | Avg: 55m 05s | Max: 57m 50s
      🟩 GCC10              Pass: 100%/1   | Total: 52m 40s | Avg: 52m 40s | Max: 52m 40s
      🟩 GCC11              Pass: 100%/1   | Total: 55m 24s | Avg: 55m 24s | Max: 55m 24s
      🟩 GCC12              Pass: 100%/3   | Total:  1h 40m | Avg: 33m 21s | Max: 54m 59s
      🟩 GCC13              Pass: 100%/8   | Total:  6h 39m | Avg: 49m 54s | Max:  1h 44m
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 16m | Avg:  1h 08m | Max:  1h 10m | Hits:  56%/1764  
      🟩 MSVC14.39          Pass: 100%/2   | Total:  2h 19m | Avg:  1h 09m | Max:  1h 11m | Hits:  40%/1764  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total: 11h 23m | Avg: 48m 50s | Max: 54m 35s
      🟩 GCC                Pass: 100%/18  | Total: 14h 29m | Avg: 48m 17s | Max:  1h 44m
      🟩 MSVC               Pass: 100%/4   | Total:  4h 35m | Avg:  1h 08m | Max:  1h 11m | Hits:  48%/3528  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 08m
    🟩 gpu
      🟩 h100               Pass: 100%/2   | Total: 45m 06s | Avg: 22m 33s | Max: 25m 42s
      🟩 v100               Pass: 100%/36  | Total:  1d 07h | Avg: 53m 16s | Max:  1h 44m | Hits:  48%/3528  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total:  1d 04h | Avg: 54m 12s | Max:  1h 11m | Hits:  48%/3528  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 56m 39s | Avg: 56m 39s | Max: 56m 39s
      🟩 GraphCapture       Pass: 100%/1   | Total:  1h 44m | Avg:  1h 44m | Max:  1h 44m
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 00s | Max: 29m 58s
      🟩 TestGPU            Pass: 100%/2   | Total: 49m 52s | Avg: 24m 56s | Max: 26m 50s
    🟩 sm
      🟩 90                 Pass: 100%/2   | Total: 45m 06s | Avg: 22m 33s | Max: 25m 42s
      🟩 90a                Pass: 100%/1   | Total: 21m 34s | Avg: 21m 34s | Max: 21m 34s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total: 13h 27m | Avg: 57m 39s | Max:  1h 11m | Hits:  51%/2646  
      🟩 20                 Pass: 100%/24  | Total: 19h 16m | Avg: 48m 10s | Max:  1h 44m | Hits:  39%/882   
    
  • 🟩 libcudacxx: Pass: 100%/37 | Total: 9h 15m | Avg: 15m 01s | Max: 30m 52s | Hits: 653%/10061

    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total:  8h 34m | Avg: 14m 41s | Max: 30m 52s | Hits: 653%/10061 
      🟩 arm64              Pass: 100%/2   | Total: 41m 29s | Avg: 20m 44s | Max: 21m 18s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  1h 09m | Avg: 13m 48s | Max: 25m 12s | Hits: 687%/2470  
      🟩 12.5               Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
      🟩 12.6               Pass: 100%/30  | Total:  7h 49m | Avg: 15m 38s | Max: 30m 52s | Hits: 642%/7591  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total:  1h 09m | Avg: 17m 27s | Max: 22m 43s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  1h 09m | Avg: 13m 48s | Max: 25m 12s | Hits: 687%/2470  
      🟩 nvcc12.5           Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
      🟩 nvcc12.6           Pass: 100%/26  | Total:  6h 39m | Avg: 15m 22s | Max: 30m 52s | Hits: 642%/7591  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total:  1h 09m | Avg: 17m 27s | Max: 22m 43s
      🟩 nvcc               Pass: 100%/33  | Total:  8h 05m | Avg: 14m 43s | Max: 30m 52s | Hits: 653%/10061 
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total: 30m 32s | Avg:  7m 38s | Max: 17m 45s
      🟩 Clang15            Pass: 100%/1   | Total: 19m 19s | Avg: 19m 19s | Max: 19m 19s
      🟩 Clang16            Pass: 100%/1   | Total: 16m 57s | Avg: 16m 57s | Max: 16m 57s
      🟩 Clang17            Pass: 100%/1   | Total: 21m 27s | Avg: 21m 27s | Max: 21m 27s
      🟩 Clang18            Pass: 100%/8   | Total:  2h 12m | Avg: 16m 35s | Max: 22m 43s
      🟩 GCC7               Pass: 100%/2   | Total: 22m 15s | Avg: 11m 07s | Max: 18m 28s
      🟩 GCC8               Pass: 100%/1   | Total: 20m 46s | Avg: 20m 46s | Max: 20m 46s
      🟩 GCC9               Pass: 100%/2   | Total:  7m 47s | Avg:  3m 53s | Max:  4m 01s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 58s | Avg:  3m 58s | Max:  3m 58s
      🟩 GCC11              Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 38s | Avg:  4m 38s | Max:  4m 38s
      🟩 GCC13              Pass: 100%/8   | Total:  2h 20m | Avg: 17m 35s | Max: 25m 07s
      🟩 MSVC14.29          Pass: 100%/2   | Total: 53m 49s | Avg: 26m 54s | Max: 28m 37s | Hits: 654%/4950  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 59m 25s | Avg: 29m 42s | Max: 30m 52s | Hits: 652%/5111  
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/15  | Total:  3h 41m | Avg: 14m 44s | Max: 22m 43s
      🟩 GCC                Pass: 100%/16  | Total:  3h 24m | Avg: 12m 45s | Max: 25m 07s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 53m | Avg: 28m 18s | Max: 30m 52s | Hits: 653%/10061 
      🟩 NVHPC              Pass: 100%/2   | Total: 17m 15s | Avg:  8m 37s | Max:  8m 40s
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total:  9h 15m | Avg: 15m 01s | Max: 30m 52s | Hits: 653%/10061 
    🟩 jobs
      🟩 Build              Pass: 100%/32  | Total:  7h 49m | Avg: 14m 41s | Max: 30m 52s | Hits: 653%/10061 
      🟩 NVRTC              Pass: 100%/2   | Total: 47m 38s | Avg: 23m 49s | Max: 23m 50s
      🟩 Test               Pass: 100%/2   | Total: 36m 04s | Avg: 18m 02s | Max: 19m 14s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 57s | Avg:  1m 57s | Max:  1m 57s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 13m 39s | Avg: 13m 39s | Max: 13m 39s
      🟩 90a                Pass: 100%/2   | Total: 16m 39s | Avg:  8m 19s | Max: 12m 59s
    🟩 std
      🟩 17                 Pass: 100%/15  | Total:  3h 57m | Avg: 15m 50s | Max: 28m 37s | Hits: 665%/7430  
      🟩 20                 Pass: 100%/21  | Total:  5h 16m | Avg: 15m 02s | Max: 30m 52s | Hits: 619%/2631  
    
  • 🟩 thrust: Pass: 100%/37 | Total: 21h 41m | Avg: 35m 10s | Max: 1h 20m | Hits: 156%/9180

    🟩 cmake_options
      🟩 -DTHRUST_DISPATCH_TYPE=Force32bit Pass: 100%/2   | Total: 43m 13s | Avg: 21m 36s | Max: 28m 52s
    🟩 cpu
      🟩 amd64              Pass: 100%/35  | Total: 20h 43m | Avg: 35m 32s | Max:  1h 20m | Hits: 156%/9180  
      🟩 arm64              Pass: 100%/2   | Total: 57m 46s | Avg: 28m 53s | Max: 30m 14s
    🟩 ctk
      🟩 12.0               Pass: 100%/5   | Total:  3h 07m | Avg: 37m 26s | Max:  1h 05m | Hits:  97%/1836  
      🟩 12.5               Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
      🟩 12.6               Pass: 100%/30  | Total: 16h 18m | Avg: 32m 37s | Max:  1h 20m | Hits: 171%/7344  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 59m 39s | Avg: 29m 49s | Max: 30m 37s
      🟩 nvcc12.0           Pass: 100%/5   | Total:  3h 07m | Avg: 37m 26s | Max:  1h 05m | Hits:  97%/1836  
      🟩 nvcc12.5           Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
      🟩 nvcc12.6           Pass: 100%/28  | Total: 15h 19m | Avg: 32m 49s | Max:  1h 20m | Hits: 171%/7344  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 59m 39s | Avg: 29m 49s | Max: 30m 37s
      🟩 nvcc               Pass: 100%/35  | Total: 20h 41m | Avg: 35m 28s | Max:  1h 20m | Hits: 156%/9180  
    🟩 cxx
      🟩 Clang14            Pass: 100%/4   | Total:  2h 01m | Avg: 30m 17s | Max: 31m 04s
      🟩 Clang15            Pass: 100%/1   | Total: 34m 09s | Avg: 34m 09s | Max: 34m 09s
      🟩 Clang16            Pass: 100%/1   | Total: 33m 31s | Avg: 33m 31s | Max: 33m 31s
      🟩 Clang17            Pass: 100%/1   | Total: 34m 07s | Avg: 34m 07s | Max: 34m 07s
      🟩 Clang18            Pass: 100%/7   | Total:  2h 57m | Avg: 25m 18s | Max: 32m 14s
      🟩 GCC7               Pass: 100%/2   | Total: 57m 32s | Avg: 28m 46s | Max: 29m 07s
      🟩 GCC8               Pass: 100%/1   | Total: 31m 12s | Avg: 31m 12s | Max: 31m 12s
      🟩 GCC9               Pass: 100%/2   | Total:  1h 05m | Avg: 32m 38s | Max: 33m 12s
      🟩 GCC10              Pass: 100%/1   | Total: 31m 51s | Avg: 31m 51s | Max: 31m 51s
      🟩 GCC11              Pass: 100%/1   | Total: 38m 59s | Avg: 38m 59s | Max: 38m 59s
      🟩 GCC12              Pass: 100%/1   | Total: 37m 05s | Avg: 37m 05s | Max: 37m 05s
      🟩 GCC13              Pass: 100%/8   | Total:  3h 02m | Avg: 22m 51s | Max: 33m 29s
      🟩 MSVC14.29          Pass: 100%/2   | Total:  2h 12m | Avg:  1h 06m | Max:  1h 07m | Hits: 111%/3672  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  3h 08m | Avg:  1h 02m | Max:  1h 20m | Hits: 187%/5508  
      🟩 NVHPC24.7          Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
    🟩 cxx_family
      🟩 Clang              Pass: 100%/14  | Total:  6h 40m | Avg: 28m 34s | Max: 34m 09s
      🟩 GCC                Pass: 100%/16  | Total:  7h 24m | Avg: 27m 48s | Max: 38m 59s
      🟩 MSVC               Pass: 100%/5   | Total:  5h 21m | Avg:  1h 04m | Max:  1h 20m | Hits: 156%/9180  
      🟩 NVHPC              Pass: 100%/2   | Total:  2h 15m | Avg:  1h 07m | Max:  1h 10m
    🟩 gpu
      🟩 v100               Pass: 100%/37  | Total: 21h 41m | Avg: 35m 10s | Max:  1h 20m | Hits: 156%/9180  
    🟩 jobs
      🟩 Build              Pass: 100%/31  | Total: 19h 55m | Avg: 38m 33s | Max:  1h 20m | Hits: 104%/7344  
      🟩 TestCPU            Pass: 100%/3   | Total: 50m 46s | Avg: 16m 55s | Max: 35m 03s | Hits: 365%/1836  
      🟩 TestGPU            Pass: 100%/3   | Total: 55m 37s | Avg: 18m 32s | Max: 22m 33s
    🟩 sm
      🟩 90a                Pass: 100%/1   | Total: 17m 09s | Avg: 17m 09s | Max: 17m 09s
    🟩 std
      🟩 17                 Pass: 100%/14  | Total:  9h 38m | Avg: 41m 17s | Max:  1h 13m | Hits: 107%/5508  
      🟩 20                 Pass: 100%/21  | Total: 11h 20m | Avg: 32m 23s | Max:  1h 20m | Hits: 231%/3672  
    
  • 🟩 cudax: Pass: 100%/20 | Total: 1h 55m | Avg: 5m 46s | Max: 18m 54s | Hits: 383%/522

    🟩 cpu
      🟩 amd64              Pass: 100%/16  | Total:  1h 45m | Avg:  6m 34s | Max: 18m 54s | Hits: 383%/522   
      🟩 arm64              Pass: 100%/4   | Total: 10m 30s | Avg:  2m 37s | Max:  2m 42s
    🟩 ctk
      🟩 12.0               Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 12.5               Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
      🟩 12.6               Pass: 100%/17  | Total:  1h 32m | Avg:  5m 28s | Max: 18m 54s | Hits: 383%/261   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
      🟩 nvcc12.6           Pass: 100%/17  | Total:  1h 32m | Avg:  5m 28s | Max: 18m 54s | Hits: 383%/261   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/20  | Total:  1h 55m | Avg:  5m 46s | Max: 18m 54s | Hits: 383%/522   
    🟩 cxx
      🟩 Clang14            Pass: 100%/1   | Total:  3m 10s | Avg:  3m 10s | Max:  3m 10s
      🟩 Clang15            Pass: 100%/1   | Total:  3m 24s | Avg:  3m 24s | Max:  3m 24s
      🟩 Clang16            Pass: 100%/1   | Total:  3m 22s | Avg:  3m 22s | Max:  3m 22s
      🟩 Clang17            Pass: 100%/1   | Total:  3m 29s | Avg:  3m 29s | Max:  3m 29s
      🟩 Clang18            Pass: 100%/4   | Total: 26m 47s | Avg:  6m 41s | Max: 18m 14s
      🟩 GCC10              Pass: 100%/1   | Total:  3m 18s | Avg:  3m 18s | Max:  3m 18s
      🟩 GCC11              Pass: 100%/1   | Total:  3m 19s | Avg:  3m 19s | Max:  3m 19s
      🟩 GCC12              Pass: 100%/2   | Total: 22m 16s | Avg: 11m 08s | Max: 18m 54s
      🟩 GCC13              Pass: 100%/4   | Total: 10m 51s | Avg:  2m 42s | Max:  3m 00s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s | Hits: 383%/261   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 13m 01s | Avg: 13m 01s | Max: 13m 01s | Hits: 383%/261   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/8   | Total: 40m 12s | Avg:  5m 01s | Max: 18m 14s
      🟩 GCC                Pass: 100%/8   | Total: 39m 44s | Avg:  4m 58s | Max: 18m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 24m 38s | Avg: 12m 19s | Max: 13m 01s | Hits: 383%/522   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 02s | Avg:  5m 31s | Max:  5m 34s
    🟩 gpu
      🟩 v100               Pass: 100%/20  | Total:  1h 55m | Avg:  5m 46s | Max: 18m 54s | Hits: 383%/522   
    🟩 jobs
      🟩 Build              Pass: 100%/18  | Total:  1h 18m | Avg:  4m 21s | Max: 13m 01s | Hits: 383%/522   
      🟩 Test               Pass: 100%/2   | Total: 37m 08s | Avg: 18m 34s | Max: 18m 54s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 40s | Avg:  2m 40s | Max:  2m 40s
      🟩 90a                Pass: 100%/1   | Total:  3m 00s | Avg:  3m 00s | Max:  3m 00s
    🟩 std
      🟩 17                 Pass: 100%/4   | Total: 13m 28s | Avg:  3m 22s | Max:  5m 34s
      🟩 20                 Pass: 100%/16  | Total:  1h 42m | Avg:  6m 23s | Max: 18m 54s | Hits: 383%/522   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 04s | Avg: 5m 02s | Max: 8m 08s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 04s | Avg:  5m 02s | Max:  8m 08s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
      🟩 Test               Pass: 100%/1   | Total:  8m 08s | Avg:  8m 08s | Max:  8m 08s
    
  • 🟩 python: Pass: 100%/1 | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 50m 04s | Avg: 50m 04s | Max: 50m 04s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 135)

# Runner
92 linux-amd64-cpu16
17 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16
10 linux-arm64-cpu16
1 linux-amd64-gpu-h100-latest-1-testing

@fbusato fbusato merged commit 0b5a5d8 into NVIDIA:main Jan 23, 2025
147 of 150 checks passed
@bernhardmgruber bernhardmgruber added libcu++ For all items related to libcu++ breaking Breaking change labels Jan 24, 2025
davebayer pushed a commit to davebayer/cccl that referenced this pull request Jan 29, 2025
@fbusato fbusato deleted the ceil_div-common_type-optmize branch February 11, 2025 18:24
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

3.0 Targeted for 3.0 release breaking Breaking change libcu++ For all items related to libcu++

Projects

Archived in project

Development

Successfully merging this pull request may close these issues.

[FEA]: ceil_div should return the resulting type of its operation

3 participants