Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add inclusive_scan with initial value support (warp/block) #1749

Merged
merged 28 commits into from
Jun 11, 2024

Conversation

gonidelis
Copy link
Member

@gonidelis gonidelis commented May 16, 2024

Doesn't fix but works towards #693.

Adds warp_scan with initial value support on warp and block level. Tests added.

todo:

  • add documentation for block inclusive_scan with init value overloads.
  • move documentation examples to standalone test executables (in order to improve test coverage for all overloads).

@gonidelis gonidelis marked this pull request as ready for review May 16, 2024 22:37
@gonidelis gonidelis requested review from a team as code owners May 16, 2024 22:37
Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Copy link
Contributor

@bernhardmgruber bernhardmgruber left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In general, I have troubles understanding what an initial value to a scan is, so maybe the documentation could be improved in this regard. However, it seems the term is used in several places, so it's out of scope for this PR.

cub/cub/warp/warp_scan.cuh Outdated Show resolved Hide resolved
cub/cub/warp/warp_scan.cuh Outdated Show resolved Hide resolved
cub/test/catch2_test_warp_scan.cu Outdated Show resolved Hide resolved
@gonidelis gonidelis changed the title Add inclusive_scan with initial value support on warp level Add inclusive_scan with initial value support (warp/block) May 20, 2024
Copy link
Contributor

🟨 CI Results [ Failed: 3 | Passed: 195 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟨 Project cub [ Failed: 3 | Passed: 96 | Total: 99 ]

    🔍 cpu: amd64 🔍
      🔍 amd64 (3% Fail)              Failed:  3  -- Passed: 88  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🔍 ctk: 12.4 🔍
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🔍 12.4 (3% Fail)               Failed:  3  -- Passed: 78  -- Total: 81 
    🔍 cudacxx_full: nvcc12.4 🔍
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🔍 nvcc12.4 (3% Fail)           Failed:  3  -- Passed: 76  -- Total: 79 
    🔍 cudacxx_name: nvcc 🔍
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🔍 nvcc (3% Fail)               Failed:  3  -- Passed: 94  -- Total: 97 
    🚨 cxx_full: MSVC14.39 🚨
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🔥 MSVC14.39 (100% Fail)        Failed:  3  -- Passed:  0  -- Total:  3 
    🔍 cxx_name: MSVC 🔍
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🔍 MSVC (50% Fail)              Failed:  3  -- Passed:  3  -- Total:  6 
    🔍 jobs: build 🔍
      🔍 build (3% Fail)              Failed:  3  -- Passed: 88  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🔍 os: windows2022 🔍
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🔍 windows2022 (50% Fail)       Failed:  3  -- Passed:  3  -- Total:  6 
    🟨 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟨 14 (3% Fail)                 Failed:  1  -- Passed: 28  -- Total: 29 
      🟨 17 (3% Fail)                 Failed:  1  -- Passed: 27  -- Total: 28 
      🟨 20 (6% Fail)                 Failed:  1  -- Passed: 15  -- Total: 16 
    🟨 gpu
      🟨 v100 (3% Fail)               Failed:  3  -- Passed: 96  -- Total: 99 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

cub/cub/block/block_scan.cuh Show resolved Hide resolved
cub/cub/block/block_scan.cuh Show resolved Hide resolved
Copy link
Collaborator

@gevtushenko gevtushenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few comments on value-based API. I think the issue would be caught by an _api tests we write for literalinclude as opposed to code-block based documentation. If you have spare moment, it'd be nice to rewrite new docs to use literalincludes.

cub/cub/block/block_scan.cuh Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Show resolved Hide resolved
cub/cub/block/block_scan.cuh Outdated Show resolved Hide resolved
Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

1 similar comment
Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

Copy link
Collaborator

@gevtushenko gevtushenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few comments on the test and implementation side of value-based API of block scan. Please, let me know when those are addressed and I'll have another look.

cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/test/catch2_test_block_scan.cu Outdated Show resolved Hide resolved
cub/cub/block/specializations/block_scan_warp_scans.cuh Outdated Show resolved Hide resolved
Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

1 similar comment
Copy link
Contributor

🟩 CI Results [ Failed: 0 | Passed: 198 | Total: 198 ]
  • 🟩 Project thrust [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    
  • 🟩 Project cub [ Failed: 0 | Passed: 99 | Total: 99 ]

    🟩 cpu
      🟩 amd64 (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 arm64 (0% Fail)              Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 ctk
      🟩 11.1 (0% Fail)               Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 11.8 (0% Fail)               Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 12.4 (0% Fail)               Failed:  0  -- Passed: 81  -- Total: 81 
    🟩 cudacxx_full
      🟩 clang-cuda16 (0% Fail)       Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc11.1 (0% Fail)           Failed:  0  -- Passed: 15  -- Total: 15 
      🟩 nvcc11.8 (0% Fail)           Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 nvcc12.4 (0% Fail)           Failed:  0  -- Passed: 79  -- Total: 79 
    🟩 cudacxx_name
      🟩 clang-cuda (0% Fail)         Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 nvcc (0% Fail)               Failed:  0  -- Passed: 97  -- Total: 97 
    🟩 cxx_full
      🟩 clang9 (0% Fail)             Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 clang10 (0% Fail)            Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 clang11 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang12 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang13 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang14 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang15 (0% Fail)            Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 clang16 (0% Fail)            Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 gcc6 (0% Fail)               Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 gcc7 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc8 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc9 (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
      🟩 gcc10 (0% Fail)              Failed:  0  -- Passed:  4  -- Total:  4 
      🟩 gcc11 (0% Fail)              Failed:  0  -- Passed:  7  -- Total:  7 
      🟩 gcc12 (0% Fail)              Failed:  0  -- Passed: 16  -- Total: 16 
      🟩 Intel2023.2.0 (0% Fail)      Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC14.16 (0% Fail)          Failed:  0  -- Passed:  1  -- Total:  1 
      🟩 MSVC14.29 (0% Fail)          Failed:  0  -- Passed:  2  -- Total:  2 
      🟩 MSVC14.39 (0% Fail)          Failed:  0  -- Passed:  3  -- Total:  3 
    🟩 cxx_name
      🟩 clang (0% Fail)              Failed:  0  -- Passed: 43  -- Total: 43 
      🟩 gcc (0% Fail)                Failed:  0  -- Passed: 47  -- Total: 47 
      🟩 Intel (0% Fail)              Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 MSVC (0% Fail)               Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 gpu
      🟩 v100 (0% Fail)               Failed:  0  -- Passed: 99  -- Total: 99 
    🟩 jobs
      🟩 build (0% Fail)              Failed:  0  -- Passed: 91  -- Total: 91 
      🟩 test (0% Fail)               Failed:  0  -- Passed:  8  -- Total:  8 
    🟩 os
      🟩 ubuntu18.04 (0% Fail)        Failed:  0  -- Passed: 14  -- Total: 14 
      🟩 ubuntu20.04 (0% Fail)        Failed:  0  -- Passed: 35  -- Total: 35 
      🟩 ubuntu22.04 (0% Fail)        Failed:  0  -- Passed: 44  -- Total: 44 
      🟩 windows2022 (0% Fail)        Failed:  0  -- Passed:  6  -- Total:  6 
    🟩 sm
      🟩 60;70;80;90 (0% Fail)        Failed:  0  -- Passed:  3  -- Total:  3 
      🟩 90a (0% Fail)                Failed:  0  -- Passed:  4  -- Total:  4 
    🟩 std
      🟩 11 (0% Fail)                 Failed:  0  -- Passed: 26  -- Total: 26 
      🟩 14 (0% Fail)                 Failed:  0  -- Passed: 29  -- Total: 29 
      🟩 17 (0% Fail)                 Failed:  0  -- Passed: 28  -- Total: 28 
      🟩 20 (0% Fail)                 Failed:  0  -- Passed: 16  -- Total: 16 
    

🏃‍ Runner counts (total jobs: 198)

# Runner
154 linux-amd64-cpu16
16 linux-arm64-cpu16
16 linux-amd64-gpu-v100-latest-1
12 windows-amd64-cpu16

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

@gonidelis gonidelis force-pushed the inclusive_scan_init branch 2 times, most recently from a932030 to 726e988 Compare May 31, 2024 22:11
Copy link
Contributor

github-actions bot commented Jun 1, 2024

🟨 CI Results: Pass: 97%/249 | Total Time: 4d 16h | Avg Time: 27m 13s | Hits: 60%/244265
  • 🟨 cub: Pass: 95%/131 | Total Time: 2d 16h | Avg Time: 29m 29s | Hits: 50%/104999

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  95%/123 | Total Time:  2d 11h | Avg Time: 29m 15s | Hits:  52%/98183 
      🟩 arm64              Pass: 100%/8   | Total Time:  4h 24m | Avg Time: 33m 01s | Hits:  16%/6816  
    🔍 cudacxx_name: nvcc 🔍
      🟩 clang-cuda         Pass: 100%/2   | Total Time: 43m 53s | Avg Time: 21m 56s | Hits:  13%/1410  
      🔍 nvcc               Pass:  95%/129 | Total Time:  2d 15h | Avg Time: 29m 36s | Hits:  50%/103589
    🚨 cxx_name: MSVC 🚨
      🟩 clang              Pass: 100%/59  | Total Time:  1d 05h | Avg Time: 29m 31s | Hits:  51%/49790 
      🟩 gcc                Pass: 100%/63  | Total Time:  1d 07h | Avg Time: 29m 58s | Hits:  49%/52875 
      🟩 Intel              Pass: 100%/3   | Total Time:  1h 54m | Avg Time: 38m 11s | Hits:  36%/2334  
      🔥 MSVC               Pass:   0%/6   | Total Time:  1h 57m | Avg Time: 19m 39s
    🔍 jobs: Build 🔍
      🔍 Build              Pass:  93%/99  | Total Time:  2d 03h | Avg Time: 31m 18s | Hits:  33%/77735 
      🟩 DeviceLaunch       Pass: 100%/8   | Total Time:  2h 57m | Avg Time: 22m 11s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total Time:  2h 25m | Avg Time: 18m 14s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total Time:  2h 44m | Avg Time: 20m 36s | Hits:  99%/6816  
      🟩 TestGPU            Pass: 100%/8   | Total Time:  4h 34m | Avg Time: 34m 22s | Hits:  95%/6816  
    🚨 os: windows2022 🚨
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time:  6h 02m | Avg Time: 25m 52s | Hits:  36%/10873 
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time: 19h 38m | Avg Time: 33m 40s | Hits:  38%/29890 
      🟩 ubuntu22.04        Pass: 100%/76  | Total Time:  1d 12h | Avg Time: 28m 59s | Hits:  57%/64236 
      🔥 windows2022        Pass:   0%/6   | Total Time:  1h 57m | Avg Time: 19m 39s
    🟨 ctk
      🟨 11.1               Pass:  93%/15  | Total Time:  6h 22m | Avg Time: 25m 30s | Hits:  36%/10873 
      🟩 11.8               Pass: 100%/3   | Total Time:  2h 11m | Avg Time: 43m 42s | Hits:  38%/2556  
      🟨 12.4               Pass:  95%/113 | Total Time:  2d 07h | Avg Time: 29m 38s | Hits:  52%/91570 
    🟨 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total Time: 43m 53s | Avg Time: 21m 56s | Hits:  13%/1410  
      🟨 nvcc11.1           Pass:  93%/15  | Total Time:  6h 22m | Avg Time: 25m 30s | Hits:  36%/10873 
      🟩 nvcc11.8           Pass: 100%/3   | Total Time:  2h 11m | Avg Time: 43m 42s | Hits:  38%/2556  
      🟨 nvcc12.4           Pass:  95%/111 | Total Time:  2d 07h | Avg Time: 29m 46s | Hits:  52%/90160 
    🟨 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time:  3h 00m | Avg Time: 30m 01s | Hits:  37%/4890  
      🟩 clang10            Pass: 100%/3   | Total Time:  1h 43m | Avg Time: 34m 25s | Hits:  38%/2562  
      🟩 clang11            Pass: 100%/4   | Total Time:  2h 08m | Avg Time: 32m 09s | Hits:  38%/3416  
      🟩 clang12            Pass: 100%/4   | Total Time:  2h 17m | Avg Time: 34m 26s | Hits:  38%/3416  
      🟩 clang13            Pass: 100%/4   | Total Time:  2h 08m | Avg Time: 32m 05s | Hits:  38%/3416  
      🟩 clang14            Pass: 100%/4   | Total Time:  2h 13m | Avg Time: 33m 20s | Hits:  38%/3416  
      🟩 clang15            Pass: 100%/4   | Total Time:  2h 15m | Avg Time: 33m 52s | Hits:  38%/3408  
      🟩 clang16            Pass: 100%/4   | Total Time:  2h 10m | Avg Time: 32m 40s | Hits:  38%/3408  
      🟩 clang17            Pass: 100%/26  | Total Time: 11h 04m | Avg Time: 25m 32s | Hits:  68%/21858 
      🟩 gcc6               Pass: 100%/2   | Total Time: 50m 48s | Avg Time: 25m 24s | Hits:  36%/1552  
      🟩 gcc7               Pass: 100%/6   | Total Time:  2h 54m | Avg Time: 29m 08s | Hits:  37%/4893  
      🟩 gcc8               Pass: 100%/6   | Total Time:  3h 02m | Avg Time: 30m 20s | Hits:  37%/4893  
      🟩 gcc9               Pass: 100%/6   | Total Time:  3h 04m | Avg Time: 30m 45s | Hits:  37%/4893  
      🟩 gcc10              Pass: 100%/4   | Total Time:  2h 17m | Avg Time: 34m 20s | Hits:  38%/3416  
      🟩 gcc11              Pass: 100%/7   | Total Time:  4h 28m | Avg Time: 38m 23s | Hits:  38%/5964  
      🟩 gcc12              Pass: 100%/4   | Total Time:  2h 19m | Avg Time: 34m 54s | Hits:  38%/3408  
      🟩 gcc13              Pass: 100%/28  | Total Time: 12h 30m | Avg Time: 26m 48s | Hits:  62%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time:  1h 54m | Avg Time: 38m 11s | Hits:  36%/2334  
      🟥 MSVC14.16          Pass:   0%/1   | Total Time: 20m 19s | Avg Time: 20m 19s
      🟥 MSVC14.29          Pass:   0%/2   | Total Time: 38m 21s | Avg Time: 19m 10s
      🟥 MSVC14.39          Pass:   0%/3   | Total Time: 59m 17s | Avg Time: 19m 45s
    🟨 std
      🟩 11                 Pass: 100%/34  | Total Time: 17h 44m | Avg Time: 31m 19s | Hits:  48%/28537 
      🟨 14                 Pass:  91%/37  | Total Time: 17h 53m | Avg Time: 29m 00s | Hits:  49%/28537 
      🟨 17                 Pass:  94%/36  | Total Time: 17h 21m | Avg Time: 28m 56s | Hits:  49%/28466 
      🟨 20                 Pass:  95%/24  | Total Time: 11h 23m | Avg Time: 28m 27s | Hits:  54%/19459 
    🟨 gpu
      🟨 v100               Pass:  95%/131 | Total Time:  2d 16h | Avg Time: 29m 29s | Hits:  50%/104999
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time:  2h 11m | Avg Time: 43m 42s | Hits:  38%/2556  
      🟩 90a                Pass: 100%/4   | Total Time:  1h 14m | Avg Time: 18m 34s | Hits:  16%/3408  
    
  • 🟩 thrust: Pass: 100%/118 | Total Time: 2d 00h | Avg Time: 24m 41s | Hits: 69%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total Time:  1d 21h | Avg Time: 24m 39s | Hits:  69%/129822
      🟩 arm64              Pass: 100%/8   | Total Time:  3h 21m | Avg Time: 25m 08s | Hits:  63%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time:  6h 03m | Avg Time: 24m 13s | Hits:  63%/17705 
      🟩 11.8               Pass: 100%/3   | Total Time:  1h 47m | Avg Time: 35m 52s | Hits:  63%/3543  
      🟩 12.4               Pass: 100%/100 | Total Time:  1d 16h | Avg Time: 24m 25s | Hits:  70%/118018
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total Time: 47m 21s | Avg Time: 23m 40s | Hits:  62%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time:  6h 03m | Avg Time: 24m 13s | Hits:  63%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total Time:  1h 47m | Avg Time: 35m 52s | Hits:  63%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total Time:  1d 15h | Avg Time: 24m 26s | Hits:  70%/115658
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time: 47m 21s | Avg Time: 23m 40s | Hits:  62%/2360  
      🟩 nvcc               Pass: 100%/116 | Total Time:  1d 23h | Avg Time: 24m 42s | Hits:  69%/136906
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time:  2h 22m | Avg Time: 23m 46s | Hits:  63%/7080  
      🟩 clang10            Pass: 100%/3   | Total Time:  1h 16m | Avg Time: 25m 29s | Hits:  63%/3540  
      🟩 clang11            Pass: 100%/4   | Total Time:  1h 40m | Avg Time: 25m 13s | Hits:  63%/4720  
      🟩 clang12            Pass: 100%/4   | Total Time:  1h 46m | Avg Time: 26m 31s | Hits:  63%/4720  
      🟩 clang13            Pass: 100%/4   | Total Time:  1h 40m | Avg Time: 25m 05s | Hits:  63%/4720  
      🟩 clang14            Pass: 100%/4   | Total Time:  1h 41m | Avg Time: 25m 16s | Hits:  63%/4720  
      🟩 clang15            Pass: 100%/4   | Total Time:  1h 44m | Avg Time: 26m 08s | Hits:  63%/4720  
      🟩 clang16            Pass: 100%/4   | Total Time:  1h 45m | Avg Time: 26m 27s | Hits:  63%/4720  
      🟩 clang17            Pass: 100%/18  | Total Time:  5h 39m | Avg Time: 18m 53s | Hits:  79%/21240 
      🟩 gcc6               Pass: 100%/2   | Total Time: 43m 36s | Avg Time: 21m 48s | Hits:  63%/2360  
      🟩 gcc7               Pass: 100%/6   | Total Time:  2h 25m | Avg Time: 24m 16s | Hits:  63%/7086  
      🟩 gcc8               Pass: 100%/6   | Total Time:  2h 24m | Avg Time: 24m 02s | Hits:  63%/7086  
      🟩 gcc9               Pass: 100%/6   | Total Time:  2h 29m | Avg Time: 24m 53s | Hits:  63%/7086  
      🟩 gcc10              Pass: 100%/4   | Total Time:  1h 54m | Avg Time: 28m 32s | Hits:  63%/4724  
      🟩 gcc11              Pass: 100%/7   | Total Time:  3h 33m | Avg Time: 30m 31s | Hits:  63%/8267  
      🟩 gcc12              Pass: 100%/4   | Total Time:  1h 50m | Avg Time: 27m 37s | Hits:  63%/4724  
      🟩 gcc13              Pass: 100%/20  | Total Time:  5h 51m | Avg Time: 17m 35s | Hits:  77%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time:  1h 36m | Avg Time: 32m 10s | Hits:  63%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 45m 10s | Avg Time: 45m 10s | Hits:  61%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total Time:  1h 41m | Avg Time: 50m 32s | Hits:  61%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total Time:  3h 40m | Avg Time: 36m 46s | Hits:  80%/7056  
    🟩 cxx_name
      🟩 clang              Pass: 100%/51  | Total Time: 19h 37m | Avg Time: 23m 05s | Hits:  69%/60180 
      🟩 gcc                Pass: 100%/55  | Total Time: 21h 13m | Avg Time: 23m 08s | Hits:  68%/64953 
      🟩 Intel              Pass: 100%/3   | Total Time:  1h 36m | Avg Time: 32m 10s | Hits:  63%/3549  
      🟩 MSVC               Pass: 100%/9   | Total Time:  6h 06m | Avg Time: 40m 45s | Hits:  74%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total Time:  2d 00h | Avg Time: 24m 41s | Hits:  69%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total Time:  1d 20h | Avg Time: 27m 05s | Hits:  63%/116850
      🟩 TestCPU            Pass: 100%/11  | Total Time:  1h 44m | Avg Time:  9m 28s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total Time:  2h 07m | Avg Time: 15m 55s | Hits:  99%/9444  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time:  5h 18m | Avg Time: 22m 43s | Hits:  63%/16529 
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time: 15h 06m | Avg Time: 25m 53s | Hits:  63%/41313 
      🟩 ubuntu22.04        Pass: 100%/60  | Total Time: 22h 02m | Avg Time: 22m 02s | Hits:  73%/70840 
      🟩 windows2022        Pass: 100%/9   | Total Time:  6h 06m | Avg Time: 40m 45s | Hits:  74%/10584 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time:  1h 47m | Avg Time: 35m 52s | Hits:  63%/3543  
      🟩 90a                Pass: 100%/4   | Total Time:  1h 00m | Avg Time: 15m 11s | Hits:  63%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total Time: 10h 19m | Avg Time: 20m 39s | Hits:  70%/35418 
      🟩 14                 Pass: 100%/34  | Total Time: 14h 34m | Avg Time: 25m 43s | Hits:  67%/40122 
      🟩 17                 Pass: 100%/33  | Total Time: 14h 43m | Avg Time: 26m 46s | Hits:  68%/38946 
      🟩 20                 Pass: 100%/21  | Total Time:  8h 55m | Avg Time: 25m 30s | Hits:  71%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@gonidelis gonidelis force-pushed the inclusive_scan_init branch 3 times, most recently from f38095f to 5bf71e0 Compare June 3, 2024 21:09
Copy link
Contributor

github-actions bot commented Jun 4, 2024

🟩 CI Results: Pass: 100%/249 | Total Time: 1d 05h | Avg Time: 7m 08s | Hits: 99%/248441
  • 🟩 cub: Pass: 100%/131 | Total Time: 19h 27m | Avg Time: 8m 54s | Hits: 98%/109175

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total Time: 18h 55m | Avg Time:  9m 13s | Hits:  98%/102359
      🟩 arm64              Pass: 100%/8   | Total Time: 31m 25s | Avg Time:  3m 55s | Hits:  99%/6816  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time:  1h 45m | Avg Time:  7m 00s | Hits:  95%/11569 
      🟩 11.8               Pass: 100%/3   | Total Time: 13m 25s | Avg Time:  4m 28s | Hits:  99%/2556  
      🟩 12.4               Pass: 100%/113 | Total Time: 17h 28m | Avg Time:  9m 16s | Hits:  98%/95050 
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total Time:  6m 30s | Avg Time:  3m 15s | Hits:  99%/1410  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time:  1h 45m | Avg Time:  7m 00s | Hits:  95%/11569 
      🟩 nvcc11.8           Pass: 100%/3   | Total Time: 13m 25s | Avg Time:  4m 28s | Hits:  99%/2556  
      🟩 nvcc12.4           Pass: 100%/111 | Total Time: 17h 21m | Avg Time:  9m 23s | Hits:  98%/93640 
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time:  6m 30s | Avg Time:  3m 15s | Hits:  99%/1410  
      🟩 nvcc               Pass: 100%/129 | Total Time: 19h 20m | Avg Time:  8m 59s | Hits:  98%/107765
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time: 27m 01s | Avg Time:  4m 30s | Hits:  99%/4890  
      🟩 clang10            Pass: 100%/3   | Total Time: 14m 31s | Avg Time:  4m 50s | Hits:  99%/2562  
      🟩 clang11            Pass: 100%/4   | Total Time: 16m 37s | Avg Time:  4m 09s | Hits:  99%/3416  
      🟩 clang12            Pass: 100%/4   | Total Time: 16m 11s | Avg Time:  4m 02s | Hits:  99%/3416  
      🟩 clang13            Pass: 100%/4   | Total Time: 16m 35s | Avg Time:  4m 08s | Hits:  99%/3416  
      🟩 clang14            Pass: 100%/4   | Total Time: 16m 44s | Avg Time:  4m 11s | Hits:  99%/3416  
      🟩 clang15            Pass: 100%/4   | Total Time: 17m 01s | Avg Time:  4m 15s | Hits:  99%/3408  
      🟩 clang16            Pass: 100%/4   | Total Time: 16m 56s | Avg Time:  4m 14s | Hits:  99%/3408  
      🟩 clang17            Pass: 100%/26  | Total Time:  5h 18m | Avg Time: 12m 14s | Hits:  99%/21858 
      🟩 gcc6               Pass: 100%/2   | Total Time:  6m 46s | Avg Time:  3m 23s | Hits:  99%/1552  
      🟩 gcc7               Pass: 100%/6   | Total Time: 21m 46s | Avg Time:  3m 37s | Hits:  99%/4893  
      🟩 gcc8               Pass: 100%/6   | Total Time: 43m 42s | Avg Time:  7m 17s | Hits:  93%/4893  
      🟩 gcc9               Pass: 100%/6   | Total Time: 22m 43s | Avg Time:  3m 47s | Hits:  99%/4893  
      🟩 gcc10              Pass: 100%/4   | Total Time: 16m 38s | Avg Time:  4m 09s | Hits:  99%/3416  
      🟩 gcc11              Pass: 100%/7   | Total Time: 30m 33s | Avg Time:  4m 21s | Hits:  99%/5964  
      🟩 gcc12              Pass: 100%/4   | Total Time: 17m 19s | Avg Time:  4m 19s | Hits:  99%/3408  
      🟩 gcc13              Pass: 100%/28  | Total Time:  5h 36m | Avg Time: 12m 01s | Hits:  99%/23856 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time: 15m 11s | Avg Time:  5m 03s | Hits:  99%/2334  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 35m 51s | Avg Time: 35m 51s | Hits:  78%/696   
      🟩 MSVC14.29          Pass: 100%/2   | Total Time:  1h 06m | Avg Time: 33m 04s | Hits:  78%/1392  
      🟩 MSVC14.39          Pass: 100%/3   | Total Time:  1h 33m | Avg Time: 31m 15s | Hits:  78%/2088  
    🟩 cxx_name
      🟩 clang              Pass: 100%/59  | Total Time:  7h 39m | Avg Time:  7m 47s | Hits:  99%/49790 
      🟩 gcc                Pass: 100%/63  | Total Time:  8h 16m | Avg Time:  7m 52s | Hits:  98%/52875 
      🟩 Intel              Pass: 100%/3   | Total Time: 15m 11s | Avg Time:  5m 03s | Hits:  99%/2334  
      🟩 MSVC               Pass: 100%/6   | Total Time:  3h 15m | Avg Time: 32m 37s | Hits:  78%/4176  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total Time: 19h 27m | Avg Time:  8m 54s | Hits:  98%/109175
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total Time:  9h 58m | Avg Time:  6m 02s | Hits:  98%/81911 
      🟩 DeviceLaunch       Pass: 100%/8   | Total Time:  2h 08m | Avg Time: 16m 02s | Hits:  99%/6816  
      🟩 GraphCapture       Pass: 100%/8   | Total Time:  1h 52m | Avg Time: 14m 03s | Hits:  99%/6816  
      🟩 HostLaunch         Pass: 100%/8   | Total Time:  2h 15m | Avg Time: 16m 53s | Hits:  99%/6816  
      🟩 TestGPU            Pass: 100%/8   | Total Time:  3h 12m | Avg Time: 24m 07s | Hits:  99%/6816  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time:  1h 09m | Avg Time:  4m 57s | Hits:  96%/10873 
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time:  2h 29m | Avg Time:  4m 17s | Hits:  99%/29890 
      🟩 ubuntu22.04        Pass: 100%/76  | Total Time: 12h 32m | Avg Time:  9m 53s | Hits:  99%/64236 
      🟩 windows2022        Pass: 100%/6   | Total Time:  3h 15m | Avg Time: 32m 37s | Hits:  78%/4176  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time: 13m 25s | Avg Time:  4m 28s | Hits:  99%/2556  
      🟩 90a                Pass: 100%/4   | Total Time: 13m 51s | Avg Time:  3m 27s | Hits:  99%/3408  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total Time:  3h 52m | Avg Time:  6m 49s | Hits:  99%/28537 
      🟩 14                 Pass: 100%/37  | Total Time:  5h 40m | Avg Time:  9m 12s | Hits:  98%/30625 
      🟩 17                 Pass: 100%/36  | Total Time:  5h 47m | Avg Time:  9m 38s | Hits:  97%/29858 
      🟩 20                 Pass: 100%/24  | Total Time:  4h 06m | Avg Time: 10m 17s | Hits:  98%/20155 
    
  • 🟩 thrust: Pass: 100%/118 | Total Time: 10h 09m | Avg Time: 5m 09s | Hits: 99%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total Time:  9h 45m | Avg Time:  5m 19s | Hits:  99%/129822
      🟩 arm64              Pass: 100%/8   | Total Time: 24m 35s | Avg Time:  3m 04s | Hits:  99%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total Time: 58m 38s | Avg Time:  3m 54s | Hits:  98%/17705 
      🟩 11.8               Pass: 100%/3   | Total Time: 10m 11s | Avg Time:  3m 23s | Hits:  99%/3543  
      🟩 12.4               Pass: 100%/100 | Total Time:  9h 00m | Avg Time:  5m 24s | Hits:  99%/118018
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total Time:  6m 45s | Avg Time:  3m 22s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total Time: 58m 38s | Avg Time:  3m 54s | Hits:  98%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total Time: 10m 11s | Avg Time:  3m 23s | Hits:  99%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total Time:  8h 54m | Avg Time:  5m 26s | Hits:  99%/115658
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total Time:  6m 45s | Avg Time:  3m 22s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total Time: 10h 02m | Avg Time:  5m 11s | Hits:  99%/136906
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total Time: 21m 35s | Avg Time:  3m 35s | Hits: 100%/7080  
      🟩 clang10            Pass: 100%/3   | Total Time: 12m 25s | Avg Time:  4m 08s | Hits: 100%/3540  
      🟩 clang11            Pass: 100%/4   | Total Time: 13m 24s | Avg Time:  3m 21s | Hits: 100%/4720  
      🟩 clang12            Pass: 100%/4   | Total Time: 13m 28s | Avg Time:  3m 22s | Hits: 100%/4720  
      🟩 clang13            Pass: 100%/4   | Total Time: 13m 42s | Avg Time:  3m 25s | Hits: 100%/4720  
      🟩 clang14            Pass: 100%/4   | Total Time: 14m 20s | Avg Time:  3m 35s | Hits: 100%/4720  
      🟩 clang15            Pass: 100%/4   | Total Time: 14m 13s | Avg Time:  3m 33s | Hits: 100%/4720  
      🟩 clang16            Pass: 100%/4   | Total Time: 13m 55s | Avg Time:  3m 28s | Hits: 100%/4720  
      🟩 clang17            Pass: 100%/18  | Total Time:  1h 45m | Avg Time:  5m 51s | Hits: 100%/21240 
      🟩 gcc6               Pass: 100%/2   | Total Time:  5m 31s | Avg Time:  2m 45s | Hits:  99%/2360  
      🟩 gcc7               Pass: 100%/6   | Total Time: 18m 07s | Avg Time:  3m 01s | Hits:  99%/7086  
      🟩 gcc8               Pass: 100%/6   | Total Time: 22m 08s | Avg Time:  3m 41s | Hits:  96%/7086  
      🟩 gcc9               Pass: 100%/6   | Total Time: 18m 56s | Avg Time:  3m 09s | Hits:  99%/7086  
      🟩 gcc10              Pass: 100%/4   | Total Time: 13m 55s | Avg Time:  3m 28s | Hits:  99%/4724  
      🟩 gcc11              Pass: 100%/7   | Total Time: 24m 01s | Avg Time:  3m 25s | Hits:  99%/8267  
      🟩 gcc12              Pass: 100%/4   | Total Time: 14m 28s | Avg Time:  3m 37s | Hits:  99%/4724  
      🟩 gcc13              Pass: 100%/20  | Total Time:  2h 02m | Avg Time:  6m 08s | Hits:  99%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total Time: 13m 34s | Avg Time:  4m 31s | Hits: 100%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total Time: 14m 46s | Avg Time: 14m 46s | Hits:  98%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total Time: 24m 37s | Avg Time: 12m 18s | Hits:  98%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total Time:  1h 34m | Avg Time: 15m 42s | Hits:  98%/7056  
    🟩 cxx_name
      🟩 clang              Pass: 100%/51  | Total Time:  3h 42m | Avg Time:  4m 21s | Hits: 100%/60180 
      🟩 gcc                Pass: 100%/55  | Total Time:  4h 00m | Avg Time:  4m 21s | Hits:  99%/64953 
      🟩 Intel              Pass: 100%/3   | Total Time: 13m 34s | Avg Time:  4m 31s | Hits: 100%/3549  
      🟩 MSVC               Pass: 100%/9   | Total Time:  2h 13m | Avg Time: 14m 50s | Hits:  98%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total Time: 10h 09m | Avg Time:  5m 09s | Hits:  99%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total Time:  6h 37m | Avg Time:  4m 00s | Hits:  99%/116850
      🟩 TestCPU            Pass: 100%/11  | Total Time:  1h 43m | Avg Time:  9m 22s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total Time:  1h 49m | Avg Time: 13m 41s | Hits:  99%/9444  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total Time: 43m 52s | Avg Time:  3m 08s | Hits:  98%/16529 
      🟩 ubuntu20.04        Pass: 100%/35  | Total Time:  2h 03m | Avg Time:  3m 31s | Hits:  99%/41313 
      🟩 ubuntu22.04        Pass: 100%/60  | Total Time:  5h 08m | Avg Time:  5m 08s | Hits:  99%/70840 
      🟩 windows2022        Pass: 100%/9   | Total Time:  2h 13m | Avg Time: 14m 50s | Hits:  98%/10584 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total Time: 10m 11s | Avg Time:  3m 23s | Hits:  99%/3543  
      🟩 90a                Pass: 100%/4   | Total Time: 11m 50s | Avg Time:  2m 57s | Hits:  99%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total Time:  1h 58m | Avg Time:  3m 56s | Hits:  99%/35418 
      🟩 14                 Pass: 100%/34  | Total Time:  2h 59m | Avg Time:  5m 16s | Hits:  99%/40122 
      🟩 17                 Pass: 100%/33  | Total Time:  3h 07m | Avg Time:  5m 40s | Hits:  99%/38946 
      🟩 20                 Pass: 100%/21  | Total Time:  2h 04m | Avg Time:  5m 55s | Hits:  99%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Collaborator

@gevtushenko gevtushenko left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

A few minor documentation fixes below. Thank you for taking your time to improve examples!

//! Snippet
//! +++++++
//!
//! The code snippet below illustrates an inclusive prefix max scan of 512 integer items that
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
//! The code snippet below illustrates an inclusive prefix max scan of 512 integer items that
//! The code snippet below illustrates an inclusive prefix max scan of 128 integer items that

//! +++++++
//!
//! The code snippet below illustrates an inclusive prefix max scan of 512 integer items that
//! are partitioned in a :ref:`blocked arrangement <flexible-data-arrangement>` across 128 threads
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
//! are partitioned in a :ref:`blocked arrangement <flexible-data-arrangement>` across 128 threads
//! are partitioned in a :ref:`blocked arrangement <flexible-data-arrangement>` across 64 threads

//!
//! The code snippet below illustrates an inclusive prefix max scan of 512 integer items that
//! are partitioned in a :ref:`blocked arrangement <flexible-data-arrangement>` across 128 threads
//! where each thread owns 4 consecutive items.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
//! where each thread owns 4 consecutive items.
//! where each thread owns 2 consecutive items.

Comment on lines 2268 to 2271
//! Suppose the set of input ``thread_data`` across the block of threads is
//! ``{[0, -1], [2, -3],[4, -5], ... [126, -127]}``.
//! The corresponding output ``thread_data`` in those threads will be
//! ``{[1, 1], [2, 2],[3, 3], ... [126, 126]}``.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

suggestion: I think we cover this in the kernel itself. I think seeing input and expected output in the docs might be confusing, but feel free to disagree:

Suggested change
//! Suppose the set of input ``thread_data`` across the block of threads is
//! ``{[0, -1], [2, -3],[4, -5], ... [126, -127]}``.
//! The corresponding output ``thread_data`` in those threads will be
//! ``{[1, 1], [2, 2],[3, 3], ... [126, 126]}``.

//!
//! The code snippet below illustrates an inclusive prefix max scan of 128 integer items that
//! are partitioned in a :ref:`blocked arrangement <flexible-data-arrangement>` across 64 threads
//! where each thread owns 4 consecutive items.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
//! where each thread owns 4 consecutive items.
//! where each thread owns 2 consecutive items.

// warp #2 input: {2, 3, 4, 5, ..., 33}
// warp #4 input: {3, 4, 5, 6, ..., 34}

// Collectively compute the block-wide inclusive prefix max scan
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
// Collectively compute the block-wide inclusive prefix max scan
// Collectively compute warp-wide inclusive prefix max scans

// warp #2 input: {1, 1, 1, 1, ..., 1}
// warp #4 input: {1, 1, 1, 1, ..., 1}

// Collectively compute the block-wide inclusive prefix max scan
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
// Collectively compute the block-wide inclusive prefix max scan
// Collectively compute warp-wide inclusive prefix max scans

__global__ void InclusiveScanKernelAggr(int* output, int* d_warp_aggregate)
{
// Specialize WarpScan for type int
typedef cub::WarpScan<int> warp_scan_t;
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
typedef cub::WarpScan<int> warp_scan_t;
using warp_scan_t = cub::WarpScan<int>;


cuda::std::inclusive_scan(start, end, start, sum_op<int>{}, init_val);

expected_aggr.push_back(expected[i * 32 + 31] - init_val); // warp aggregate doed not take
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
expected_aggr.push_back(expected[i * 32 + 31] - init_val); // warp aggregate doed not take
expected_aggr.push_back(expected[i * 32 + 31] - init_val); // warp aggregate does not take

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

question: is this always equivalent to:

Suggested change
expected_aggr.push_back(expected[i * 32 + 31] - init_val); // warp aggregate doed not take
expected_aggr.push_back(32); // warp aggregate does not take

};
// input: {[0, -1], [2, -3],[4, -5], ... [126, -127]}

// Collectively compute the block-wide inclusive prefix max scan
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

question: I was under impression that it's "prefix sum" or "scan". I never heart of "prefix scan". Is this term used in the wild or is it a typo?

Copy link
Contributor

🟨 CI finished in 5h 44m: Pass: 99%/249 | Total: 2d 07h | Avg: 13m 20s | Max: 2h 01m | Hits: 91%/246866
  • 🟨 cub: Pass: 98%/131 | Total: 1d 08h | Avg: 14m 41s | Max: 2h 01m | Hits: 92%/107600

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  98%/123 | Total:  1d 07h | Avg: 15m 19s | Max:  2h 01m | Hits:  92%/100776
      🟩 arm64              Pass: 100%/8   | Total: 38m 56s | Avg:  4m 52s | Max:  5m 24s | Hits:  99%/6824  
    🔍 ctk: 12.4 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 47m | Avg:  7m 10s | Max: 51m 28s | Hits:  96%/11584 
      🟩 11.8               Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🔍 12.4               Pass:  98%/113 | Total:  1d 06h | Avg: 15m 57s | Max:  2h 01m | Hits:  92%/93457 
    🔍 cudacxx_full: nvcc12.4 🔍
      🟩 clang-cuda17       Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 44s | Hits:  99%/1412  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 47m | Avg:  7m 10s | Max: 51m 28s | Hits:  96%/11584 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🔍 nvcc12.4           Pass:  98%/111 | Total:  1d 05h | Avg: 16m 10s | Max:  2h 01m | Hits:  91%/92045 
    🔍 cudacxx_name: nvcc 🔍
      🟩 clang-cuda         Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 44s | Hits:  99%/1412  
      🔍 nvcc               Pass:  98%/129 | Total:  1d 07h | Avg: 14m 51s | Max:  2h 01m | Hits:  92%/106188
    🔍 cxx_full: gcc13 🔍
      🟩 clang9             Pass: 100%/6   | Total: 28m 00s | Avg:  4m 40s | Max:  5m 45s | Hits:  99%/4896  
      🟩 clang10            Pass: 100%/3   | Total: 15m 42s | Avg:  5m 14s | Max:  5m 15s | Hits:  99%/2565  
      🟩 clang11            Pass: 100%/4   | Total: 19m 07s | Avg:  4m 46s | Max:  5m 21s | Hits:  99%/3420  
      🟩 clang12            Pass: 100%/4   | Total: 17m 57s | Avg:  4m 29s | Max:  4m 39s | Hits:  99%/3420  
      🟩 clang13            Pass: 100%/4   | Total: 17m 49s | Avg:  4m 27s | Max:  4m 37s | Hits:  99%/3420  
      🟩 clang14            Pass: 100%/4   | Total: 17m 51s | Avg:  4m 27s | Max:  4m 42s | Hits:  99%/3420  
      🟩 clang15            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 43s | Hits:  99%/3412  
      🟩 clang16            Pass: 100%/4   | Total: 19m 51s | Avg:  4m 57s | Max:  5m 31s | Hits:  99%/3412  
      🟩 clang17            Pass: 100%/26  | Total: 10h 06m | Avg: 23m 19s | Max:  2h 01m | Hits:  93%/21884 
      🟩 gcc6               Pass: 100%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  5m 15s | Hits:  99%/1554  
      🟩 gcc7               Pass: 100%/6   | Total: 24m 19s | Avg:  4m 03s | Max:  5m 17s | Hits:  99%/4899  
      🟩 gcc8               Pass: 100%/6   | Total: 26m 55s | Avg:  4m 29s | Max:  6m 25s | Hits:  99%/4899  
      🟩 gcc9               Pass: 100%/6   | Total: 56m 42s | Avg:  9m 27s | Max: 35m 42s | Hits:  92%/4899  
      🟩 gcc10              Pass: 100%/4   | Total: 52m 17s | Avg: 13m 04s | Max: 38m 37s | Hits:  89%/3420  
      🟩 gcc11              Pass: 100%/7   | Total: 32m 21s | Avg:  4m 37s | Max:  5m 10s | Hits:  99%/5971  
      🟩 gcc12              Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 06s | Hits:  99%/3412  
      🔍 gcc13              Pass:  92%/28  | Total:  9h 03m | Avg: 19m 25s | Max:  1h 20m | Hits:  92%/22178 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 52m | Avg: 37m 23s | Max: 38m 52s | Hits:  37%/2337  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 28s | Avg: 51m 28s | Max: 51m 28s | Hits:  39%/697   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 04s | Max: 47m 13s | Hits:  39%/1394  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 19m | Avg: 46m 32s | Max: 48m 28s | Hits:  39%/2091  
    🔍 cxx_name: gcc 🔍
      🟩 clang              Pass: 100%/59  | Total: 12h 40m | Avg: 12m 53s | Max:  2h 01m | Hits:  97%/49849 
      🔍 gcc                Pass:  96%/63  | Total: 12h 46m | Avg: 12m 10s | Max:  1h 20m | Hits:  95%/51232 
      🟩 Intel              Pass: 100%/3   | Total:  1h 52m | Avg: 37m 23s | Max: 38m 52s | Hits:  37%/2337  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 45m | Avg: 47m 32s | Max: 51m 28s | Hits:  39%/4182  
    🔍 os: ubuntu22.04 🔍
      🟩 ubuntu18.04        Pass: 100%/14  | Total: 56m 03s | Avg:  4m 00s | Max:  5m 15s | Hits:  99%/10887 
      🟩 ubuntu20.04        Pass: 100%/35  | Total:  3h 49m | Avg:  6m 33s | Max: 38m 37s | Hits:  97%/29925 
      🔍 ubuntu22.04        Pass:  97%/76  | Total: 22h 33m | Avg: 17m 48s | Max:  2h 01m | Hits:  92%/62606 
      🟩 windows2022        Pass: 100%/6   | Total:  4h 45m | Avg: 47m 32s | Max: 51m 28s | Hits:  39%/4182  
    🟨 jobs
      🟩 Build              Pass: 100%/99  | Total: 17h 10m | Avg: 10m 24s | Max: 51m 28s | Hits:  90%/82010 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  4h 09m | Avg: 31m 10s | Max:  1h 20m | Hits:  99%/6824  
      🟨 GraphCapture       Pass:  87%/8   | Total:  2h 25m | Avg: 18m 11s | Max: 51m 53s | Hits:  99%/5971  
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 32m | Avg: 26m 33s | Max:  1h 08m | Hits:  99%/6824  
      🟨 TestGPU            Pass:  87%/8   | Total:  4h 47m | Avg: 35m 54s | Max:  2h 01m | Hits:  99%/5971  
    🟨 std
      🟩 11                 Pass: 100%/34  | Total:  7h 46m | Avg: 13m 42s | Max:  1h 08m | Hits:  94%/28571 
      🟨 14                 Pass:  97%/37  | Total: 10h 47m | Avg: 17m 29s | Max:  2h 01m | Hits:  90%/29809 
      🟩 17                 Pass: 100%/36  | Total:  8h 30m | Avg: 14m 11s | Max:  1h 20m | Hits:  92%/29894 
      🟨 20                 Pass:  95%/24  | Total:  5h 00m | Avg: 12m 32s | Max: 47m 46s | Hits:  93%/19326 
    🟨 gpu
      🟨 v100               Pass:  98%/131 | Total:  1d 08h | Avg: 14m 41s | Max:  2h 01m | Hits:  92%/107600
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🟩 90a                Pass: 100%/4   | Total: 15m 29s | Avg:  3m 52s | Max:  4m 24s | Hits:  99%/3412  
    
  • 🟩 thrust: Pass: 100%/118 | Total: 23h 16m | Avg: 11m 49s | Max: 58m 07s | Hits: 91%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 22h 40m | Avg: 12m 22s | Max: 58m 07s | Hits:  90%/129822
      🟩 arm64              Pass: 100%/8   | Total: 35m 25s | Avg:  4m 25s | Max:  6m 29s | Hits:  99%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 01m | Avg:  8m 06s | Max: 49m 22s | Hits:  94%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 12.4               Pass: 100%/100 | Total: 20h 23m | Avg: 12m 14s | Max: 58m 07s | Hits:  91%/118018
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 46s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 01m | Avg:  8m 06s | Max: 49m 22s | Hits:  94%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total: 20h 16m | Avg: 12m 24s | Max: 58m 07s | Hits:  90%/115658
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 46s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 23h 08m | Avg: 11m 58s | Max: 58m 07s | Hits:  91%/136906
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total: 24m 06s | Avg:  4m 01s | Max:  5m 11s | Hits: 100%/7080  
      🟩 clang10            Pass: 100%/3   | Total: 13m 39s | Avg:  4m 33s | Max:  4m 47s | Hits: 100%/3540  
      🟩 clang11            Pass: 100%/4   | Total: 16m 09s | Avg:  4m 02s | Max:  4m 20s | Hits: 100%/4720  
      🟩 clang12            Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  4m 00s | Hits: 100%/4720  
      🟩 clang13            Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  4m 09s | Hits: 100%/4720  
      🟩 clang14            Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 10s | Hits: 100%/4720  
      🟩 clang15            Pass: 100%/4   | Total: 15m 24s | Avg:  3m 51s | Max:  3m 59s | Hits: 100%/4720  
      🟩 clang16            Pass: 100%/4   | Total: 15m 33s | Avg:  3m 53s | Max:  4m 03s | Hits: 100%/4720  
      🟩 clang17            Pass: 100%/18  | Total:  3h 27m | Avg: 11m 32s | Max: 37m 57s | Hits:  94%/21240 
      🟩 gcc6               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 12s | Hits:  99%/2360  
      🟩 gcc7               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 18s | Max: 23m 54s | Hits:  83%/7086  
      🟩 gcc8               Pass: 100%/6   | Total: 52m 55s | Avg:  8m 49s | Max: 17m 47s | Hits:  89%/7086  
      🟩 gcc9               Pass: 100%/6   | Total:  1h 03m | Avg: 10m 31s | Max: 18m 13s | Hits:  87%/7086  
      🟩 gcc10              Pass: 100%/4   | Total: 39m 31s | Avg:  9m 52s | Max: 13m 12s | Hits:  86%/4724  
      🟩 gcc11              Pass: 100%/7   | Total:  1h 35m | Avg: 13m 40s | Max: 21m 26s | Hits:  83%/8267  
      🟩 gcc12              Pass: 100%/4   | Total: 50m 40s | Avg: 12m 40s | Max: 16m 11s | Hits:  81%/4724  
      🟩 gcc13              Pass: 100%/20  | Total:  3h 21m | Avg: 10m 03s | Max: 22m 44s | Hits:  94%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 44m | Avg: 34m 40s | Max: 39m 33s | Hits:  63%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 49m 22s | Avg: 49m 22s | Max: 49m 22s | Hits:  61%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 03s | Max: 52m 51s | Hits:  61%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 38m | Avg: 36m 29s | Max: 58m 07s | Hits:  80%/7056  
    🟩 cxx_name
      🟩 clang              Pass: 100%/51  | Total:  5h 38m | Avg:  6m 38s | Max: 37m 57s | Hits:  98%/60180 
      🟩 gcc                Pass: 100%/55  | Total:  9h 42m | Avg: 10m 35s | Max: 23m 54s | Hits:  89%/64953 
      🟩 Intel              Pass: 100%/3   | Total:  1h 44m | Avg: 34m 40s | Max: 39m 33s | Hits:  63%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  6h 10m | Avg: 41m 09s | Max: 58m 07s | Hits:  74%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 23h 16m | Avg: 11m 49s | Max: 58m 07s | Hits:  91%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total: 19h 10m | Avg: 11m 37s | Max: 58m 07s | Hits:  89%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 47m | Avg:  9m 46s | Max: 20m 50s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 18m | Avg: 17m 15s | Max: 37m 57s | Hits:  99%/9444  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total:  1h 12m | Avg:  5m 09s | Max: 23m 54s | Hits:  96%/16529 
      🟩 ubuntu20.04        Pass: 100%/35  | Total:  4h 23m | Avg:  7m 31s | Max: 18m 13s | Hits:  93%/41313 
      🟩 ubuntu22.04        Pass: 100%/60  | Total: 11h 30m | Avg: 11m 30s | Max: 39m 33s | Hits:  91%/70840 
      🟩 windows2022        Pass: 100%/9   | Total:  6h 10m | Avg: 41m 09s | Max: 58m 07s | Hits:  74%/10584 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 90a                Pass: 100%/4   | Total: 15m 45s | Avg:  3m 56s | Max:  5m 34s | Hits:  96%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 44m | Avg:  7m 29s | Max: 27m 05s | Hits:  94%/35418 
      🟩 14                 Pass: 100%/34  | Total:  7h 32m | Avg: 13m 19s | Max: 52m 51s | Hits:  89%/40122 
      🟩 17                 Pass: 100%/33  | Total:  7h 31m | Avg: 13m 41s | Max: 53m 24s | Hits:  89%/38946 
      🟩 20                 Pass: 100%/21  | Total:  4h 26m | Avg: 12m 42s | Max: 58m 07s | Hits:  92%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 19h 49m: Pass: 100%/249 | Total: 2d 07h | Avg: 13m 28s | Max: 2h 01m | Hits: 91%/248572
  • 🟩 cub: Pass: 100%/131 | Total: 1d 08h | Avg: 14m 57s | Max: 2h 01m | Hits: 92%/109306

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total:  1d 08h | Avg: 15m 36s | Max:  2h 01m | Hits:  92%/102482
      🟩 arm64              Pass: 100%/8   | Total: 38m 56s | Avg:  4m 52s | Max:  5m 24s | Hits:  99%/6824  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 47m | Avg:  7m 10s | Max: 51m 28s | Hits:  96%/11584 
      🟩 11.8               Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🟩 12.4               Pass: 100%/113 | Total:  1d 06h | Avg: 16m 15s | Max:  2h 01m | Hits:  92%/95163 
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 44s | Hits:  99%/1412  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 47m | Avg:  7m 10s | Max: 51m 28s | Hits:  96%/11584 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🟩 nvcc12.4           Pass: 100%/111 | Total:  1d 06h | Avg: 16m 29s | Max:  2h 01m | Hits:  92%/93751 
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total:  7m 23s | Avg:  3m 41s | Max:  3m 44s | Hits:  99%/1412  
      🟩 nvcc               Pass: 100%/129 | Total:  1d 08h | Avg: 15m 07s | Max:  2h 01m | Hits:  92%/107894
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total: 28m 00s | Avg:  4m 40s | Max:  5m 45s | Hits:  99%/4896  
      🟩 clang10            Pass: 100%/3   | Total: 15m 42s | Avg:  5m 14s | Max:  5m 15s | Hits:  99%/2565  
      🟩 clang11            Pass: 100%/4   | Total: 19m 07s | Avg:  4m 46s | Max:  5m 21s | Hits:  99%/3420  
      🟩 clang12            Pass: 100%/4   | Total: 17m 57s | Avg:  4m 29s | Max:  4m 39s | Hits:  99%/3420  
      🟩 clang13            Pass: 100%/4   | Total: 17m 49s | Avg:  4m 27s | Max:  4m 37s | Hits:  99%/3420  
      🟩 clang14            Pass: 100%/4   | Total: 17m 51s | Avg:  4m 27s | Max:  4m 42s | Hits:  99%/3420  
      🟩 clang15            Pass: 100%/4   | Total: 18m 08s | Avg:  4m 32s | Max:  4m 43s | Hits:  99%/3412  
      🟩 clang16            Pass: 100%/4   | Total: 19m 51s | Avg:  4m 57s | Max:  5m 31s | Hits:  99%/3412  
      🟩 clang17            Pass: 100%/26  | Total: 10h 06m | Avg: 23m 19s | Max:  2h 01m | Hits:  93%/21884 
      🟩 gcc6               Pass: 100%/2   | Total:  9m 11s | Avg:  4m 35s | Max:  5m 15s | Hits:  99%/1554  
      🟩 gcc7               Pass: 100%/6   | Total: 24m 19s | Avg:  4m 03s | Max:  5m 17s | Hits:  99%/4899  
      🟩 gcc8               Pass: 100%/6   | Total: 26m 55s | Avg:  4m 29s | Max:  6m 25s | Hits:  99%/4899  
      🟩 gcc9               Pass: 100%/6   | Total: 56m 42s | Avg:  9m 27s | Max: 35m 42s | Hits:  92%/4899  
      🟩 gcc10              Pass: 100%/4   | Total: 52m 17s | Avg: 13m 04s | Max: 38m 37s | Hits:  89%/3420  
      🟩 gcc11              Pass: 100%/7   | Total: 32m 21s | Avg:  4m 37s | Max:  5m 10s | Hits:  99%/5971  
      🟩 gcc12              Pass: 100%/4   | Total: 21m 00s | Avg:  5m 15s | Max:  6m 06s | Hits:  99%/3412  
      🟩 gcc13              Pass: 100%/28  | Total:  9h 38m | Avg: 20m 38s | Max:  1h 20m | Hits:  93%/23884 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 52m | Avg: 37m 23s | Max: 38m 52s | Hits:  37%/2337  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 51m 28s | Avg: 51m 28s | Max: 51m 28s | Hits:  39%/697   
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 34m | Avg: 47m 04s | Max: 47m 13s | Hits:  39%/1394  
      🟩 MSVC14.39          Pass: 100%/3   | Total:  2h 19m | Avg: 46m 32s | Max: 48m 28s | Hits:  39%/2091  
    🟩 cxx_name
      🟩 clang              Pass: 100%/59  | Total: 12h 40m | Avg: 12m 53s | Max:  2h 01m | Hits:  97%/49849 
      🟩 gcc                Pass: 100%/63  | Total: 13h 20m | Avg: 12m 42s | Max:  1h 20m | Hits:  95%/52938 
      🟩 Intel              Pass: 100%/3   | Total:  1h 52m | Avg: 37m 23s | Max: 38m 52s | Hits:  37%/2337  
      🟩 MSVC               Pass: 100%/6   | Total:  4h 45m | Avg: 47m 32s | Max: 51m 28s | Hits:  39%/4182  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total:  1d 08h | Avg: 14m 57s | Max:  2h 01m | Hits:  92%/109306
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total: 17h 10m | Avg: 10m 24s | Max: 51m 28s | Hits:  90%/82010 
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  4h 09m | Avg: 31m 10s | Max:  1h 20m | Hits:  99%/6824  
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 35m | Avg: 19m 29s | Max: 51m 53s | Hits:  99%/6824  
      🟩 HostLaunch         Pass: 100%/8   | Total:  3h 32m | Avg: 26m 33s | Max:  1h 08m | Hits:  99%/6824  
      🟩 TestGPU            Pass: 100%/8   | Total:  5h 11m | Avg: 38m 53s | Max:  2h 01m | Hits:  99%/6824  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total: 56m 03s | Avg:  4m 00s | Max:  5m 15s | Hits:  99%/10887 
      🟩 ubuntu20.04        Pass: 100%/35  | Total:  3h 49m | Avg:  6m 33s | Max: 38m 37s | Hits:  97%/29925 
      🟩 ubuntu22.04        Pass: 100%/76  | Total: 23h 08m | Avg: 18m 15s | Max:  2h 01m | Hits:  92%/64312 
      🟩 windows2022        Pass: 100%/6   | Total:  4h 45m | Avg: 47m 32s | Max: 51m 28s | Hits:  39%/4182  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 14m 24s | Avg:  4m 48s | Max:  5m 10s | Hits:  99%/2559  
      🟩 90a                Pass: 100%/4   | Total: 15m 29s | Avg:  3m 52s | Max:  4m 24s | Hits:  99%/3412  
    🟩 std
      🟩 11                 Pass: 100%/34  | Total:  7h 46m | Avg: 13m 42s | Max:  1h 08m | Hits:  94%/28571 
      🟩 14                 Pass: 100%/37  | Total: 11h 10m | Avg: 18m 07s | Max:  2h 01m | Hits:  90%/30662 
      🟩 17                 Pass: 100%/36  | Total:  8h 30m | Avg: 14m 11s | Max:  1h 20m | Hits:  92%/29894 
      🟩 20                 Pass: 100%/24  | Total:  5h 11m | Avg: 12m 58s | Max: 47m 46s | Hits:  93%/20179 
    
  • 🟩 thrust: Pass: 100%/118 | Total: 23h 16m | Avg: 11m 49s | Max: 58m 07s | Hits: 91%/139266

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 22h 40m | Avg: 12m 22s | Max: 58m 07s | Hits:  90%/129822
      🟩 arm64              Pass: 100%/8   | Total: 35m 25s | Avg:  4m 25s | Max:  6m 29s | Hits:  99%/9444  
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  2h 01m | Avg:  8m 06s | Max: 49m 22s | Hits:  94%/17705 
      🟩 11.8               Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 12.4               Pass: 100%/100 | Total: 20h 23m | Avg: 12m 14s | Max: 58m 07s | Hits:  91%/118018
    🟩 cudacxx_full
      🟩 clang-cuda17       Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 46s | Hits: 100%/2360  
      🟩 nvcc11.1           Pass: 100%/15  | Total:  2h 01m | Avg:  8m 06s | Max: 49m 22s | Hits:  94%/17705 
      🟩 nvcc11.8           Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 nvcc12.4           Pass: 100%/98  | Total: 20h 16m | Avg: 12m 24s | Max: 58m 07s | Hits:  90%/115658
    🟩 cudacxx_name
      🟩 clang-cuda         Pass: 100%/2   | Total:  7m 29s | Avg:  3m 44s | Max:  3m 46s | Hits: 100%/2360  
      🟩 nvcc               Pass: 100%/116 | Total: 23h 08m | Avg: 11m 58s | Max: 58m 07s | Hits:  91%/136906
    🟩 cxx_full
      🟩 clang9             Pass: 100%/6   | Total: 24m 06s | Avg:  4m 01s | Max:  5m 11s | Hits: 100%/7080  
      🟩 clang10            Pass: 100%/3   | Total: 13m 39s | Avg:  4m 33s | Max:  4m 47s | Hits: 100%/3540  
      🟩 clang11            Pass: 100%/4   | Total: 16m 09s | Avg:  4m 02s | Max:  4m 20s | Hits: 100%/4720  
      🟩 clang12            Pass: 100%/4   | Total: 15m 12s | Avg:  3m 48s | Max:  4m 00s | Hits: 100%/4720  
      🟩 clang13            Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  4m 09s | Hits: 100%/4720  
      🟩 clang14            Pass: 100%/4   | Total: 15m 36s | Avg:  3m 54s | Max:  4m 10s | Hits: 100%/4720  
      🟩 clang15            Pass: 100%/4   | Total: 15m 24s | Avg:  3m 51s | Max:  3m 59s | Hits: 100%/4720  
      🟩 clang16            Pass: 100%/4   | Total: 15m 33s | Avg:  3m 53s | Max:  4m 03s | Hits: 100%/4720  
      🟩 clang17            Pass: 100%/18  | Total:  3h 27m | Avg: 11m 32s | Max: 37m 57s | Hits:  94%/21240 
      🟩 gcc6               Pass: 100%/2   | Total:  6m 02s | Avg:  3m 01s | Max:  3m 12s | Hits:  99%/2360  
      🟩 gcc7               Pass: 100%/6   | Total:  1h 13m | Avg: 12m 18s | Max: 23m 54s | Hits:  83%/7086  
      🟩 gcc8               Pass: 100%/6   | Total: 52m 55s | Avg:  8m 49s | Max: 17m 47s | Hits:  89%/7086  
      🟩 gcc9               Pass: 100%/6   | Total:  1h 03m | Avg: 10m 31s | Max: 18m 13s | Hits:  87%/7086  
      🟩 gcc10              Pass: 100%/4   | Total: 39m 31s | Avg:  9m 52s | Max: 13m 12s | Hits:  86%/4724  
      🟩 gcc11              Pass: 100%/7   | Total:  1h 35m | Avg: 13m 40s | Max: 21m 26s | Hits:  83%/8267  
      🟩 gcc12              Pass: 100%/4   | Total: 50m 40s | Avg: 12m 40s | Max: 16m 11s | Hits:  81%/4724  
      🟩 gcc13              Pass: 100%/20  | Total:  3h 21m | Avg: 10m 03s | Max: 22m 44s | Hits:  94%/23620 
      🟩 Intel2023.2.0      Pass: 100%/3   | Total:  1h 44m | Avg: 34m 40s | Max: 39m 33s | Hits:  63%/3549  
      🟩 MSVC14.16          Pass: 100%/1   | Total: 49m 22s | Avg: 49m 22s | Max: 49m 22s | Hits:  61%/1176  
      🟩 MSVC14.29          Pass: 100%/2   | Total:  1h 42m | Avg: 51m 03s | Max: 52m 51s | Hits:  61%/2352  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  3h 38m | Avg: 36m 29s | Max: 58m 07s | Hits:  80%/7056  
    🟩 cxx_name
      🟩 clang              Pass: 100%/51  | Total:  5h 38m | Avg:  6m 38s | Max: 37m 57s | Hits:  98%/60180 
      🟩 gcc                Pass: 100%/55  | Total:  9h 42m | Avg: 10m 35s | Max: 23m 54s | Hits:  89%/64953 
      🟩 Intel              Pass: 100%/3   | Total:  1h 44m | Avg: 34m 40s | Max: 39m 33s | Hits:  63%/3549  
      🟩 MSVC               Pass: 100%/9   | Total:  6h 10m | Avg: 41m 09s | Max: 58m 07s | Hits:  74%/10584 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 23h 16m | Avg: 11m 49s | Max: 58m 07s | Hits:  91%/139266
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total: 19h 10m | Avg: 11m 37s | Max: 58m 07s | Hits:  89%/116850
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 47m | Avg:  9m 46s | Max: 20m 50s | Hits:  99%/12972 
      🟩 TestGPU            Pass: 100%/8   | Total:  2h 18m | Avg: 17m 15s | Max: 37m 57s | Hits:  99%/9444  
    🟩 os
      🟩 ubuntu18.04        Pass: 100%/14  | Total:  1h 12m | Avg:  5m 09s | Max: 23m 54s | Hits:  96%/16529 
      🟩 ubuntu20.04        Pass: 100%/35  | Total:  4h 23m | Avg:  7m 31s | Max: 18m 13s | Hits:  93%/41313 
      🟩 ubuntu22.04        Pass: 100%/60  | Total: 11h 30m | Avg: 11m 30s | Max: 39m 33s | Hits:  91%/70840 
      🟩 windows2022        Pass: 100%/9   | Total:  6h 10m | Avg: 41m 09s | Max: 58m 07s | Hits:  74%/10584 
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 50m 49s | Avg: 16m 56s | Max: 21m 26s | Hits:  81%/3543  
      🟩 90a                Pass: 100%/4   | Total: 15m 45s | Avg:  3m 56s | Max:  5m 34s | Hits:  96%/4724  
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 44m | Avg:  7m 29s | Max: 27m 05s | Hits:  94%/35418 
      🟩 14                 Pass: 100%/34  | Total:  7h 32m | Avg: 13m 19s | Max: 52m 51s | Hits:  89%/40122 
      🟩 17                 Pass: 100%/33  | Total:  7h 31m | Avg: 13m 41s | Max: 53m 24s | Hits:  89%/38946 
      🟩 20                 Pass: 100%/21  | Total:  4h 26m | Avg: 12m 42s | Max: 58m 07s | Hits:  92%/24780 
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
+/- CUB
Thrust
CUDA Experimental

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
CUDA Experimental

🏃‍ Runner counts (total jobs: 249)

# Runner
178 linux-amd64-cpu16
40 linux-amd64-gpu-v100-latest-1
16 linux-arm64-cpu16
15 windows-amd64-cpu16

@gonidelis gonidelis merged commit bc08dfa into NVIDIA:main Jun 11, 2024
551 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

None yet

3 participants