Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use the in, out, and inout parameter decorators from cudax::launch #2294

Merged
merged 2 commits into from
Aug 27, 2024

Conversation

ericniebler
Copy link
Collaborator

Description

The vector_add cudax sample makes use of temporary versions of cudax::[in|out|inout] for annotating launch arguments. The sample should now be updated to use the versions that are included with <cuda/experimental/launch.cuh>

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

@ericniebler ericniebler added CUDA Next Feature intended for the Cuda Next experimental library pre-commit.ci autofix apply pre-commit auto-fixes automatically labels Aug 26, 2024
@ericniebler ericniebler requested a review from a team as a code owner August 26, 2024 19:14
@pre-commit-ci pre-commit-ci bot removed the pre-commit.ci autofix apply pre-commit auto-fixes automatically label Aug 26, 2024
@pre-commit-ci pre-commit-ci bot requested review from a team as code owners August 26, 2024 19:16
Copy link

copy-pr-bot bot commented Aug 26, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@ericniebler ericniebler enabled auto-merge (squash) August 26, 2024 19:22
@ericniebler
Copy link
Collaborator Author

/ok to test

Copy link
Contributor

🟨 CI finished in 8h 31m: Pass: 99%/305 | Total: 1d 09h | Avg: 6m 30s | Max: 43m 03s | Hits: 98%/17461
  • 🟨 cub: Pass: 99%/131 | Total: 19h 02m | Avg: 8m 43s | Max: 43m 03s | Hits: 99%/4278

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  99%/123 | Total: 18h 27m | Avg:  9m 00s | Max: 43m 03s | Hits:  99%/4278  
      🟩 arm64              Pass: 100%/8   | Total: 34m 29s | Avg:  4m 18s | Max:  5m 05s
    🔍 ctk: 12.5 🔍
      🟩 11.1               Pass: 100%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 43m 03s | Hits:  99%/713   
      🟩 11.8               Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🔍 12.5               Pass:  99%/113 | Total: 17h 02m | Avg:  9m 03s | Max: 31m 38s | Hits:  99%/3565  
    🔍 cudacxx: nvcc12.5 🔍
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 43s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 43m 03s | Hits:  99%/713   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🔍 nvcc12.5           Pass:  99%/111 | Total: 16h 55m | Avg:  9m 08s | Max: 31m 38s | Hits:  99%/3565  
    🔍 cudacxx_family: nvcc 🔍
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 43s
      🔍 nvcc               Pass:  99%/129 | Total: 18h 54m | Avg:  8m 47s | Max: 43m 03s | Hits:  99%/4278  
    🔍 cxx: GCC13 🔍
      🟩 Clang9             Pass: 100%/6   | Total: 26m 55s | Avg:  4m 29s | Max:  5m 17s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 55s | Avg:  5m 18s | Max:  5m 45s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 34s | Avg:  4m 23s | Max:  4m 28s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  5m 02s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 57s | Avg:  4m 29s | Max:  4m 42s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 43s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 10s | Avg:  4m 32s | Max:  4m 39s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  4m 30s
      🟩 Clang17            Pass: 100%/26  | Total:  6h 04m | Avg: 14m 00s | Max: 31m 38s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 14s | Avg:  4m 07s | Max:  4m 45s
      🟩 GCC7               Pass: 100%/6   | Total: 22m 54s | Avg:  3m 49s | Max:  4m 22s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 50s | Avg:  3m 58s | Max:  4m 24s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 03m | Avg: 10m 39s | Max: 43m 03s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 39s | Avg:  4m 24s | Max:  4m 49s
      🟩 GCC11              Pass: 100%/7   | Total: 29m 46s | Avg:  4m 15s | Max:  4m 32s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 38s | Avg:  4m 39s | Max:  5m 02s
      🔍 GCC13              Pass:  96%/28  | Total:  5h 51m | Avg: 12m 33s | Max: 28m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 03s | Avg: 14m 03s | Max: 14m 03s | Hits:  99%/713   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 00s | Avg: 11m 30s | Max: 11m 31s | Hits:  99%/1426  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 38m 07s | Avg: 12m 42s | Max: 13m 01s | Hits:  99%/2139  
    🔍 cxx_family: GCC 🔍
      🟩 Clang              Pass: 100%/59  | Total:  8h 34m | Avg:  8m 43s | Max: 31m 38s
      🔍 GCC                Pass:  98%/63  | Total:  8h 56m | Avg:  8m 31s | Max: 43m 03s
      🟩 Intel              Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 33s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 15m | Avg: 12m 31s | Max: 14m 03s | Hits:  99%/4278  
    🔍 jobs: DeviceLaunch 🔍
      🟩 Build              Pass: 100%/99  | Total:  8h 39m | Avg:  5m 14s | Max: 43m 03s | Hits:  99%/4278  
      🔍 DeviceLaunch       Pass:  87%/8   | Total:  2h 20m | Avg: 17m 34s | Max: 23m 03s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 07m | Avg: 15m 57s | Max: 18m 13s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 26m | Avg: 18m 20s | Max: 21m 19s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 27m | Avg: 25m 58s | Max: 31m 38s
    🔍 std: 11 🔍
      🔍 11                 Pass:  97%/34  | Total:  4h 18m | Avg:  7m 36s | Max: 28m 36s
      🟩 14                 Pass: 100%/37  | Total:  4h 58m | Avg:  8m 04s | Max: 24m 37s | Hits:  99%/2139  
      🟩 17                 Pass: 100%/36  | Total:  5h 43m | Avg:  9m 32s | Max: 43m 03s | Hits:  99%/1426  
      🟩 20                 Pass: 100%/24  | Total:  4h 01m | Avg: 10m 02s | Max: 31m 38s | Hits:  99%/713   
    🟨 gpu
      🟨 v100               Pass:  99%/131 | Total: 19h 02m | Avg:  8m 43s | Max: 43m 03s | Hits:  99%/4278  
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🟩 90a                Pass: 100%/4   | Total: 14m 45s | Avg:  3m 41s | Max:  3m 55s
    
  • 🟩 thrust: Pass: 100%/118 | Total: 11h 22m | Avg: 5m 47s | Max: 18m 59s | Hits: 99%/13077

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 50m | Avg:  5m 54s | Max: 18m 59s | Hits:  99%/13077 
      🟩 arm64              Pass: 100%/8   | Total: 32m 18s | Avg:  4m 02s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 28s | Max: 17m 29s | Hits:  98%/1453  
      🟩 11.8               Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 12.5               Pass: 100%/100 | Total: 10h 03m | Avg:  6m 02s | Max: 18m 59s | Hits:  99%/11624 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 15s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 28s | Max: 17m 29s | Hits:  98%/1453  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  9h 55m | Avg:  6m 04s | Max: 18m 59s | Hits:  99%/11624 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 15s
      🟩 nvcc               Pass: 100%/116 | Total: 11h 14m | Avg:  5m 48s | Max: 18m 59s | Hits:  99%/13077 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 25m 44s | Avg:  4m 17s | Max:  5m 25s
      🟩 Clang10            Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  5m 15s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 07s | Avg:  4m 16s | Max:  5m 06s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 18s | Avg:  4m 04s | Max:  4m 20s
      🟩 Clang13            Pass: 100%/4   | Total: 16m 05s | Avg:  4m 01s | Max:  4m 30s
      🟩 Clang14            Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 18s
      🟩 Clang15            Pass: 100%/4   | Total: 16m 21s | Avg:  4m 05s | Max:  4m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 16m 34s | Avg:  4m 08s | Max:  4m 26s
      🟩 Clang17            Pass: 100%/18  | Total:  1h 52m | Avg:  6m 14s | Max: 12m 36s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 27s | Avg:  3m 13s | Max:  3m 14s
      🟩 GCC7               Pass: 100%/6   | Total: 21m 18s | Avg:  3m 33s | Max:  4m 06s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 17s | Avg:  3m 52s | Max:  4m 20s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 33s | Avg:  3m 45s | Max:  4m 18s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 50s | Avg:  3m 57s | Max:  4m 11s
      🟩 GCC11              Pass: 100%/7   | Total: 28m 47s | Avg:  4m 06s | Max:  4m 49s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 52s | Avg:  4m 28s | Max:  5m 00s
      🟩 GCC13              Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 13m 50s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 29s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 29s | Avg: 17m 29s | Max: 17m 29s | Hits:  98%/1453  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 29m 30s | Avg: 14m 45s | Max: 15m 14s | Hits:  98%/2906  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 42m | Avg: 17m 06s | Max: 18m 59s | Hits:  99%/8718  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 11m | Avg:  4m 56s | Max: 12m 36s
      🟩 GCC                Pass: 100%/55  | Total:  4h 25m | Avg:  4m 49s | Max: 13m 50s
      🟩 Intel              Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 29s
      🟩 MSVC               Pass: 100%/9   | Total:  2h 29m | Avg: 16m 37s | Max: 18m 59s | Hits:  99%/13077 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 11h 22m | Avg:  5m 47s | Max: 18m 59s | Hits:  99%/13077 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 53m | Avg:  4m 47s | Max: 17m 29s | Hits:  98%/8718  
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 48m | Avg:  9m 54s | Max: 18m 59s | Hits:  99%/4359  
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 39m | Avg: 12m 28s | Max: 13m 50s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 90a                Pass: 100%/4   | Total: 14m 29s | Avg:  3m 37s | Max:  3m 46s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 15m | Avg:  4m 31s | Max: 12m 40s
      🟩 14                 Pass: 100%/34  | Total:  3h 29m | Avg:  6m 09s | Max: 18m 27s | Hits:  99%/5812  
      🟩 17                 Pass: 100%/33  | Total:  3h 19m | Avg:  6m 03s | Max: 18m 40s | Hits:  99%/4359  
      🟩 20                 Pass: 100%/21  | Total:  2h 17m | Avg:  6m 34s | Max: 18m 59s | Hits:  99%/2906  
    
  • 🟩 cudax: Pass: 100%/55 | Total: 2h 29m | Avg: 2m 43s | Max: 9m 02s | Hits: 81%/106

    🟩 cpu
      🟩 amd64              Pass: 100%/51  | Total:  2h 22m | Avg:  2m 48s | Max:  9m 02s | Hits:  81%/106   
      🟩 arm64              Pass: 100%/4   | Total:  6m 57s | Avg:  1m 44s | Max:  1m 48s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max:  8m 49s | Hits:  81%/53    
      🟩 12.5               Pass: 100%/32  | Total:  1h 25m | Avg:  2m 41s | Max:  9m 02s | Hits:  81%/53    
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max:  8m 49s | Hits:  81%/53    
      🟩 nvcc12.5           Pass: 100%/32  | Total:  1h 25m | Avg:  2m 41s | Max:  9m 02s | Hits:  81%/53    
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/55  | Total:  2h 29m | Avg:  2m 43s | Max:  9m 02s | Hits:  81%/106   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 32s | Avg:  2m 16s | Max:  2m 22s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 24s | Avg:  2m 12s | Max:  2m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 05s | Avg:  2m 16s | Max:  2m 31s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 14s | Avg:  2m 18s | Max:  2m 31s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 01s | Avg:  2m 15s | Max:  2m 26s
      🟩 Clang14            Pass: 100%/6   | Total: 17m 47s | Avg:  2m 57s | Max:  4m 42s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 39s | Avg:  2m 19s | Max:  2m 26s
      🟩 Clang16            Pass: 100%/6   | Total: 16m 45s | Avg:  2m 47s | Max:  4m 14s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 30s | Avg:  2m 15s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 34s | Avg:  2m 08s | Max:  2m 25s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 28s | Avg:  2m 07s | Max:  2m 21s
      🟩 GCC12              Pass: 100%/12  | Total: 32m 09s | Avg:  2m 40s | Max:  4m 11s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 49s | Avg:  8m 49s | Max:  8m 49s | Hits:  81%/53    
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 02s | Avg:  9m 02s | Max:  9m 02s | Hits:  81%/53    
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 15m | Avg:  2m 30s | Max:  4m 42s
      🟩 GCC                Pass: 100%/22  | Total: 53m 41s | Avg:  2m 26s | Max:  4m 11s
      🟩 Intel              Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 51s | Avg:  8m 55s | Max:  9m 02s | Hits:  81%/106   
    🟩 gpu
      🟩 v100               Pass: 100%/55  | Total:  2h 29m | Avg:  2m 43s | Max:  9m 02s | Hits:  81%/106   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  1h 57m | Avg:  2m 30s | Max:  9m 02s | Hits:  81%/106   
      🟩 Test               Pass: 100%/8   | Total: 32m 01s | Avg:  4m 00s | Max:  4m 42s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
      🟩 90a                Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
    🟩 std
      🟩 17                 Pass: 100%/31  | Total:  1h 16m | Avg:  2m 28s | Max:  4m 14s
      🟩 20                 Pass: 100%/24  | Total:  1h 13m | Avg:  3m 03s | Max:  9m 02s | Hits:  81%/106   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
+/- CUDA Experimental
pycuda

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda

🏃‍ Runner counts (total jobs: 305)

# Runner
219 linux-amd64-cpu16
49 linux-amd64-gpu-v100-latest-1
20 linux-arm64-cpu16
17 windows-amd64-cpu16

Copy link
Contributor

🟩 CI finished in 13h 42m: Pass: 100%/305 | Total: 1d 09h | Avg: 6m 33s | Max: 43m 03s | Hits: 98%/17461
  • 🟩 cub: Pass: 100%/131 | Total: 19h 16m | Avg: 8m 49s | Max: 43m 03s | Hits: 99%/4278

    🟩 cpu
      🟩 amd64              Pass: 100%/123 | Total: 18h 42m | Avg:  9m 07s | Max: 43m 03s | Hits:  99%/4278  
      🟩 arm64              Pass: 100%/8   | Total: 34m 29s | Avg:  4m 18s | Max:  5m 05s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 43m 03s | Hits:  99%/713   
      🟩 11.8               Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🟩 12.5               Pass: 100%/113 | Total: 17h 17m | Avg:  9m 10s | Max: 31m 38s | Hits:  99%/3565  
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 43s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 46m | Avg:  7m 05s | Max: 43m 03s | Hits:  99%/713   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🟩 nvcc12.5           Pass: 100%/111 | Total: 17h 10m | Avg:  9m 16s | Max: 31m 38s | Hits:  99%/3565  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 21s | Avg:  3m 40s | Max:  3m 43s
      🟩 nvcc               Pass: 100%/129 | Total: 19h 09m | Avg:  8m 54s | Max: 43m 03s | Hits:  99%/4278  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 26m 55s | Avg:  4m 29s | Max:  5m 17s
      🟩 Clang10            Pass: 100%/3   | Total: 15m 55s | Avg:  5m 18s | Max:  5m 45s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 34s | Avg:  4m 23s | Max:  4m 28s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 05s | Avg:  4m 31s | Max:  5m 02s
      🟩 Clang13            Pass: 100%/4   | Total: 17m 57s | Avg:  4m 29s | Max:  4m 42s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 43s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 10s | Avg:  4m 32s | Max:  4m 39s
      🟩 Clang16            Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  4m 30s
      🟩 Clang17            Pass: 100%/26  | Total:  6h 04m | Avg: 14m 00s | Max: 31m 38s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 14s | Avg:  4m 07s | Max:  4m 45s
      🟩 GCC7               Pass: 100%/6   | Total: 22m 54s | Avg:  3m 49s | Max:  4m 22s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 50s | Avg:  3m 58s | Max:  4m 24s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 03m | Avg: 10m 39s | Max: 43m 03s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 39s | Avg:  4m 24s | Max:  4m 49s
      🟩 GCC11              Pass: 100%/7   | Total: 29m 46s | Avg:  4m 15s | Max:  4m 32s
      🟩 GCC12              Pass: 100%/4   | Total: 18m 38s | Avg:  4m 39s | Max:  5m 02s
      🟩 GCC13              Pass: 100%/28  | Total:  6h 06m | Avg: 13m 04s | Max: 28m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 33s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 14m 03s | Avg: 14m 03s | Max: 14m 03s | Hits:  99%/713   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 23m 00s | Avg: 11m 30s | Max: 11m 31s | Hits:  99%/1426  
      🟩 MSVC14.39          Pass: 100%/3   | Total: 38m 07s | Avg: 12m 42s | Max: 13m 01s | Hits:  99%/2139  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/59  | Total:  8h 34m | Avg:  8m 43s | Max: 31m 38s
      🟩 GCC                Pass: 100%/63  | Total:  9h 11m | Avg:  8m 44s | Max: 43m 03s
      🟩 Intel              Pass: 100%/3   | Total: 15m 27s | Avg:  5m 09s | Max:  5m 33s
      🟩 MSVC               Pass: 100%/6   | Total:  1h 15m | Avg: 12m 31s | Max: 14m 03s | Hits:  99%/4278  
    🟩 gpu
      🟩 v100               Pass: 100%/131 | Total: 19h 16m | Avg:  8m 49s | Max: 43m 03s | Hits:  99%/4278  
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  8h 39m | Avg:  5m 14s | Max: 43m 03s | Hits:  99%/4278  
      🟩 DeviceLaunch       Pass: 100%/8   | Total:  2h 35m | Avg: 19m 23s | Max: 23m 03s
      🟩 GraphCapture       Pass: 100%/8   | Total:  2h 07m | Avg: 15m 57s | Max: 18m 13s
      🟩 HostLaunch         Pass: 100%/8   | Total:  2h 26m | Avg: 18m 20s | Max: 21m 19s
      🟩 TestGPU            Pass: 100%/8   | Total:  3h 27m | Avg: 25m 58s | Max: 31m 38s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 12m 43s | Avg:  4m 14s | Max:  4m 27s
      🟩 90a                Pass: 100%/4   | Total: 14m 45s | Avg:  3m 41s | Max:  3m 55s
    🟩 std
      🟩 11                 Pass: 100%/34  | Total:  4h 33m | Avg:  8m 01s | Max: 28m 36s
      🟩 14                 Pass: 100%/37  | Total:  4h 58m | Avg:  8m 04s | Max: 24m 37s | Hits:  99%/2139  
      🟩 17                 Pass: 100%/36  | Total:  5h 43m | Avg:  9m 32s | Max: 43m 03s | Hits:  99%/1426  
      🟩 20                 Pass: 100%/24  | Total:  4h 01m | Avg: 10m 02s | Max: 31m 38s | Hits:  99%/713   
    
  • 🟩 thrust: Pass: 100%/118 | Total: 11h 22m | Avg: 5m 47s | Max: 18m 59s | Hits: 99%/13077

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total: 10h 50m | Avg:  5m 54s | Max: 18m 59s | Hits:  99%/13077 
      🟩 arm64              Pass: 100%/8   | Total: 32m 18s | Avg:  4m 02s | Max:  5m 01s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 28s | Max: 17m 29s | Hits:  98%/1453  
      🟩 11.8               Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 12.5               Pass: 100%/100 | Total: 10h 03m | Avg:  6m 02s | Max: 18m 59s | Hits:  99%/11624 
    🟩 cudacxx
      🟩 ClangCUDA17        Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 15s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 28s | Max: 17m 29s | Hits:  98%/1453  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 nvcc12.5           Pass: 100%/98  | Total:  9h 55m | Avg:  6m 04s | Max: 18m 59s | Hits:  99%/11624 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 15s
      🟩 nvcc               Pass: 100%/116 | Total: 11h 14m | Avg:  5m 48s | Max: 18m 59s | Hits:  99%/13077 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 25m 44s | Avg:  4m 17s | Max:  5m 25s
      🟩 Clang10            Pass: 100%/3   | Total: 14m 37s | Avg:  4m 52s | Max:  5m 15s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 07s | Avg:  4m 16s | Max:  5m 06s
      🟩 Clang12            Pass: 100%/4   | Total: 16m 18s | Avg:  4m 04s | Max:  4m 20s
      🟩 Clang13            Pass: 100%/4   | Total: 16m 05s | Avg:  4m 01s | Max:  4m 30s
      🟩 Clang14            Pass: 100%/4   | Total: 16m 42s | Avg:  4m 10s | Max:  4m 18s
      🟩 Clang15            Pass: 100%/4   | Total: 16m 21s | Avg:  4m 05s | Max:  4m 24s
      🟩 Clang16            Pass: 100%/4   | Total: 16m 34s | Avg:  4m 08s | Max:  4m 26s
      🟩 Clang17            Pass: 100%/18  | Total:  1h 52m | Avg:  6m 14s | Max: 12m 36s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 27s | Avg:  3m 13s | Max:  3m 14s
      🟩 GCC7               Pass: 100%/6   | Total: 21m 18s | Avg:  3m 33s | Max:  4m 06s
      🟩 GCC8               Pass: 100%/6   | Total: 23m 17s | Avg:  3m 52s | Max:  4m 20s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 33s | Avg:  3m 45s | Max:  4m 18s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 50s | Avg:  3m 57s | Max:  4m 11s
      🟩 GCC11              Pass: 100%/7   | Total: 28m 47s | Avg:  4m 06s | Max:  4m 49s
      🟩 GCC12              Pass: 100%/4   | Total: 17m 52s | Avg:  4m 28s | Max:  5m 00s
      🟩 GCC13              Pass: 100%/20  | Total:  2h 09m | Avg:  6m 29s | Max: 13m 50s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 29s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 29s | Avg: 17m 29s | Max: 17m 29s | Hits:  98%/1453  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 29m 30s | Avg: 14m 45s | Max: 15m 14s | Hits:  98%/2906  
      🟩 MSVC14.39          Pass: 100%/6   | Total:  1h 42m | Avg: 17m 06s | Max: 18m 59s | Hits:  99%/8718  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/51  | Total:  4h 11m | Avg:  4m 56s | Max: 12m 36s
      🟩 GCC                Pass: 100%/55  | Total:  4h 25m | Avg:  4m 49s | Max: 13m 50s
      🟩 Intel              Pass: 100%/3   | Total: 15m 24s | Avg:  5m 08s | Max:  5m 29s
      🟩 MSVC               Pass: 100%/9   | Total:  2h 29m | Avg: 16m 37s | Max: 18m 59s | Hits:  99%/13077 
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total: 11h 22m | Avg:  5m 47s | Max: 18m 59s | Hits:  99%/13077 
    🟩 jobs
      🟩 Build              Pass: 100%/99  | Total:  7h 53m | Avg:  4m 47s | Max: 17m 29s | Hits:  98%/8718  
      🟩 TestCPU            Pass: 100%/11  | Total:  1h 48m | Avg:  9m 54s | Max: 18m 59s | Hits:  99%/4359  
      🟩 TestGPU            Pass: 100%/8   | Total:  1h 39m | Avg: 12m 28s | Max: 13m 50s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 11m 48s | Avg:  3m 56s | Max:  4m 13s
      🟩 90a                Pass: 100%/4   | Total: 14m 29s | Avg:  3m 37s | Max:  3m 46s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 15m | Avg:  4m 31s | Max: 12m 40s
      🟩 14                 Pass: 100%/34  | Total:  3h 29m | Avg:  6m 09s | Max: 18m 27s | Hits:  99%/5812  
      🟩 17                 Pass: 100%/33  | Total:  3h 19m | Avg:  6m 03s | Max: 18m 40s | Hits:  99%/4359  
      🟩 20                 Pass: 100%/21  | Total:  2h 17m | Avg:  6m 34s | Max: 18m 59s | Hits:  99%/2906  
    
  • 🟩 cudax: Pass: 100%/55 | Total: 2h 29m | Avg: 2m 43s | Max: 9m 02s | Hits: 81%/106

    🟩 cpu
      🟩 amd64              Pass: 100%/51  | Total:  2h 22m | Avg:  2m 48s | Max:  9m 02s | Hits:  81%/106   
      🟩 arm64              Pass: 100%/4   | Total:  6m 57s | Avg:  1m 44s | Max:  1m 48s
    🟩 ctk
      🟩 12.0               Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max:  8m 49s | Hits:  81%/53    
      🟩 12.5               Pass: 100%/32  | Total:  1h 25m | Avg:  2m 41s | Max:  9m 02s | Hits:  81%/53    
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/23  | Total:  1h 04m | Avg:  2m 47s | Max:  8m 49s | Hits:  81%/53    
      🟩 nvcc12.5           Pass: 100%/32  | Total:  1h 25m | Avg:  2m 41s | Max:  9m 02s | Hits:  81%/53    
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/55  | Total:  2h 29m | Avg:  2m 43s | Max:  9m 02s | Hits:  81%/106   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  4m 32s | Avg:  2m 16s | Max:  2m 22s
      🟩 Clang10            Pass: 100%/2   | Total:  4m 24s | Avg:  2m 12s | Max:  2m 17s
      🟩 Clang11            Pass: 100%/4   | Total:  9m 05s | Avg:  2m 16s | Max:  2m 31s
      🟩 Clang12            Pass: 100%/4   | Total:  9m 14s | Avg:  2m 18s | Max:  2m 31s
      🟩 Clang13            Pass: 100%/4   | Total:  9m 01s | Avg:  2m 15s | Max:  2m 26s
      🟩 Clang14            Pass: 100%/6   | Total: 17m 47s | Avg:  2m 57s | Max:  4m 42s
      🟩 Clang15            Pass: 100%/2   | Total:  4m 39s | Avg:  2m 19s | Max:  2m 26s
      🟩 Clang16            Pass: 100%/6   | Total: 16m 45s | Avg:  2m 47s | Max:  4m 14s
      🟩 GCC9               Pass: 100%/2   | Total:  4m 30s | Avg:  2m 15s | Max:  2m 18s
      🟩 GCC10              Pass: 100%/4   | Total:  8m 34s | Avg:  2m 08s | Max:  2m 25s
      🟩 GCC11              Pass: 100%/4   | Total:  8m 28s | Avg:  2m 07s | Max:  2m 21s
      🟩 GCC12              Pass: 100%/12  | Total: 32m 09s | Avg:  2m 40s | Max:  4m 11s
      🟩 Intel2023.2.0      Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  8m 49s | Avg:  8m 49s | Max:  8m 49s | Hits:  81%/53    
      🟩 MSVC14.39          Pass: 100%/1   | Total:  9m 02s | Avg:  9m 02s | Max:  9m 02s | Hits:  81%/53    
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  1h 15m | Avg:  2m 30s | Max:  4m 42s
      🟩 GCC                Pass: 100%/22  | Total: 53m 41s | Avg:  2m 26s | Max:  4m 11s
      🟩 Intel              Pass: 100%/1   | Total:  2m 56s | Avg:  2m 56s | Max:  2m 56s
      🟩 MSVC               Pass: 100%/2   | Total: 17m 51s | Avg:  8m 55s | Max:  9m 02s | Hits:  81%/106   
    🟩 gpu
      🟩 v100               Pass: 100%/55  | Total:  2h 29m | Avg:  2m 43s | Max:  9m 02s | Hits:  81%/106   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  1h 57m | Avg:  2m 30s | Max:  9m 02s | Hits:  81%/106   
      🟩 Test               Pass: 100%/8   | Total: 32m 01s | Avg:  4m 00s | Max:  4m 42s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 09s | Avg:  2m 09s | Max:  2m 09s
      🟩 90a                Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
    🟩 std
      🟩 17                 Pass: 100%/31  | Total:  1h 16m | Avg:  2m 28s | Max:  4m 14s
      🟩 20                 Pass: 100%/24  | Total:  1h 13m | Avg:  3m 03s | Max:  9m 02s | Hits:  81%/106   
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 11m 50s | Avg: 11m 50s | Max: 11m 50s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
+/- Thrust
+/- CUDA Experimental
pycuda

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda

🏃‍ Runner counts (total jobs: 305)

# Runner
219 linux-amd64-cpu16
49 linux-amd64-gpu-v100-latest-1
20 linux-arm64-cpu16
17 windows-amd64-cpu16

@ericniebler ericniebler merged commit dd90bed into NVIDIA:main Aug 27, 2024
318 of 322 checks passed
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Aug 27, 2024
bernhardmgruber pushed a commit to bernhardmgruber/cccl that referenced this pull request Aug 28, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
CUDA Next Feature intended for the Cuda Next experimental library
Projects
Status: Done
Development

Successfully merging this pull request may close these issues.

3 participants