Improve uninitialized_{async_}buffer API #2713

Merged
miscco merged 1 commit into main from enhance_uninitialized_buffer
Nov 7, 2024

Conversation

miscco
Collaborator

@miscco miscco commented Nov 6, 2024

This PR improves the uninitialized_buffer and uninitialized_async_buffer APIs:

  • Allow converting one buffer into another when their properties match.
  • Improve symbol visibility.
  • Add a way to grow a buffer.
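
To make the conversion and grow points concrete, here is a minimal host-only sketch. It is not the cudax implementation: toy_buffer, the two property tags, and replace_allocation are hypothetical names invented for illustration, and plain malloc stands in for a real memory resource. The assumption illustrated is that a buffer may be converted to another buffer type only when every property the target asks for is also provided by the source.

```cpp
#include <cstddef>
#include <cstdlib>
#include <type_traits>
#include <utility>

// Hypothetical property tags (stand-ins, not the real cudax/libcu++ properties).
struct device_accessible {};
struct host_accessible {};

// True when Needle is one of the types in Haystack.
template <class Needle, class... Haystack>
inline constexpr bool contains_v = (std::is_same_v<Needle, Haystack> || ...);

// Hypothetical owning buffer of uninitialized storage, tagged with properties.
template <class T, class... Properties>
class toy_buffer {
public:
  // Allocates raw storage for count elements; no elements are constructed.
  // Allocation failure is not handled in this sketch.
  explicit toy_buffer(std::size_t count)
      : data_(static_cast<T*>(std::malloc(count * sizeof(T)))), count_(count) {}

  toy_buffer(toy_buffer&& other) noexcept
      : data_(std::exchange(other.data_, nullptr)),
        count_(std::exchange(other.count_, std::size_t{0})) {}

  // Converting move constructor: participates in overload resolution only if
  // every property of this buffer is also a property of the source buffer.
  template <class... OtherProperties,
            std::enable_if_t<(contains_v<Properties, OtherProperties...> && ...), int> = 0>
  toy_buffer(toy_buffer<T, OtherProperties...>&& other) noexcept
      : data_(std::exchange(other.data_, nullptr)),
        count_(std::exchange(other.count_, std::size_t{0})) {}

  toy_buffer(const toy_buffer&)            = delete;
  toy_buffer& operator=(const toy_buffer&) = delete;

  ~toy_buffer() { std::free(data_); }

  // "Grow" in this sketch: install a fresh allocation of new_count elements
  // and hand the previous storage back so the caller can copy from it.
  toy_buffer replace_allocation(std::size_t new_count) {
    toy_buffer old(std::move(*this)); // old takes over the previous storage
    data_  = static_cast<T*>(std::malloc(new_count * sizeof(T)));
    count_ = new_count;
    return old;
  }

  T* data() const noexcept { return data_; }
  std::size_t size() const noexcept { return count_; }

private:
  // Other specializations need access to data_ and count_ for the conversion.
  template <class, class...>
  friend class toy_buffer;

  T* data_;
  std::size_t count_;
};
```

With this sketch, a toy_buffer<int, device_accessible, host_accessible> can be moved into a toy_buffer<int, device_accessible>, while the reverse direction does not compile because the source lacks host_accessible. Returning the old allocation from replace_allocation leaves the decision of what to copy to the caller, which fits a buffer that never constructs its elements.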

@miscco miscco requested review from a team as code owners November 6, 2024 10:55
@miscco miscco added the feature request (New feature or request) and CUDA Next (Feature intended for the CUDA Next experimental library) labels Nov 6, 2024
Contributor

github-actions bot commented Nov 6, 2024

🟩 CI finished in 22m 56s: Pass: 100%/54 | Total: 4h 29m | Avg: 4m 59s | Max: 17m 51s | Hits: 83%/238
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 29m | Avg: 4m 59s | Max: 17m 51s | Hits: 83%/238

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 15m | Avg:  5m 06s | Max: 17m 51s | Hits:  83%/238   
      🟩 arm64              Pass: 100%/4   | Total: 13m 32s | Avg:  3m 23s | Max:  4m 00s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 33m | Avg:  4m 54s | Max: 16m 27s | Hits:  83%/119   
      🟩 12.5               Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 12s
      🟩 12.6               Pass: 100%/33  | Total:  2h 44m | Avg:  4m 58s | Max: 17m 51s | Hits:  83%/119   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 33m | Avg:  4m 54s | Max: 16m 27s | Hits:  83%/119   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 12s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 44m | Avg:  4m 58s | Max: 17m 51s | Hits:  83%/119   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 29m | Avg:  4m 59s | Max: 17m 51s | Hits:  83%/238   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  7m 50s | Avg:  3m 55s | Max:  4m 04s
      🟩 Clang10            Pass: 100%/2   | Total:  7m 33s | Avg:  3m 46s | Max:  4m 07s
      🟩 Clang11            Pass: 100%/4   | Total: 13m 27s | Avg:  3m 21s | Max:  3m 33s
      🟩 Clang12            Pass: 100%/4   | Total: 13m 56s | Avg:  3m 29s | Max:  3m 44s
      🟩 Clang13            Pass: 100%/4   | Total: 13m 29s | Avg:  3m 22s | Max:  3m 36s
      🟩 Clang14            Pass: 100%/4   | Total: 26m 48s | Avg:  6m 42s | Max: 16m 03s
      🟩 Clang15            Pass: 100%/2   | Total:  7m 35s | Avg:  3m 47s | Max:  3m 58s
      🟩 Clang16            Pass: 100%/4   | Total: 13m 54s | Avg:  3m 28s | Max:  3m 53s
      🟩 Clang17            Pass: 100%/2   | Total:  7m 09s | Avg:  3m 34s | Max:  3m 37s
      🟩 Clang18            Pass: 100%/2   | Total: 21m 19s | Avg: 10m 39s | Max: 17m 42s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 30s | Avg:  3m 15s | Max:  3m 16s
      🟩 GCC10              Pass: 100%/4   | Total: 13m 44s | Avg:  3m 26s | Max:  3m 39s
      🟩 GCC11              Pass: 100%/4   | Total: 13m 46s | Avg:  3m 26s | Max:  3m 35s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 05m | Avg:  9m 18s | Max: 17m 51s
      🟩 GCC13              Pass: 100%/3   | Total: 10m 03s | Avg:  3m 21s | Max:  4m 00s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 51s | Avg:  7m 51s | Max:  7m 51s | Hits:  83%/119   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  7m 18s | Avg:  7m 18s | Max:  7m 18s | Hits:  83%/119   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 12s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 13m | Avg:  4m 26s | Max: 17m 42s
      🟩 GCC                Pass: 100%/20  | Total:  1h 49m | Avg:  5m 27s | Max: 17m 51s
      🟩 MSVC               Pass: 100%/2   | Total: 15m 09s | Avg:  7m 34s | Max:  7m 51s | Hits:  83%/238   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 54s | Avg:  5m 57s | Max:  6m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 29m | Avg:  4m 59s | Max: 17m 51s | Hits:  83%/238   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  3h 04m | Avg:  3m 45s | Max:  7m 51s | Hits:  83%/238   
      🟩 Test               Pass: 100%/5   | Total:  1h 25m | Avg: 17m 01s | Max: 17m 51s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 51s | Avg:  2m 51s | Max:  2m 51s
      🟩 90a                Pass: 100%/1   | Total:  2m 49s | Avg:  2m 49s | Max:  2m 49s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 11m | Avg:  4m 31s | Max: 17m 05s
      🟩 20                 Pass: 100%/25  | Total:  2h 17m | Avg:  5m 30s | Max: 17m 51s | Hits:  83%/238   
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

🏃‍ Runner counts (total jobs: 54)

# Runner
43 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

@miscco miscco force-pushed the enhance_uninitialized_buffer branch 2 times, most recently from 36fd17a to 45df2a6 Compare November 6, 2024 11:26
@miscco miscco requested a review from a team as a code owner November 6, 2024 11:26
This implements some improvements to the buffers.

* We want to be able to convert them if the properties match.
* Add some improvements regarding symbol visibility
* Add a way to grow a buffer
* Ensure we can simply swap them
@miscco miscco force-pushed the enhance_uninitialized_buffer branch from 45df2a6 to c6c7c88 Compare November 6, 2024 13:20
Contributor

github-actions bot commented Nov 6, 2024

🟩 CI finished in 1h 59m: Pass: 100%/394 | Total: 2d 15h | Avg: 9m 36s | Max: 1h 16m | Hits: 82%/25847
  • 🟩 libcudacxx: Pass: 100%/118 | Total: 1d 07h | Avg: 15m 53s | Max: 1h 16m | Hits: 52%/9496

    🟩 cpu
      🟩 amd64              Pass: 100%/110 | Total:  1d 05h | Avg: 15m 56s | Max:  1h 16m | Hits:  52%/9496  
      🟩 arm64              Pass: 100%/8   | Total:  2h 02m | Avg: 15m 19s | Max: 26m 29s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  3h 14m | Avg: 12m 59s | Max: 30m 12s | Hits:  37%/2180  
      🟩 11.8               Pass: 100%/3   | Total: 55m 57s | Avg: 18m 39s | Max: 30m 51s
      🟩 12.5               Pass: 100%/4   | Total:  1h 22m | Avg: 20m 43s | Max: 39m 33s
      🟩 12.6               Pass: 100%/96  | Total:  1d 01h | Avg: 16m 03s | Max:  1h 16m | Hits:  57%/7316  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/12  | Total:  2h 31m | Avg: 12m 35s | Max: 21m 18s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  3h 14m | Avg: 12m 59s | Max: 30m 12s | Hits:  37%/2180  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 55m 57s | Avg: 18m 39s | Max: 30m 51s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 22m | Avg: 20m 43s | Max: 39m 33s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 23h 10m | Avg: 16m 33s | Max:  1h 16m | Hits:  57%/7316  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/12  | Total:  2h 31m | Avg: 12m 35s | Max: 21m 18s
      🟩 nvcc               Pass: 100%/106 | Total:  1d 04h | Avg: 16m 16s | Max:  1h 16m | Hits:  52%/9496  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total:  1h 31m | Avg: 15m 18s | Max: 28m 36s
      🟩 Clang10            Pass: 100%/3   | Total:  1h 09m | Avg: 23m 08s | Max: 26m 05s
      🟩 Clang11            Pass: 100%/4   | Total:  1h 09m | Avg: 17m 19s | Max: 24m 12s
      🟩 Clang12            Pass: 100%/4   | Total:  1h 02m | Avg: 15m 42s | Max: 29m 10s
      🟩 Clang13            Pass: 100%/4   | Total:  1h 06m | Avg: 16m 34s | Max: 24m 08s
      🟩 Clang14            Pass: 100%/4   | Total: 58m 28s | Avg: 14m 37s | Max: 25m 13s
      🟩 Clang15            Pass: 100%/4   | Total:  1h 35m | Avg: 23m 54s | Max: 28m 38s
      🟩 Clang16            Pass: 100%/4   | Total: 40m 05s | Avg: 10m 01s | Max: 26m 53s
      🟩 Clang17            Pass: 100%/4   | Total: 51m 59s | Avg: 12m 59s | Max: 25m 50s
      🟩 Clang18            Pass: 100%/18  | Total:  4h 11m | Avg: 13m 57s | Max: 26m 29s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 25s | Avg:  2m 42s | Max:  2m 46s
      🟩 GCC7               Pass: 100%/6   | Total:  1h 40m | Avg: 16m 43s | Max: 25m 40s
      🟩 GCC8               Pass: 100%/6   | Total:  1h 31m | Avg: 15m 10s | Max: 25m 39s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 38m | Avg: 16m 22s | Max: 28m 29s
      🟩 GCC10              Pass: 100%/4   | Total: 30m 09s | Avg:  7m 32s | Max: 18m 20s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 56m | Avg: 16m 42s | Max: 30m 51s
      🟩 GCC12              Pass: 100%/4   | Total:  1h 16m | Avg: 19m 13s | Max: 28m 46s
      🟩 GCC13              Pass: 100%/17  | Total:  4h 42m | Avg: 16m 35s | Max:  1h 16m
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 11s | Avg:  5m 43s | Max:  6m 12s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 30m 12s | Avg: 30m 12s | Max: 30m 12s | Hits:  37%/2180  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 43m 32s | Avg: 21m 46s | Max: 33m 01s | Hits:  68%/4723  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 43m 45s | Avg: 43m 45s | Max: 43m 45s | Hits:  35%/2593  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 22m | Avg: 20m 43s | Max: 39m 33s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/55  | Total: 14h 17m | Avg: 15m 35s | Max: 29m 10s
      🟩 GCC                Pass: 100%/52  | Total: 13h 21m | Avg: 15m 24s | Max:  1h 16m
      🟩 Intel              Pass: 100%/3   | Total: 17m 11s | Avg:  5m 43s | Max:  6m 12s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 57m | Avg: 29m 22s | Max: 43m 45s | Hits:  52%/9496  
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 22m | Avg: 20m 43s | Max: 39m 33s
    🟩 gpu
      🟩 v100               Pass: 100%/118 | Total:  1d 07h | Avg: 15m 53s | Max:  1h 16m | Hits:  52%/9496  
    🟩 jobs
      🟩 Build              Pass: 100%/110 | Total:  1d 03h | Avg: 15m 11s | Max: 43m 45s | Hits:  52%/9496  
      🟩 NVRTC              Pass: 100%/4   | Total:  1h 34m | Avg: 23m 36s | Max: 27m 23s
      🟩 Test               Pass: 100%/3   | Total:  1h 49m | Avg: 36m 22s | Max:  1h 16m
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 56s | Avg:  1m 56s | Max:  1m 56s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 55m 57s | Avg: 18m 39s | Max: 30m 51s
      🟩 90                 Pass: 100%/4   | Total: 41m 50s | Avg: 10m 27s | Max: 12m 23s
      🟩 90a                Pass: 100%/8   | Total: 55m 50s | Avg:  6m 58s | Max: 12m 14s
    🟩 std
      🟩 11                 Pass: 100%/32  | Total:  6h 32m | Avg: 12m 15s | Max: 24m 52s
      🟩 14                 Pass: 100%/32  | Total:  6h 49m | Avg: 12m 48s | Max: 33m 01s | Hits:  36%/4463  
      🟩 17                 Pass: 100%/30  | Total:  8h 45m | Avg: 17m 30s | Max: 30m 51s | Hits:  99%/2440  
      🟩 20                 Pass: 100%/23  | Total:  9h 06m | Avg: 23m 46s | Max:  1h 16m | Hits:  35%/2593  
    
  • 🟩 cub: Pass: 100%/110 | Total: 13h 58m | Avg: 7m 37s | Max: 42m 46s | Hits: 99%/2948

    🟩 cpu
      🟩 amd64              Pass: 100%/102 | Total: 13h 24m | Avg:  7m 53s | Max: 42m 46s | Hits:  99%/2948  
      🟩 arm64              Pass: 100%/8   | Total: 34m 30s | Avg:  4m 18s | Max:  5m 06s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 09m | Avg:  4m 37s | Max: 11m 54s | Hits:  99%/737   
      🟩 11.8               Pass: 100%/3   | Total: 53m 18s | Avg: 17m 46s | Max: 42m 46s
      🟩 12.5               Pass: 100%/4   | Total: 32m 34s | Avg:  8m 08s | Max:  8m 25s
      🟩 12.6               Pass: 100%/88  | Total: 11h 23m | Avg:  7m 46s | Max: 42m 42s | Hits:  99%/2211  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 15m 56s | Avg:  3m 59s | Max:  4m 01s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 09m | Avg:  4m 37s | Max: 11m 54s | Hits:  99%/737   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 53m 18s | Avg: 17m 46s | Max: 42m 46s
      🟩 nvcc12.5           Pass: 100%/4   | Total: 32m 34s | Avg:  8m 08s | Max:  8m 25s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 11h 07m | Avg:  7m 56s | Max: 42m 42s | Hits:  99%/2211  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 15m 56s | Avg:  3m 59s | Max:  4m 01s
      🟩 nvcc               Pass: 100%/106 | Total: 13h 42m | Avg:  7m 45s | Max: 42m 46s | Hits:  99%/2948  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 26s | Avg:  4m 54s | Max:  5m 47s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 13s | Avg:  5m 24s | Max:  5m 33s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 59s | Avg:  4m 44s | Max:  5m 03s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 18s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 44s | Avg:  4m 56s | Max:  5m 28s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 16s | Avg:  4m 49s | Max:  4m 56s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 11s | Avg:  5m 02s | Max:  5m 22s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 51s | Avg:  4m 42s | Max:  4m 51s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 31s | Avg:  4m 52s | Max:  5m 18s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 57m | Avg: 10m 40s | Max: 42m 42s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 56s | Avg:  3m 58s | Max:  4m 07s
      🟩 GCC7               Pass: 100%/6   | Total: 25m 30s | Avg:  4m 15s | Max:  4m 37s
      🟩 GCC8               Pass: 100%/6   | Total: 57m 22s | Avg:  9m 33s | Max: 36m 10s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 47s | Avg:  4m 27s | Max:  5m 02s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 57s | Avg:  4m 44s | Max:  4m 58s
      🟩 GCC11              Pass: 100%/7   | Total:  1h 12m | Avg: 10m 20s | Max: 42m 46s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 00s | Avg:  5m 00s | Max:  5m 06s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 38m | Avg: 13m 40s | Max: 39m 30s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  6m 02s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 11m 54s | Avg: 11m 54s | Max: 11m 54s | Hits:  99%/737   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 20m 02s | Avg: 10m 01s | Max: 10m 09s | Hits:  99%/1474  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 10m 32s | Avg: 10m 32s | Max: 10m 32s | Hits:  99%/737   
      🟩 NVHPC24.7          Pass: 100%/4   | Total: 32m 34s | Avg:  8m 08s | Max:  8m 25s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  4h 58m | Avg:  6m 13s | Max: 42m 42s
      🟩 GCC                Pass: 100%/51  | Total:  7h 27m | Avg:  8m 46s | Max: 42m 46s
      🟩 Intel              Pass: 100%/3   | Total: 17m 22s | Avg:  5m 47s | Max:  6m 02s
      🟩 MSVC               Pass: 100%/4   | Total: 42m 28s | Avg: 10m 37s | Max: 11m 54s | Hits:  99%/2948  
      🟩 NVHPC              Pass: 100%/4   | Total: 32m 34s | Avg:  8m 08s | Max:  8m 25s
    🟩 gpu
      🟩 v100               Pass: 100%/110 | Total: 13h 58m | Avg:  7m 37s | Max: 42m 46s | Hits:  99%/2948  
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total:  9h 43m | Avg:  5m 43s | Max: 42m 46s | Hits:  99%/2948  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 26m 03s | Avg: 26m 03s | Max: 26m 03s
      🟩 GraphCapture       Pass: 100%/1   | Total: 26m 28s | Avg: 26m 28s | Max: 26m 28s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 21m | Avg: 27m 17s | Max: 37m 36s
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 01m | Avg: 40m 25s | Max: 42m 42s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 53m 18s | Avg: 17m 46s | Max: 42m 46s
      🟩 90a                Pass: 100%/4   | Total: 15m 24s | Avg:  3m 51s | Max:  4m 02s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  3h 07m | Avg:  6m 15s | Max: 39m 30s
      🟩 14                 Pass: 100%/29  | Total:  3h 03m | Avg:  6m 18s | Max: 36m 10s | Hits:  99%/1474  
      🟩 17                 Pass: 100%/27  | Total:  2h 54m | Avg:  6m 28s | Max: 42m 46s | Hits:  99%/737   
      🟩 20                 Pass: 100%/24  | Total:  4h 53m | Avg: 12m 13s | Max: 42m 42s | Hits:  99%/737   
    
  • 🟩 thrust: Pass: 100%/109 | Total: 12h 50m | Avg: 7m 04s | Max: 44m 15s | Hits: 99%/13165

    🟩 cpu
      🟩 amd64              Pass: 100%/101 | Total: 12h 11m | Avg:  7m 14s | Max: 44m 15s | Hits:  99%/13165 
      🟩 arm64              Pass: 100%/8   | Total: 38m 34s | Avg:  4m 49s | Max:  5m 14s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 20m | Avg:  5m 20s | Max: 17m 28s | Hits:  99%/2633  
      🟩 11.8               Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  6m 00s
      🟩 12.5               Pass: 100%/4   | Total:  1h 07m | Avg: 16m 50s | Max: 19m 06s
      🟩 12.6               Pass: 100%/87  | Total: 10h 06m | Avg:  6m 58s | Max: 44m 15s | Hits:  99%/10532 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 21s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 20m | Avg:  5m 20s | Max: 17m 28s | Hits:  99%/2633  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  6m 00s
      🟩 nvcc12.5           Pass: 100%/4   | Total:  1h 07m | Avg: 16m 50s | Max: 19m 06s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  9h 46m | Avg:  7m 04s | Max: 44m 15s | Hits:  99%/10532 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/4   | Total: 19m 53s | Avg:  4m 58s | Max:  5m 21s
      🟩 nvcc               Pass: 100%/105 | Total: 12h 30m | Avg:  7m 08s | Max: 44m 15s | Hits:  99%/13165 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 39s | Avg:  5m 26s | Max:  6m 35s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 03s | Avg:  6m 41s | Max:  7m 02s
      🟩 Clang11            Pass: 100%/4   | Total: 20m 30s | Avg:  5m 07s | Max:  5m 32s
      🟩 Clang12            Pass: 100%/4   | Total: 59m 35s | Avg: 14m 53s | Max: 44m 15s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 07s | Avg:  5m 01s | Max:  5m 11s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 07s | Avg:  5m 16s | Max:  5m 39s
      🟩 Clang15            Pass: 100%/4   | Total: 22m 07s | Avg:  5m 31s | Max:  5m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 32s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 52s | Avg:  5m 28s | Max:  6m 02s
      🟩 Clang18            Pass: 100%/11  | Total:  1h 12m | Avg:  6m 35s | Max: 20m 31s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 13s | Avg:  4m 06s | Max:  4m 20s
      🟩 GCC7               Pass: 100%/6   | Total: 27m 52s | Avg:  4m 38s | Max:  5m 39s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 11s | Avg:  4m 51s | Max:  5m 32s
      🟩 GCC9               Pass: 100%/6   | Total: 29m 41s | Avg:  4m 56s | Max:  5m 45s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 13s | Avg:  5m 33s | Max:  5m 56s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 16s | Avg:  5m 36s | Max:  6m 20s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 16s | Avg:  5m 34s | Max:  5m 46s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 41m | Avg:  7m 14s | Max: 20m 36s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 20m 28s | Avg:  6m 49s | Max:  6m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 17m 28s | Avg: 17m 28s | Max: 17m 28s | Hits:  99%/2633  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 31m 04s | Avg: 15m 32s | Max: 15m 56s | Hits:  99%/5266  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 42m 27s | Avg: 21m 13s | Max: 24m 11s | Hits:  99%/5266  
      🟩 NVHPC24.7          Pass: 100%/4   | Total:  1h 07m | Avg: 16m 50s | Max: 19m 06s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/48  | Total:  5h 11m | Avg:  6m 29s | Max: 44m 15s
      🟩 GCC                Pass: 100%/49  | Total:  4h 40m | Avg:  5m 42s | Max: 20m 36s
      🟩 Intel              Pass: 100%/3   | Total: 20m 28s | Avg:  6m 49s | Max:  6m 58s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 30m | Avg: 18m 11s | Max: 24m 11s | Hits:  99%/13165 
      🟩 NVHPC              Pass: 100%/4   | Total:  1h 07m | Avg: 16m 50s | Max: 19m 06s
    🟩 gpu
      🟩 v100               Pass: 100%/109 | Total: 12h 50m | Avg:  7m 04s | Max: 44m 15s | Hits:  99%/13165 
    🟩 jobs
      🟩 Build              Pass: 100%/102 | Total: 11h 06m | Avg:  6m 31s | Max: 44m 15s | Hits:  99%/10532 
      🟩 TestCPU            Pass: 100%/4   | Total: 46m 56s | Avg: 11m 44s | Max: 24m 11s | Hits:  99%/2633  
      🟩 TestGPU            Pass: 100%/3   | Total: 57m 05s | Avg: 19m 01s | Max: 20m 36s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 27s | Avg:  5m 29s | Max:  6m 00s
      🟩 90a                Pass: 100%/4   | Total: 18m 11s | Avg:  4m 32s | Max:  4m 47s
    🟩 std
      🟩 11                 Pass: 100%/30  | Total:  2h 49m | Avg:  5m 39s | Max: 15m 58s
      🟩 14                 Pass: 100%/29  | Total:  3h 46m | Avg:  7m 47s | Max: 44m 15s | Hits:  99%/5266  
      🟩 17                 Pass: 100%/27  | Total:  2h 48m | Avg:  6m 13s | Max: 16m 28s | Hits:  99%/2633  
      🟩 20                 Pass: 100%/23  | Total:  3h 26m | Avg:  8m 58s | Max: 24m 11s | Hits:  99%/5266  
    
  • 🟩 cudax: Pass: 100%/54 | Total: 4h 33m | Avg: 5m 03s | Max: 27m 00s | Hits: 83%/238

    🟩 cpu
      🟩 amd64              Pass: 100%/50  | Total:  4h 22m | Avg:  5m 15s | Max: 27m 00s | Hits:  83%/238   
      🟩 arm64              Pass: 100%/4   | Total: 10m 48s | Avg:  2m 42s | Max:  2m 58s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 36m | Avg:  5m 06s | Max: 21m 29s | Hits:  83%/119   
      🟩 12.5               Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 18s
      🟩 12.6               Pass: 100%/33  | Total:  2h 44m | Avg:  4m 59s | Max: 27m 00s | Hits:  83%/119   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 36m | Avg:  5m 06s | Max: 21m 29s | Hits:  83%/119   
      🟩 nvcc12.5           Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 18s
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 44m | Avg:  4m 59s | Max: 27m 00s | Hits:  83%/119   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/54  | Total:  4h 33m | Avg:  5m 03s | Max: 27m 00s | Hits:  83%/238   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 27s | Avg:  3m 13s | Max:  3m 21s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 24s | Avg:  3m 12s | Max:  3m 20s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 08s | Avg:  3m 02s | Max:  3m 19s
      🟩 Clang12            Pass: 100%/4   | Total: 11m 43s | Avg:  2m 55s | Max:  3m 01s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 27s | Avg:  3m 06s | Max:  3m 17s
      🟩 Clang14            Pass: 100%/4   | Total: 30m 28s | Avg:  7m 37s | Max: 21m 26s
      🟩 Clang15            Pass: 100%/2   | Total:  5m 58s | Avg:  2m 59s | Max:  3m 00s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 42s | Avg:  2m 55s | Max:  3m 12s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 05s | Avg:  3m 02s | Max:  3m 06s
      🟩 Clang18            Pass: 100%/2   | Total: 24m 33s | Avg: 12m 16s | Max: 21m 42s
      🟩 GCC9               Pass: 100%/2   | Total:  5m 39s | Avg:  2m 49s | Max:  2m 53s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 17s | Avg:  2m 49s | Max:  2m 56s
      🟩 GCC11              Pass: 100%/4   | Total: 13m 16s | Avg:  3m 19s | Max:  3m 43s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 20m | Avg: 11m 31s | Max: 27m 00s
      🟩 GCC13              Pass: 100%/3   | Total:  7m 59s | Avg:  2m 39s | Max:  2m 58s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  7m 33s | Avg:  7m 33s | Max:  7m 33s | Hits:  83%/119   
      🟩 MSVC14.39          Pass: 100%/1   | Total:  7m 09s | Avg:  7m 09s | Max:  7m 09s | Hits:  83%/119   
      🟩 NVHPC24.7          Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 18s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 07m | Avg:  4m 15s | Max: 21m 42s
      🟩 GCC                Pass: 100%/20  | Total:  1h 58m | Avg:  5m 56s | Max: 27m 00s
      🟩 MSVC               Pass: 100%/2   | Total: 14m 42s | Avg:  7m 21s | Max:  7m 33s | Hits:  83%/238   
      🟩 NVHPC              Pass: 100%/2   | Total: 11m 55s | Avg:  5m 57s | Max:  6m 18s
    🟩 gpu
      🟩 v100               Pass: 100%/54  | Total:  4h 33m | Avg:  5m 03s | Max: 27m 00s | Hits:  83%/238   
    🟩 jobs
      🟩 Build              Pass: 100%/49  | Total:  2h 41m | Avg:  3m 17s | Max:  7m 33s | Hits:  83%/238   
      🟩 Test               Pass: 100%/5   | Total:  1h 51m | Avg: 22m 22s | Max: 27m 00s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 35s | Avg:  2m 35s | Max:  2m 35s
      🟩 90a                Pass: 100%/1   | Total:  2m 34s | Avg:  2m 34s | Max:  2m 34s
    🟩 std
      🟩 17                 Pass: 100%/29  | Total:  2h 05m | Avg:  4m 20s | Max: 21m 29s
      🟩 20                 Pass: 100%/25  | Total:  2h 27m | Avg:  5m 54s | Max: 27m 00s | Hits:  83%/238   
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 11m 05s | Avg: 5m 32s | Max: 9m 12s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 ctk
      🟩 12.6               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 11m 05s | Avg:  5m 32s | Max:  9m 12s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  1m 53s | Avg:  1m 53s | Max:  1m 53s
      🟩 Test               Pass: 100%/1   | Total:  9m 12s | Avg:  9m 12s | Max:  9m 12s
    
  • 🟩 python: Pass: 100%/1 | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 ctk
      🟩 12.6               Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 cudacxx
      🟩 nvcc12.6           Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 15m 56s | Avg: 15m 56s | Max: 15m 56s
    

👃 Inspect Changes

Modifications in project?

Project
CCCL Infrastructure
+/- libcu++
CUB
Thrust
+/- CUDA Experimental
python
CCCL C Parallel Library
Catch2Helper

Modifications in project or dependencies?

Project
CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- python
+/- CCCL C Parallel Library
+/- Catch2Helper

🏃‍ Runner counts (total jobs: 394)

# Runner
326 linux-amd64-cpu16
28 linux-arm64-cpu16
25 linux-amd64-gpu-v100-latest-1
15 windows-amd64-cpu16

@miscco miscco merged commit 78922b5 into NVIDIA:main Nov 7, 2024
408 of 424 checks passed
fbusato pushed a commit to fbusato/cccl that referenced this pull request Nov 9, 2024
This implements some improvements to the buffers.

* We want to be able to convert them if the properties match.
* Add some improvements regarding symbol visibility
* Add a way to grow a buffer
* Ensure we can simply swap them
Labels
CUDA Next (Feature intended for the CUDA Next experimental library), feature request (New feature or request)
Projects
Status: Done

2 participants