Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate CUDASTF -> CudaX #2572

Merged
merged 364 commits into from
Oct 24, 2024
Merged

Integrate CUDASTF -> CudaX #2572

merged 364 commits into from
Oct 24, 2024

Conversation

caugonnet
Copy link
Contributor

@caugonnet caugonnet commented Oct 13, 2024

Description

closes

Checklist

  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.

Copy link

copy-pr-bot bot commented Oct 13, 2024

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@caugonnet
Copy link
Contributor Author

/ok to test

8 similar comments
@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test


#pragma once

#include <cuda/experimental/__stf/allocators/block_allocator.cuh>
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We try and include everything we need and avoid transitive includes. That makes it much simpler to work with a large project later

Copy link
Contributor Author

@caugonnet caugonnet Oct 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When you say transitive includes; that's only for these few headers ?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I mean that every header should include everything it needs

cudax/include/cuda/experimental/__stf/utility/traits.cuh Outdated Show resolved Hide resolved
@caugonnet
Copy link
Contributor Author

/ok to test

16 similar comments
@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@caugonnet
Copy link
Contributor Author

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

Copy link
Contributor

🟨 CI finished in 32m 19s: Pass: 73%/52 | Total: 10h 33m | Avg: 12m 11s | Max: 19m 25s
  • 🟨 cudax: Pass: 73%/52 | Total: 10h 33m | Avg: 12m 11s | Max: 19m 25s

    🔍 cpu: amd64 🔍
      🔍 amd64              Pass:  70%/48  | Total:  9h 44m | Avg: 12m 10s | Max: 19m 25s
      🟩 arm64              Pass: 100%/4   | Total: 49m 39s | Avg: 12m 24s | Max: 13m 39s
    🟨 cxx
      🟥 Clang9             Pass:   0%/2   | Total:  4m 38s | Avg:  2m 19s | Max:  2m 19s
      🟥 Clang10            Pass:   0%/2   | Total:  4m 45s | Avg:  2m 22s | Max:  2m 26s
      🟨 Clang11            Pass:  50%/4   | Total: 34m 50s | Avg:  8m 42s | Max: 15m 50s
      🟩 Clang12            Pass: 100%/4   | Total: 51m 24s | Avg: 12m 51s | Max: 13m 11s
      🟩 Clang13            Pass: 100%/4   | Total: 50m 53s | Avg: 12m 43s | Max: 13m 23s
      🟨 Clang14            Pass:  75%/4   | Total: 52m 39s | Avg: 13m 09s | Max: 15m 04s
      🟩 Clang15            Pass: 100%/2   | Total: 28m 17s | Avg: 14m 08s | Max: 15m 29s
      🟩 Clang16            Pass: 100%/4   | Total: 52m 52s | Avg: 13m 13s | Max: 14m 47s
      🟩 Clang17            Pass: 100%/2   | Total: 29m 24s | Avg: 14m 42s | Max: 15m 13s
      🟨 Clang18            Pass:  50%/2   | Total: 29m 11s | Avg: 14m 35s | Max: 15m 09s
      🟩 GCC9               Pass: 100%/2   | Total: 26m 26s | Avg: 13m 13s | Max: 13m 34s
      🟩 GCC10              Pass: 100%/4   | Total: 52m 11s | Avg: 13m 02s | Max: 14m 34s
      🟨 GCC11              Pass:  75%/4   | Total: 59m 58s | Avg: 14m 59s | Max: 19m 25s
      🟨 GCC12              Pass:  57%/7   | Total:  1h 37m | Avg: 13m 57s | Max: 16m 25s
      🟩 GCC13              Pass: 100%/3   | Total: 35m 40s | Avg: 11m 53s | Max: 13m 39s
      🟥 MSVC14.36          Pass:   0%/1   | Total: 11m 15s | Avg: 11m 15s | Max: 11m 15s
      🟥 MSVC14.39          Pass:   0%/1   | Total: 11m 37s | Avg: 11m 37s | Max: 11m 37s
    🟨 cudacxx_family
      🟨 nvcc               Pass:  73%/52  | Total: 10h 33m | Avg: 12m 11s | Max: 19m 25s
    🟨 gpu
      🟨 v100               Pass:  73%/52  | Total: 10h 33m | Avg: 12m 11s | Max: 19m 25s
    🟨 ctk
      🟨 12.0               Pass:  63%/19  | Total:  3h 39m | Avg: 11m 32s | Max: 17m 18s
      🟨 12.6               Pass:  78%/33  | Total:  6h 54m | Avg: 12m 33s | Max: 19m 25s
    🟨 cudacxx
      🟨 nvcc12.0           Pass:  63%/19  | Total:  3h 39m | Avg: 11m 32s | Max: 17m 18s
      🟨 nvcc12.6           Pass:  78%/33  | Total:  6h 54m | Avg: 12m 33s | Max: 19m 25s
    🟨 cxx_family
      🟨 Clang              Pass:  73%/30  | Total:  5h 38m | Avg: 11m 17s | Max: 15m 50s
      🟨 GCC                Pass:  80%/20  | Total:  4h 31m | Avg: 13m 35s | Max: 19m 25s
      🟥 MSVC               Pass:   0%/2   | Total: 22m 52s | Avg: 11m 26s | Max: 11m 37s
    🟨 jobs
      🟨 Build              Pass:  80%/47  | Total:  9h 18m | Avg: 11m 52s | Max: 19m 25s
      🟥 Test               Pass:   0%/5   | Total:  1h 15m | Avg: 15m 03s | Max: 15m 23s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total: 10m 05s | Avg: 10m 05s | Max: 10m 05s
      🟩 90a                Pass: 100%/1   | Total: 10m 24s | Avg: 10m 24s | Max: 10m 24s
    🟨 std
      🟨 17                 Pass:  71%/28  | Total:  4h 57m | Avg: 10m 36s | Max: 17m 18s
      🟨 20                 Pass:  75%/24  | Total:  5h 36m | Avg: 14m 01s | Max: 19m 25s
    

🛠️ Override Matrix

[
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "12.0",
      "curr"
    ],
    "std": "all",
    "cxx": [
      "gcc9",
      "gcc10",
      "gcc11"
    ]
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "12.0",
      "curr"
    ],
    "std": "all",
    "cxx": [
      "clang9",
      "clang10",
      "clang11",
      "clang12",
      "clang13"
    ]
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": "all",
    "cxx": [
      "clang14",
      "clang15",
      "clang16",
      "clang17"
    ]
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "12.0"
    ],
    "std": 20,
    "cxx": [
      "msvc14.36"
    ]
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": 20,
    "cxx": [
      "msvc2022"
    ]
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "12.0"
    ],
    "std": 17,
    "cxx": [
      "gcc12"
    ],
    "sm": "90"
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": 17,
    "cxx": [
      "gcc13"
    ],
    "sm": "90a"
  },
  {
    "jobs": [
      "build"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": "all",
    "cxx": [
      "gcc13",
      "clang16"
    ],
    "cpu": "arm64"
  },
  {
    "jobs": [
      "test"
    ],
    "project": "cudax",
    "ctk": [
      "12.0"
    ],
    "std": "min",
    "cxx": [
      "gcc12"
    ]
  },
  {
    "jobs": [
      "test"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": "all",
    "cxx": [
      "gcc12"
    ]
  },
  {
    "jobs": [
      "test"
    ],
    "project": "cudax",
    "ctk": [
      "12.0"
    ],
    "std": "max",
    "cxx": [
      "clang14"
    ]
  },
  {
    "jobs": [
      "test"
    ],
    "project": "cudax",
    "ctk": [
      "curr"
    ],
    "std": "max",
    "cxx": [
      "clang18"
    ]
  }
]

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CCCL C Parallel Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CCCL C Parallel Library

🏃‍ Runner counts (total jobs: 52)

# Runner
41 linux-amd64-cpu16
5 linux-amd64-gpu-v100-latest-1
4 linux-arm64-cpu16
2 windows-amd64-cpu16

@miscco
Copy link
Collaborator

miscco commented Oct 15, 2024

/ok to test

@miscco
Copy link
Collaborator

miscco commented Oct 23, 2024

pre-commit.ci autofix

@miscco
Copy link
Collaborator

miscco commented Oct 23, 2024

/ok to test

ci/matrix.yaml Outdated Show resolved Hide resolved
@miscco
Copy link
Collaborator

miscco commented Oct 23, 2024

/ok to test

Copy link
Contributor

🟩 CI finished in 1h 10m: Pass: 100%/372 | Total: 1d 16h | Avg: 6m 31s | Max: 39m 38s | Hits: 94%/27969
  • 🟩 cub: Pass: 100%/104 | Total: 11h 48m | Avg: 6m 48s | Max: 37m 21s | Hits: 99%/2916

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 12m | Avg:  7m 00s | Max: 37m 21s | Hits:  99%/2916  
      🟩 arm64              Pass: 100%/8   | Total: 35m 50s | Avg:  4m 28s | Max:  4m 49s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 11m | Avg:  4m 47s | Max: 15m 51s | Hits:  99%/729   
      🟩 11.8               Pass: 100%/3   | Total: 15m 26s | Avg:  5m 08s | Max:  5m 24s
      🟩 12.6               Pass: 100%/86  | Total: 10h 20m | Avg:  7m 13s | Max: 37m 21s | Hits:  99%/2187  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 09s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 11m | Avg:  4m 47s | Max: 15m 51s | Hits:  99%/729   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 26s | Avg:  5m 08s | Max:  5m 24s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 12m | Avg:  7m 17s | Max: 37m 21s | Hits:  99%/2187  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  7m 57s | Avg:  3m 58s | Max:  4m 09s
      🟩 nvcc               Pass: 100%/102 | Total: 11h 40m | Avg:  6m 51s | Max: 37m 21s | Hits:  99%/2916  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 55s | Avg:  4m 59s | Max:  6m 18s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 54s | Avg:  5m 58s | Max:  6m 06s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 30s | Avg:  4m 52s | Max:  5m 13s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 27s | Avg:  5m 06s | Max:  5m 09s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 21s | Avg:  4m 50s | Max:  5m 06s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 46s | Avg:  4m 56s | Max:  5m 01s
      🟩 Clang15            Pass: 100%/4   | Total: 20m 33s | Avg:  5m 08s | Max:  5m 32s
      🟩 Clang16            Pass: 100%/4   | Total: 20m 29s | Avg:  5m 07s | Max:  5m 27s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 46s | Avg:  5m 11s | Max:  5m 35s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 21m | Avg:  9m 00s | Max: 29m 01s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 58s | Avg:  3m 59s | Max:  3m 59s
      🟩 GCC7               Pass: 100%/6   | Total: 23m 59s | Avg:  3m 59s | Max:  4m 23s
      🟩 GCC8               Pass: 100%/6   | Total: 25m 19s | Avg:  4m 13s | Max:  4m 33s
      🟩 GCC9               Pass: 100%/6   | Total: 28m 08s | Avg:  4m 41s | Max:  5m 24s
      🟩 GCC10              Pass: 100%/4   | Total: 19m 24s | Avg:  4m 51s | Max:  5m 11s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 59s | Avg:  4m 59s | Max:  5m 24s
      🟩 GCC12              Pass: 100%/4   | Total: 19m 44s | Avg:  4m 56s | Max:  5m 20s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 22m | Avg: 12m 41s | Max: 37m 21s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 16m 33s | Avg:  5m 31s | Max:  5m 54s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 51s | Avg: 15m 51s | Max: 15m 51s | Hits:  99%/729   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 27m 33s | Avg: 13m 46s | Max: 14m 07s | Hits:  99%/1458  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 16m 03s | Avg: 16m 03s | Max: 16m 03s | Hits:  99%/729   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 29m | Avg:  5m 51s | Max: 29m 01s
      🟩 GCC                Pass: 100%/51  | Total:  6h 02m | Avg:  7m 06s | Max: 37m 21s
      🟩 Intel              Pass: 100%/3   | Total: 16m 33s | Avg:  5m 31s | Max:  5m 54s
      🟩 MSVC               Pass: 100%/4   | Total: 59m 27s | Avg: 14m 51s | Max: 16m 03s | Hits:  99%/2916  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 48m | Avg:  6m 48s | Max: 37m 21s | Hits:  99%/2916  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 20m | Avg:  5m 12s | Max: 16m 03s | Hits:  99%/2916  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 20m 12s | Avg: 20m 12s | Max: 20m 12s
      🟩 GraphCapture       Pass: 100%/1   | Total: 27m 38s | Avg: 27m 38s | Max: 27m 38s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 10m | Avg: 23m 20s | Max: 30m 06s
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 30m | Avg: 30m 04s | Max: 37m 21s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 26s | Avg:  5m 08s | Max:  5m 24s
      🟩 90a                Pass: 100%/4   | Total: 17m 02s | Avg:  4m 15s | Max:  4m 45s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 43m | Avg:  5m 50s | Max: 23m 52s
      🟩 14                 Pass: 100%/27  | Total:  2h 29m | Avg:  5m 31s | Max: 15m 51s | Hits:  99%/1458  
      🟩 17                 Pass: 100%/26  | Total:  2h 15m | Avg:  5m 12s | Max: 13m 26s | Hits:  99%/729   
      🟩 20                 Pass: 100%/23  | Total:  4h 20m | Avg: 11m 19s | Max: 37m 21s | Hits:  99%/729   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 11h 56m | Avg: 6m 53s | Max: 39m 38s | Hits: 87%/11736

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 27m | Avg:  7m 09s | Max: 39m 38s | Hits:  87%/11736 
      🟩 arm64              Pass: 100%/8   | Total: 29m 37s | Avg:  3m 42s | Max:  4m 18s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 23m | Avg:  5m 34s | Max: 37m 48s | Hits:  48%/2737  
      🟩 11.8               Pass: 100%/3   | Total: 38m 35s | Avg: 12m 51s | Max: 30m 54s
      🟩 12.6               Pass: 100%/86  | Total:  9h 54m | Avg:  6m 54s | Max: 39m 38s | Hits:  99%/8999  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 43m 11s | Avg: 21m 35s | Max: 22m 20s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 23m | Avg:  5m 34s | Max: 37m 48s | Hits:  48%/2737  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 38m 35s | Avg: 12m 51s | Max: 30m 54s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 11m | Avg:  6m 33s | Max: 39m 38s | Hits:  99%/8999  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 43m 11s | Avg: 21m 35s | Max: 22m 20s
      🟩 nvcc               Pass: 100%/102 | Total: 11h 13m | Avg:  6m 36s | Max: 39m 38s | Hits:  87%/11736 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 20s | Avg:  4m 33s | Max:  6m 06s
      🟩 Clang10            Pass: 100%/3   | Total: 17m 48s | Avg:  5m 56s | Max:  6m 22s
      🟩 Clang11            Pass: 100%/4   | Total: 18m 14s | Avg:  4m 33s | Max:  4m 58s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 19s | Avg:  4m 34s | Max:  4m 50s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 47s | Avg:  4m 56s | Max:  5m 17s
      🟩 Clang14            Pass: 100%/4   | Total: 19m 13s | Avg:  4m 48s | Max:  5m 41s
      🟩 Clang15            Pass: 100%/4   | Total: 18m 54s | Avg:  4m 43s | Max:  5m 12s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 40s | Avg:  4m 40s | Max:  4m 53s
      🟩 Clang17            Pass: 100%/4   | Total: 18m 49s | Avg:  4m 42s | Max:  5m 02s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 21m | Avg: 10m 07s | Max: 22m 20s
      🟩 GCC6               Pass: 100%/2   | Total:  6m 14s | Avg:  3m 07s | Max:  3m 09s
      🟩 GCC7               Pass: 100%/6   | Total: 19m 52s | Avg:  3m 18s | Max:  3m 47s
      🟩 GCC8               Pass: 100%/6   | Total: 22m 24s | Avg:  3m 44s | Max:  4m 27s
      🟩 GCC9               Pass: 100%/6   | Total: 20m 53s | Avg:  3m 28s | Max:  4m 12s
      🟩 GCC10              Pass: 100%/4   | Total: 15m 50s | Avg:  3m 57s | Max:  4m 17s
      🟩 GCC11              Pass: 100%/7   | Total: 53m 52s | Avg:  7m 41s | Max: 30m 54s
      🟩 GCC12              Pass: 100%/4   | Total: 16m 40s | Avg:  4m 10s | Max:  4m 40s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 18m | Avg: 11m 40s | Max: 39m 38s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 47s | Avg:  6m 15s | Max:  7m 09s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 37m 48s | Avg: 37m 48s | Max: 37m 48s | Hits:  48%/2737  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 31m 50s | Avg: 15m 55s | Max: 16m 44s | Hits:  99%/5835  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 16m 08s | Avg: 16m 08s | Max: 16m 08s | Hits:  99%/3164  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 18m | Avg:  5m 44s | Max: 22m 20s
      🟩 GCC                Pass: 100%/52  | Total:  5h 54m | Avg:  6m 48s | Max: 39m 38s
      🟩 Intel              Pass: 100%/3   | Total: 18m 47s | Avg:  6m 15s | Max:  7m 09s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 25m | Avg: 21m 26s | Max: 37m 48s | Hits:  87%/11736 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 56m | Avg:  6m 53s | Max: 39m 38s | Hits:  87%/11736 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 58m | Avg:  5m 36s | Max: 37m 48s | Hits:  87%/11736 
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 05m | Avg: 31m 28s | Max: 39m 38s
      🟩 Test               Pass: 100%/3   | Total: 50m 26s | Avg: 16m 48s | Max: 21m 11s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  1m 59s | Avg:  1m 59s | Max:  1m 59s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 38m 35s | Avg: 12m 51s | Max: 30m 54s
      🟩 90a                Pass: 100%/4   | Total: 15m 31s | Avg:  3m 52s | Max:  4m 16s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 16m | Avg:  4m 51s | Max: 24m 30s
      🟩 14                 Pass: 100%/28  | Total:  3h 02m | Avg:  6m 30s | Max: 37m 48s | Hits:  74%/5576  
      🟩 17                 Pass: 100%/27  | Total:  3h 32m | Avg:  7m 52s | Max: 39m 38s | Hits:  99%/2996  
      🟩 20                 Pass: 100%/20  | Total:  3h 03m | Avg:  9m 11s | Max: 39m 07s | Hits:  99%/3164  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 11h 17m | Avg: 6m 34s | Max: 27m 00s | Hits: 99%/13095

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 10h 39m | Avg:  6m 43s | Max: 27m 00s | Hits:  99%/13095 
      🟩 arm64              Pass: 100%/8   | Total: 38m 02s | Avg:  4m 45s | Max:  5m 11s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 25m | Avg:  5m 42s | Max: 22m 48s | Hits:  99%/2619  
      🟩 11.8               Pass: 100%/3   | Total: 16m 08s | Avg:  5m 22s | Max:  5m 39s
      🟩 12.6               Pass: 100%/85  | Total:  9h 35m | Avg:  6m 46s | Max: 27m 00s | Hits:  99%/10476 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 33s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 25m | Avg:  5m 42s | Max: 22m 48s | Hits:  99%/2619  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 08s | Avg:  5m 22s | Max:  5m 39s
      🟩 nvcc12.6           Pass: 100%/83  | Total:  9h 24m | Avg:  6m 47s | Max: 27m 00s | Hits:  99%/10476 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 11m 00s | Avg:  5m 30s | Max:  5m 33s
      🟩 nvcc               Pass: 100%/101 | Total: 11h 06m | Avg:  6m 35s | Max: 27m 00s | Hits:  99%/13095 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 32m 22s | Avg:  5m 23s | Max:  6m 31s
      🟩 Clang10            Pass: 100%/3   | Total: 20m 39s | Avg:  6m 53s | Max:  7m 38s
      🟩 Clang11            Pass: 100%/4   | Total: 21m 41s | Avg:  5m 25s | Max:  5m 34s
      🟩 Clang12            Pass: 100%/4   | Total: 22m 10s | Avg:  5m 32s | Max:  5m 53s
      🟩 Clang13            Pass: 100%/4   | Total: 22m 42s | Avg:  5m 40s | Max:  6m 35s
      🟩 Clang14            Pass: 100%/4   | Total: 22m 14s | Avg:  5m 33s | Max:  6m 08s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 15s | Avg:  5m 18s | Max:  5m 40s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 03s | Avg:  5m 15s | Max:  5m 48s
      🟩 Clang17            Pass: 100%/4   | Total: 21m 16s | Avg:  5m 19s | Max:  5m 46s
      🟩 Clang18            Pass: 100%/9   | Total: 56m 07s | Avg:  6m 14s | Max: 14m 05s
      🟩 GCC6               Pass: 100%/2   | Total:  8m 50s | Avg:  4m 25s | Max:  4m 45s
      🟩 GCC7               Pass: 100%/6   | Total: 28m 20s | Avg:  4m 43s | Max:  5m 06s
      🟩 GCC8               Pass: 100%/6   | Total: 29m 32s | Avg:  4m 55s | Max:  5m 52s
      🟩 GCC9               Pass: 100%/6   | Total: 30m 30s | Avg:  5m 05s | Max:  6m 12s
      🟩 GCC10              Pass: 100%/4   | Total: 22m 07s | Avg:  5m 31s | Max:  6m 21s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 05s | Avg:  5m 35s | Max:  5m 52s
      🟩 GCC12              Pass: 100%/4   | Total: 22m 38s | Avg:  5m 39s | Max:  6m 10s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 42m | Avg:  7m 18s | Max: 22m 58s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 58s | Avg:  6m 39s | Max:  7m 00s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 22m 48s | Avg: 22m 48s | Max: 22m 48s | Hits:  99%/2619  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 41m 11s | Avg: 20m 35s | Max: 21m 19s | Hits:  99%/5238  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 48m 10s | Avg: 24m 05s | Max: 27m 00s | Hits:  99%/5238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 21m | Avg:  5m 41s | Max: 14m 05s
      🟩 GCC                Pass: 100%/49  | Total:  4h 43m | Avg:  5m 47s | Max: 22m 58s
      🟩 Intel              Pass: 100%/3   | Total: 19m 58s | Avg:  6m 39s | Max:  7m 00s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 52m | Avg: 22m 25s | Max: 27m 00s | Hits:  99%/13095 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 11h 17m | Avg:  6m 34s | Max: 27m 00s | Hits:  99%/13095 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  9h 35m | Avg:  5m 59s | Max: 22m 48s | Hits:  99%/10476 
      🟩 TestCPU            Pass: 100%/4   | Total: 51m 18s | Avg: 12m 49s | Max: 27m 00s | Hits:  99%/2619  
      🟩 TestGPU            Pass: 100%/3   | Total: 50m 13s | Avg: 16m 44s | Max: 22m 58s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 08s | Avg:  5m 22s | Max:  5m 39s
      🟩 90a                Pass: 100%/4   | Total: 18m 10s | Avg:  4m 32s | Max:  4m 42s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 41m | Avg:  5m 46s | Max: 22m 58s
      🟩 14                 Pass: 100%/27  | Total:  2h 53m | Avg:  6m 26s | Max: 22m 48s | Hits:  99%/5238  
      🟩 17                 Pass: 100%/26  | Total:  2h 42m | Avg:  6m 14s | Max: 21m 19s | Hits:  99%/2619  
      🟩 20                 Pass: 100%/22  | Total:  2h 59m | Avg:  8m 08s | Max: 27m 00s | Hits:  99%/5238  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 4h 26m | Avg: 5m 07s | Max: 29m 02s | Hits: 89%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total:  4h 15m | Avg:  5m 19s | Max: 29m 02s | Hits:  89%/222   
      🟩 arm64              Pass: 100%/4   | Total: 10m 27s | Avg:  2m 36s | Max:  2m 41s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  1h 33m | Avg:  4m 54s | Max: 19m 02s | Hits:  89%/111   
      🟩 12.6               Pass: 100%/33  | Total:  2h 53m | Avg:  5m 14s | Max: 29m 02s | Hits:  89%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  1h 33m | Avg:  4m 54s | Max: 19m 02s | Hits:  89%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  2h 53m | Avg:  5m 14s | Max: 29m 02s | Hits:  89%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total:  4h 26m | Avg:  5m 07s | Max: 29m 02s | Hits:  89%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 49s
      🟩 Clang10            Pass: 100%/2   | Total:  6m 42s | Avg:  3m 21s | Max:  3m 34s
      🟩 Clang11            Pass: 100%/4   | Total: 12m 26s | Avg:  3m 06s | Max:  3m 36s
      🟩 Clang12            Pass: 100%/4   | Total: 12m 38s | Avg:  3m 09s | Max:  3m 44s
      🟩 Clang13            Pass: 100%/4   | Total: 12m 04s | Avg:  3m 01s | Max:  3m 12s
      🟩 Clang14            Pass: 100%/4   | Total: 28m 19s | Avg:  7m 04s | Max: 19m 02s
      🟩 Clang15            Pass: 100%/2   | Total:  6m 42s | Avg:  3m 21s | Max:  3m 23s
      🟩 Clang16            Pass: 100%/4   | Total: 11m 50s | Avg:  2m 57s | Max:  3m 29s
      🟩 Clang17            Pass: 100%/2   | Total:  6m 27s | Avg:  3m 13s | Max:  3m 22s
      🟩 Clang18            Pass: 100%/2   | Total: 32m 16s | Avg: 16m 08s | Max: 29m 02s
      🟩 GCC9               Pass: 100%/2   | Total:  6m 36s | Avg:  3m 18s | Max:  3m 25s
      🟩 GCC10              Pass: 100%/4   | Total: 11m 57s | Avg:  2m 59s | Max:  3m 15s
      🟩 GCC11              Pass: 100%/4   | Total: 12m 05s | Avg:  3m 01s | Max:  3m 14s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 09m | Avg:  9m 57s | Max: 21m 08s
      🟩 GCC13              Pass: 100%/3   | Total:  8m 36s | Avg:  2m 52s | Max:  3m 21s
      🟩 MSVC14.36          Pass: 100%/1   | Total: 10m 08s | Avg: 10m 08s | Max: 10m 08s | Hits:  89%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 13s | Avg: 11m 13s | Max: 11m 13s | Hits:  89%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  2h 16m | Avg:  4m 32s | Max: 29m 02s
      🟩 GCC                Pass: 100%/20  | Total:  1h 48m | Avg:  5m 26s | Max: 21m 08s
      🟩 MSVC               Pass: 100%/2   | Total: 21m 21s | Avg: 10m 40s | Max: 11m 13s | Hits:  89%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total:  4h 26m | Avg:  5m 07s | Max: 29m 02s | Hits:  89%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total:  2h 40m | Avg:  3m 24s | Max: 11m 13s | Hits:  89%/222   
      🟩 Test               Pass: 100%/5   | Total:  1h 45m | Avg: 21m 10s | Max: 29m 02s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  2m 42s | Avg:  2m 42s | Max:  2m 42s
      🟩 90a                Pass: 100%/1   | Total:  3m 21s | Avg:  3m 21s | Max:  3m 21s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  1h 58m | Avg:  4m 13s | Max: 18m 47s
      🟩 20                 Pass: 100%/24  | Total:  2h 27m | Avg:  6m 09s | Max: 29m 02s | Hits:  89%/222   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 27m 43s | Avg: 4m 37s | Max: 5m 22s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 27m 43s | Avg:  4m 37s | Max:  5m 22s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 26s | Avg:  4m 13s | Max:  4m 25s
      🟩 12.0               Pass: 100%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 55s
      🟩 12.6               Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 22s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 26s | Avg:  4m 13s | Max:  4m 25s
      🟩 nvcc12.0           Pass: 100%/2   | Total:  9m 16s | Avg:  4m 38s | Max:  4m 55s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 10m 01s | Avg:  5m 00s | Max:  5m 22s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 27m 43s | Avg:  4m 37s | Max:  5m 22s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 01s | Avg:  4m 01s | Max:  4m 01s
      🟩 Clang14            Pass: 100%/1   | Total:  4m 55s | Avg:  4m 55s | Max:  4m 55s
      🟩 Clang18            Pass: 100%/1   | Total:  5m 22s | Avg:  5m 22s | Max:  5m 22s
      🟩 GCC6               Pass: 100%/1   | Total:  4m 25s | Avg:  4m 25s | Max:  4m 25s
      🟩 GCC12              Pass: 100%/1   | Total:  4m 21s | Avg:  4m 21s | Max:  4m 21s
      🟩 GCC13              Pass: 100%/1   | Total:  4m 39s | Avg:  4m 39s | Max:  4m 39s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 14m 18s | Avg:  4m 46s | Max:  5m 22s
      🟩 GCC                Pass: 100%/3   | Total: 13m 25s | Avg:  4m 28s | Max:  4m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 27m 43s | Avg:  4m 37s | Max:  5m 22s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 27m 43s | Avg:  4m 37s | Max:  5m 22s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 10m 34s | Avg: 5m 17s | Max: 8m 10s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 ctk
      🟩 12.5               Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 10m 34s | Avg:  5m 17s | Max:  8m 10s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 24s | Avg:  2m 24s | Max:  2m 24s
      🟩 Test               Pass: 100%/1   | Total:  8m 10s | Avg:  8m 10s | Max:  8m 10s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 17m 43s | Avg: 17m 43s | Max: 17m 43s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CCCL C Parallel Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CCCL C Parallel Library

🏃‍ Runner counts (total jobs: 372)

# Runner
298 linux-amd64-cpu16
31 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@miscco
Copy link
Collaborator

miscco commented Oct 24, 2024

pre-commit.ci autofix

@miscco
Copy link
Collaborator

miscco commented Oct 24, 2024

/ok to test

@miscco miscco enabled auto-merge (squash) October 24, 2024 06:35
@miscco miscco disabled auto-merge October 24, 2024 06:42
@miscco miscco enabled auto-merge (squash) October 24, 2024 06:45
Copy link
Contributor

🟩 CI finished in 1h 07m: Pass: 100%/372 | Total: 2d 02h | Avg: 8m 03s | Max: 59m 34s | Hits: 99%/27969
  • 🟩 cub: Pass: 100%/104 | Total: 12h 26m | Avg: 7m 10s | Max: 59m 34s | Hits: 99%/2916

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 49m | Avg:  7m 23s | Max: 59m 34s | Hits:  99%/2916  
      🟩 arm64              Pass: 100%/8   | Total: 36m 40s | Avg:  4m 35s | Max:  6m 10s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 10m | Avg:  4m 43s | Max: 15m 15s | Hits:  99%/729   
      🟩 11.8               Pass: 100%/3   | Total: 15m 17s | Avg:  5m 05s | Max:  5m 16s
      🟩 12.6               Pass: 100%/86  | Total: 10h 59m | Avg:  7m 40s | Max: 59m 34s | Hits:  99%/2187  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 09s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 10m | Avg:  4m 43s | Max: 15m 15s | Hits:  99%/729   
      🟩 nvcc11.8           Pass: 100%/3   | Total: 15m 17s | Avg:  5m 05s | Max:  5m 16s
      🟩 nvcc12.6           Pass: 100%/84  | Total: 10h 51m | Avg:  7m 45s | Max: 59m 34s | Hits:  99%/2187  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total:  8m 09s | Avg:  4m 04s | Max:  4m 09s
      🟩 nvcc               Pass: 100%/102 | Total: 12h 18m | Avg:  7m 14s | Max: 59m 34s | Hits:  99%/2916  
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 29m 03s | Avg:  4m 50s | Max:  6m 12s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 58s | Avg:  5m 39s | Max:  6m 06s
      🟩 Clang11            Pass: 100%/4   | Total: 19m 30s | Avg:  4m 52s | Max:  5m 02s
      🟩 Clang12            Pass: 100%/4   | Total: 19m 25s | Avg:  4m 51s | Max:  5m 10s
      🟩 Clang13            Pass: 100%/4   | Total: 19m 01s | Avg:  4m 45s | Max:  4m 50s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 41s | Avg:  4m 40s | Max:  4m 46s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 39s | Avg:  4m 54s | Max:  5m 11s
      🟩 Clang16            Pass: 100%/4   | Total: 21m 02s | Avg:  5m 15s | Max:  5m 30s
      🟩 Clang17            Pass: 100%/4   | Total: 20m 26s | Avg:  5m 06s | Max:  5m 29s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 38m | Avg: 10m 57s | Max: 45m 34s
      🟩 GCC6               Pass: 100%/2   | Total:  7m 42s | Avg:  3m 51s | Max:  4m 05s
      🟩 GCC7               Pass: 100%/6   | Total: 25m 24s | Avg:  4m 14s | Max:  4m 47s
      🟩 GCC8               Pass: 100%/6   | Total: 26m 04s | Avg:  4m 20s | Max:  5m 01s
      🟩 GCC9               Pass: 100%/6   | Total: 26m 19s | Avg:  4m 23s | Max:  5m 04s
      🟩 GCC10              Pass: 100%/4   | Total: 18m 03s | Avg:  4m 30s | Max:  4m 38s
      🟩 GCC11              Pass: 100%/7   | Total: 34m 10s | Avg:  4m 52s | Max:  5m 16s
      🟩 GCC12              Pass: 100%/4   | Total: 20m 21s | Avg:  5m 05s | Max:  5m 37s
      🟩 GCC13              Pass: 100%/16  | Total:  3h 51m | Avg: 14m 27s | Max: 59m 34s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 18m 09s | Avg:  6m 03s | Max:  6m 13s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 15m 15s | Avg: 15m 15s | Max: 15m 15s | Hits:  99%/729   
      🟩 MSVC14.29          Pass: 100%/2   | Total: 26m 24s | Avg: 13m 12s | Max: 13m 20s | Hits:  99%/1458  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 14m 43s | Avg: 14m 43s | Max: 14m 43s | Hits:  99%/729   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 42m | Avg:  6m 08s | Max: 45m 34s
      🟩 GCC                Pass: 100%/51  | Total:  6h 29m | Avg:  7m 38s | Max: 59m 34s
      🟩 Intel              Pass: 100%/3   | Total: 18m 09s | Avg:  6m 03s | Max:  6m 13s
      🟩 MSVC               Pass: 100%/4   | Total: 56m 22s | Avg: 14m 05s | Max: 15m 15s | Hits:  99%/2916  
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 12h 26m | Avg:  7m 10s | Max: 59m 34s | Hits:  99%/2916  
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 12m | Avg:  5m 07s | Max: 15m 15s | Hits:  99%/2916  
      🟩 DeviceLaunch       Pass: 100%/1   | Total: 25m 45s | Avg: 25m 45s | Max: 25m 45s
      🟩 GraphCapture       Pass: 100%/1   | Total: 24m 18s | Avg: 24m 18s | Max: 24m 18s
      🟩 HostLaunch         Pass: 100%/3   | Total:  1h 12m | Avg: 24m 01s | Max: 27m 27s
      🟩 TestGPU            Pass: 100%/3   | Total:  2h 11m | Avg: 43m 59s | Max: 59m 34s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 15m 17s | Avg:  5m 05s | Max:  5m 16s
      🟩 90a                Pass: 100%/4   | Total: 15m 53s | Avg:  3m 58s | Max:  4m 05s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  3h 22m | Avg:  7m 13s | Max: 59m 34s
      🟩 14                 Pass: 100%/27  | Total:  2h 25m | Avg:  5m 22s | Max: 15m 15s | Hits:  99%/1458  
      🟩 17                 Pass: 100%/26  | Total:  2h 13m | Avg:  5m 07s | Max: 13m 20s | Hits:  99%/729   
      🟩 20                 Pass: 100%/23  | Total:  4h 25m | Avg: 11m 33s | Max: 45m 34s | Hits:  99%/729   
    
  • 🟩 libcudacxx: Pass: 100%/104 | Total: 11h 39m | Avg: 6m 43s | Max: 43m 39s | Hits: 99%/11736

    🟩 cpu
      🟩 amd64              Pass: 100%/96  | Total: 11h 09m | Avg:  6m 58s | Max: 43m 39s | Hits:  99%/11736 
      🟩 arm64              Pass: 100%/8   | Total: 30m 27s | Avg:  3m 48s | Max:  4m 44s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 22m 03s | Hits:  99%/2737  
      🟩 11.8               Pass: 100%/3   | Total: 10m 36s | Avg:  3m 32s | Max:  3m 50s
      🟩 12.6               Pass: 100%/86  | Total: 10h 21m | Avg:  7m 13s | Max: 43m 39s | Hits:  99%/8999  
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 39m 52s | Avg: 19m 56s | Max: 20m 07s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 07m | Avg:  4m 31s | Max: 22m 03s | Hits:  99%/2737  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 10m 36s | Avg:  3m 32s | Max:  3m 50s
      🟩 nvcc12.6           Pass: 100%/84  | Total:  9h 41m | Avg:  6m 55s | Max: 43m 39s | Hits:  99%/8999  
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 39m 52s | Avg: 19m 56s | Max: 20m 07s
      🟩 nvcc               Pass: 100%/102 | Total: 10h 59m | Avg:  6m 28s | Max: 43m 39s | Hits:  99%/11736 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 27m 40s | Avg:  4m 36s | Max:  6m 15s
      🟩 Clang10            Pass: 100%/3   | Total: 16m 55s | Avg:  5m 38s | Max:  6m 11s
      🟩 Clang11            Pass: 100%/4   | Total: 17m 35s | Avg:  4m 23s | Max:  4m 42s
      🟩 Clang12            Pass: 100%/4   | Total: 18m 24s | Avg:  4m 36s | Max:  4m 50s
      🟩 Clang13            Pass: 100%/4   | Total: 18m 04s | Avg:  4m 31s | Max:  5m 00s
      🟩 Clang14            Pass: 100%/4   | Total: 18m 30s | Avg:  4m 37s | Max:  4m 55s
      🟩 Clang15            Pass: 100%/4   | Total: 19m 08s | Avg:  4m 47s | Max:  5m 16s
      🟩 Clang16            Pass: 100%/4   | Total: 18m 20s | Avg:  4m 35s | Max:  4m 52s
      🟩 Clang17            Pass: 100%/4   | Total: 19m 37s | Avg:  4m 54s | Max:  5m 19s
      🟩 Clang18            Pass: 100%/8   | Total:  1h 26m | Avg: 10m 52s | Max: 25m 20s
      🟩 GCC6               Pass: 100%/2   | Total:  5m 50s | Avg:  2m 55s | Max:  3m 00s
      🟩 GCC7               Pass: 100%/6   | Total: 21m 44s | Avg:  3m 37s | Max:  4m 37s
      🟩 GCC8               Pass: 100%/6   | Total: 21m 25s | Avg:  3m 34s | Max:  4m 05s
      🟩 GCC9               Pass: 100%/6   | Total: 22m 22s | Avg:  3m 43s | Max:  4m 16s
      🟩 GCC10              Pass: 100%/4   | Total: 17m 06s | Avg:  4m 16s | Max:  4m 49s
      🟩 GCC11              Pass: 100%/7   | Total: 27m 39s | Avg:  3m 57s | Max:  4m 51s
      🟩 GCC12              Pass: 100%/4   | Total: 16m 56s | Avg:  4m 14s | Max:  4m 46s
      🟩 GCC13              Pass: 100%/17  | Total:  3h 40m | Avg: 12m 57s | Max: 43m 39s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 19m 05s | Avg:  6m 21s | Max:  6m 38s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 22m 03s | Avg: 22m 03s | Max: 22m 03s | Hits:  99%/2737  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 28m 19s | Avg: 14m 09s | Max: 14m 55s | Hits:  99%/5835  
      🟩 MSVC14.39          Pass: 100%/1   | Total: 15m 49s | Avg: 15m 49s | Max: 15m 49s | Hits:  99%/3164  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/45  | Total:  4h 21m | Avg:  5m 48s | Max: 25m 20s
      🟩 GCC                Pass: 100%/52  | Total:  5h 53m | Avg:  6m 47s | Max: 43m 39s
      🟩 Intel              Pass: 100%/3   | Total: 19m 05s | Avg:  6m 21s | Max:  6m 38s
      🟩 MSVC               Pass: 100%/4   | Total:  1h 06m | Avg: 16m 32s | Max: 22m 03s | Hits:  99%/11736 
    🟩 gpu
      🟩 v100               Pass: 100%/104 | Total: 11h 39m | Avg:  6m 43s | Max: 43m 39s | Hits:  99%/11736 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total:  8h 14m | Avg:  5m 08s | Max: 22m 03s | Hits:  99%/11736 
      🟩 NVRTC              Pass: 100%/4   | Total:  2h 06m | Avg: 31m 36s | Max: 43m 39s
      🟩 Test               Pass: 100%/3   | Total:  1h 16m | Avg: 25m 36s | Max: 30m 45s
      🟩 VerifyCodegen      Pass: 100%/1   | Total:  2m 29s | Avg:  2m 29s | Max:  2m 29s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 10m 36s | Avg:  3m 32s | Max:  3m 50s
      🟩 90a                Pass: 100%/4   | Total: 17m 33s | Avg:  4m 23s | Max:  4m 39s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 30m | Avg:  5m 21s | Max: 30m 45s
      🟩 14                 Pass: 100%/28  | Total:  3h 03m | Avg:  6m 33s | Max: 40m 27s | Hits:  99%/5576  
      🟩 17                 Pass: 100%/27  | Total:  2h 46m | Avg:  6m 09s | Max: 21m 13s | Hits:  99%/2996  
      🟩 20                 Pass: 100%/20  | Total:  3h 17m | Avg:  9m 51s | Max: 43m 39s | Hits:  99%/3164  
    
  • 🟩 thrust: Pass: 100%/103 | Total: 12h 29m | Avg: 7m 16s | Max: 43m 32s | Hits: 99%/13095

    🟩 cpu
      🟩 amd64              Pass: 100%/95  | Total: 11h 52m | Avg:  7m 29s | Max: 43m 32s | Hits:  99%/13095 
      🟩 arm64              Pass: 100%/8   | Total: 37m 13s | Avg:  4m 39s | Max:  5m 33s
    🟩 ctk
      🟩 11.1               Pass: 100%/15  | Total:  1h 58m | Avg:  7m 54s | Max: 35m 35s | Hits:  99%/2619  
      🟩 11.8               Pass: 100%/3   | Total: 16m 44s | Avg:  5m 34s | Max:  6m 07s
      🟩 12.6               Pass: 100%/85  | Total: 10h 14m | Avg:  7m 13s | Max: 43m 32s | Hits:  99%/10476 
    🟩 cudacxx
      🟩 ClangCUDA18        Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 33s
      🟩 nvcc11.1           Pass: 100%/15  | Total:  1h 58m | Avg:  7m 54s | Max: 35m 35s | Hits:  99%/2619  
      🟩 nvcc11.8           Pass: 100%/3   | Total: 16m 44s | Avg:  5m 34s | Max:  6m 07s
      🟩 nvcc12.6           Pass: 100%/83  | Total: 10h 03m | Avg:  7m 16s | Max: 43m 32s | Hits:  99%/10476 
    🟩 cudacxx_family
      🟩 ClangCUDA          Pass: 100%/2   | Total: 10m 45s | Avg:  5m 22s | Max:  5m 33s
      🟩 nvcc               Pass: 100%/101 | Total: 12h 18m | Avg:  7m 18s | Max: 43m 32s | Hits:  99%/13095 
    🟩 cxx
      🟩 Clang9             Pass: 100%/6   | Total: 35m 31s | Avg:  5m 55s | Max:  7m 20s
      🟩 Clang10            Pass: 100%/3   | Total: 19m 52s | Avg:  6m 37s | Max:  7m 11s
      🟩 Clang11            Pass: 100%/4   | Total: 22m 30s | Avg:  5m 37s | Max:  5m 45s
      🟩 Clang12            Pass: 100%/4   | Total: 20m 44s | Avg:  5m 11s | Max:  5m 19s
      🟩 Clang13            Pass: 100%/4   | Total: 20m 57s | Avg:  5m 14s | Max:  5m 39s
      🟩 Clang14            Pass: 100%/4   | Total: 21m 42s | Avg:  5m 25s | Max:  5m 43s
      🟩 Clang15            Pass: 100%/4   | Total: 21m 45s | Avg:  5m 26s | Max:  6m 07s
      🟩 Clang16            Pass: 100%/4   | Total: 22m 13s | Avg:  5m 33s | Max:  6m 04s
      🟩 Clang17            Pass: 100%/4   | Total: 22m 24s | Avg:  5m 36s | Max:  6m 09s
      🟩 Clang18            Pass: 100%/9   | Total:  1h 25m | Avg:  9m 28s | Max: 43m 32s
      🟩 GCC6               Pass: 100%/2   | Total:  9m 24s | Avg:  4m 42s | Max:  4m 44s
      🟩 GCC7               Pass: 100%/6   | Total: 30m 21s | Avg:  5m 03s | Max:  5m 57s
      🟩 GCC8               Pass: 100%/6   | Total: 28m 33s | Avg:  4m 45s | Max:  5m 27s
      🟩 GCC9               Pass: 100%/6   | Total:  1h 00m | Avg: 10m 05s | Max: 35m 35s
      🟩 GCC10              Pass: 100%/4   | Total: 23m 07s | Avg:  5m 46s | Max:  6m 16s
      🟩 GCC11              Pass: 100%/7   | Total: 39m 33s | Avg:  5m 39s | Max:  6m 07s
      🟩 GCC12              Pass: 100%/4   | Total: 24m 54s | Avg:  6m 13s | Max:  6m 36s
      🟩 GCC13              Pass: 100%/14  | Total:  1h 53m | Avg:  8m 07s | Max: 27m 44s
      🟩 Intel2023.2.0      Pass: 100%/3   | Total: 21m 12s | Avg:  7m 04s | Max:  7m 58s
      🟩 MSVC14.16          Pass: 100%/1   | Total: 22m 23s | Avg: 22m 23s | Max: 22m 23s | Hits:  99%/2619  
      🟩 MSVC14.29          Pass: 100%/2   | Total: 38m 54s | Avg: 19m 27s | Max: 20m 33s | Hits:  99%/5238  
      🟩 MSVC14.39          Pass: 100%/2   | Total: 44m 02s | Avg: 22m 01s | Max: 23m 53s | Hits:  99%/5238  
    🟩 cxx_family
      🟩 Clang              Pass: 100%/46  | Total:  4h 52m | Avg:  6m 22s | Max: 43m 32s
      🟩 GCC                Pass: 100%/49  | Total:  5h 30m | Avg:  6m 44s | Max: 35m 35s
      🟩 Intel              Pass: 100%/3   | Total: 21m 12s | Avg:  7m 04s | Max:  7m 58s
      🟩 MSVC               Pass: 100%/5   | Total:  1h 45m | Avg: 21m 03s | Max: 23m 53s | Hits:  99%/13095 
    🟩 gpu
      🟩 v100               Pass: 100%/103 | Total: 12h 29m | Avg:  7m 16s | Max: 43m 32s | Hits:  99%/13095 
    🟩 jobs
      🟩 Build              Pass: 100%/96  | Total: 10h 10m | Avg:  6m 21s | Max: 35m 35s | Hits:  99%/10476 
      🟩 TestCPU            Pass: 100%/4   | Total: 48m 50s | Avg: 12m 12s | Max: 23m 53s | Hits:  99%/2619  
      🟩 TestGPU            Pass: 100%/3   | Total:  1h 29m | Avg: 29m 57s | Max: 43m 32s
    🟩 sm
      🟩 60;70;80;90        Pass: 100%/3   | Total: 16m 44s | Avg:  5m 34s | Max:  6m 07s
      🟩 90a                Pass: 100%/4   | Total: 18m 52s | Avg:  4m 43s | Max:  4m 55s
    🟩 std
      🟩 11                 Pass: 100%/28  | Total:  2h 45m | Avg:  5m 53s | Max: 27m 44s
      🟩 14                 Pass: 100%/27  | Total:  2h 59m | Avg:  6m 38s | Max: 22m 23s | Hits:  99%/5238  
      🟩 17                 Pass: 100%/26  | Total:  3h 11m | Avg:  7m 22s | Max: 35m 35s | Hits:  99%/2619  
      🟩 20                 Pass: 100%/22  | Total:  3h 33m | Avg:  9m 41s | Max: 43m 32s | Hits:  99%/5238  
    
  • 🟩 cudax: Pass: 100%/52 | Total: 12h 08m | Avg: 14m 00s | Max: 23m 40s | Hits: 89%/222

    🟩 cpu
      🟩 amd64              Pass: 100%/48  | Total: 11h 18m | Avg: 14m 07s | Max: 23m 40s | Hits:  89%/222   
      🟩 arm64              Pass: 100%/4   | Total: 50m 19s | Avg: 12m 34s | Max: 13m 22s
    🟩 ctk
      🟩 12.0               Pass: 100%/19  | Total:  4h 18m | Avg: 13m 35s | Max: 19m 14s | Hits:  89%/111   
      🟩 12.6               Pass: 100%/33  | Total:  7h 50m | Avg: 14m 14s | Max: 23m 40s | Hits:  89%/111   
    🟩 cudacxx
      🟩 nvcc12.0           Pass: 100%/19  | Total:  4h 18m | Avg: 13m 35s | Max: 19m 14s | Hits:  89%/111   
      🟩 nvcc12.6           Pass: 100%/33  | Total:  7h 50m | Avg: 14m 14s | Max: 23m 40s | Hits:  89%/111   
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/52  | Total: 12h 08m | Avg: 14m 00s | Max: 23m 40s | Hits:  89%/222   
    🟩 cxx
      🟩 Clang9             Pass: 100%/2   | Total: 27m 35s | Avg: 13m 47s | Max: 14m 00s
      🟩 Clang10            Pass: 100%/2   | Total: 27m 26s | Avg: 13m 43s | Max: 14m 21s
      🟩 Clang11            Pass: 100%/4   | Total: 52m 14s | Avg: 13m 03s | Max: 13m 33s
      🟩 Clang12            Pass: 100%/4   | Total: 53m 22s | Avg: 13m 20s | Max: 14m 04s
      🟩 Clang13            Pass: 100%/4   | Total: 49m 02s | Avg: 12m 15s | Max: 12m 40s
      🟩 Clang14            Pass: 100%/4   | Total:  1h 02m | Avg: 15m 37s | Max: 19m 14s
      🟩 Clang15            Pass: 100%/2   | Total: 28m 26s | Avg: 14m 13s | Max: 14m 36s
      🟩 Clang16            Pass: 100%/4   | Total: 56m 59s | Avg: 14m 14s | Max: 16m 29s
      🟩 Clang17            Pass: 100%/2   | Total: 29m 41s | Avg: 14m 50s | Max: 16m 17s
      🟩 Clang18            Pass: 100%/2   | Total: 39m 09s | Avg: 19m 34s | Max: 23m 40s
      🟩 GCC9               Pass: 100%/2   | Total: 26m 51s | Avg: 13m 25s | Max: 14m 20s
      🟩 GCC10              Pass: 100%/4   | Total: 54m 30s | Avg: 13m 37s | Max: 14m 57s
      🟩 GCC11              Pass: 100%/4   | Total: 55m 50s | Avg: 13m 57s | Max: 15m 22s
      🟩 GCC12              Pass: 100%/7   | Total:  1h 48m | Avg: 15m 29s | Max: 18m 54s
      🟩 GCC13              Pass: 100%/3   | Total: 35m 28s | Avg: 11m 49s | Max: 13m 22s
      🟩 MSVC14.36          Pass: 100%/1   | Total:  9m 19s | Avg:  9m 19s | Max:  9m 19s | Hits:  89%/111   
      🟩 MSVC14.39          Pass: 100%/1   | Total: 11m 33s | Avg: 11m 33s | Max: 11m 33s | Hits:  89%/111   
    🟩 cxx_family
      🟩 Clang              Pass: 100%/30  | Total:  7h 06m | Avg: 14m 12s | Max: 23m 40s
      🟩 GCC                Pass: 100%/20  | Total:  4h 41m | Avg: 14m 03s | Max: 18m 54s
      🟩 MSVC               Pass: 100%/2   | Total: 20m 52s | Avg: 10m 26s | Max: 11m 33s | Hits:  89%/222   
    🟩 gpu
      🟩 v100               Pass: 100%/52  | Total: 12h 08m | Avg: 14m 00s | Max: 23m 40s | Hits:  89%/222   
    🟩 jobs
      🟩 Build              Pass: 100%/47  | Total: 10h 31m | Avg: 13m 26s | Max: 16m 29s | Hits:  89%/222   
      🟩 Test               Pass: 100%/5   | Total:  1h 36m | Avg: 19m 19s | Max: 23m 40s
    🟩 sm
      🟩 90                 Pass: 100%/1   | Total:  9m 15s | Avg:  9m 15s | Max:  9m 15s
      🟩 90a                Pass: 100%/1   | Total: 10m 01s | Avg: 10m 01s | Max: 10m 01s
    🟩 std
      🟩 17                 Pass: 100%/28  | Total:  6h 20m | Avg: 13m 36s | Max: 18m 54s
      🟩 20                 Pass: 100%/24  | Total:  5h 47m | Avg: 14m 28s | Max: 23m 40s | Hits:  89%/222   
    
  • 🟩 cccl: Pass: 100%/6 | Total: 43m 50s | Avg: 7m 18s | Max: 9m 19s

    🟩 cpu
      🟩 amd64              Pass: 100%/6   | Total: 43m 50s | Avg:  7m 18s | Max:  9m 19s
    🟩 ctk
      🟩 11.1               Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 06s
      🟩 12.0               Pass: 100%/2   | Total: 18m 02s | Avg:  9m 01s | Max:  9m 07s
      🟩 12.6               Pass: 100%/2   | Total: 17m 42s | Avg:  8m 51s | Max:  9m 19s
    🟩 cudacxx
      🟩 nvcc11.1           Pass: 100%/2   | Total:  8m 06s | Avg:  4m 03s | Max:  4m 06s
      🟩 nvcc12.0           Pass: 100%/2   | Total: 18m 02s | Avg:  9m 01s | Max:  9m 07s
      🟩 nvcc12.6           Pass: 100%/2   | Total: 17m 42s | Avg:  8m 51s | Max:  9m 19s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/6   | Total: 43m 50s | Avg:  7m 18s | Max:  9m 19s
    🟩 cxx
      🟩 Clang9             Pass: 100%/1   | Total:  4m 06s | Avg:  4m 06s | Max:  4m 06s
      🟩 Clang14            Pass: 100%/1   | Total:  9m 07s | Avg:  9m 07s | Max:  9m 07s
      🟩 Clang18            Pass: 100%/1   | Total:  8m 23s | Avg:  8m 23s | Max:  8m 23s
      🟩 GCC6               Pass: 100%/1   | Total:  4m 00s | Avg:  4m 00s | Max:  4m 00s
      🟩 GCC12              Pass: 100%/1   | Total:  8m 55s | Avg:  8m 55s | Max:  8m 55s
      🟩 GCC13              Pass: 100%/1   | Total:  9m 19s | Avg:  9m 19s | Max:  9m 19s
    🟩 cxx_family
      🟩 Clang              Pass: 100%/3   | Total: 21m 36s | Avg:  7m 12s | Max:  9m 07s
      🟩 GCC                Pass: 100%/3   | Total: 22m 14s | Avg:  7m 24s | Max:  9m 19s
    🟩 gpu
      🟩 v100               Pass: 100%/6   | Total: 43m 50s | Avg:  7m 18s | Max:  9m 19s
    🟩 jobs
      🟩 Infra              Pass: 100%/6   | Total: 43m 50s | Avg:  7m 18s | Max:  9m 19s
    
  • 🟩 cccl_c_parallel: Pass: 100%/2 | Total: 13m 57s | Avg: 6m 58s | Max: 11m 39s

    🟩 cpu
      🟩 amd64              Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 ctk
      🟩 12.5               Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 cxx
      🟩 GCC13              Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 gpu
      🟩 v100               Pass: 100%/2   | Total: 13m 57s | Avg:  6m 58s | Max: 11m 39s
    🟩 jobs
      🟩 Build              Pass: 100%/1   | Total:  2m 18s | Avg:  2m 18s | Max:  2m 18s
      🟩 Test               Pass: 100%/1   | Total: 11m 39s | Avg: 11m 39s | Max: 11m 39s
    
  • 🟩 pycuda: Pass: 100%/1 | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s

    🟩 cpu
      🟩 amd64              Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 ctk
      🟩 12.5               Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 cudacxx
      🟩 nvcc12.5           Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 cudacxx_family
      🟩 nvcc               Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 cxx
      🟩 GCC13              Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 cxx_family
      🟩 GCC                Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 gpu
      🟩 v100               Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    🟩 jobs
      🟩 Test               Pass: 100%/1   | Total: 18m 41s | Avg: 18m 41s | Max: 18m 41s
    

👃 Inspect Changes

Modifications in project?

Project
+/- CCCL Infrastructure
libcu++
CUB
Thrust
+/- CUDA Experimental
pycuda
CCCL C Parallel Library

Modifications in project or dependencies?

Project
+/- CCCL Infrastructure
+/- libcu++
+/- CUB
+/- Thrust
+/- CUDA Experimental
+/- pycuda
+/- CCCL C Parallel Library

🏃‍ Runner counts (total jobs: 372)

# Runner
298 linux-amd64-cpu16
31 linux-amd64-gpu-v100-latest-1
28 linux-arm64-cpu16
15 windows-amd64-cpu16

@miscco miscco merged commit ea37858 into NVIDIA:main Oct 24, 2024
385 checks passed
pciolkosz pushed a commit to pciolkosz/cccl that referenced this pull request Oct 25, 2024
CUDASTF is an implementation of the Sequential Task Flow model for CUDA.

The availability of parallelism within modern hardware has dramatically
increased, with large nodes now featuring multiple accelerators. As a
result, maximizing concurrency at the application level in a scalable
manner has become a crucial priority. To effectively hide latencies, it
is essential to achieve the highest level of asynchrony possible.

CUDASTF introduces a tasking model that automates data transfers while
enforcing implicit data-driven dependencies.

Implemented as a header-only C++ library, CUDASTF builds on top of CUDA
APIs to simplify the development of multi-GPU applications.

CUDASTF is currently capable of generating parallel applications using
either the CUDA stream API or the CUDA graph API.

---------

Co-authored-by: Cédric Augonnet <[email protected]>
Co-authored-by: Andrei Alexandrescu <[email protected]>
fbusato pushed a commit to fbusato/cccl that referenced this pull request Nov 5, 2024
CUDASTF is an implementation of the Sequential Task Flow model for CUDA.

The availability of parallelism within modern hardware has dramatically
increased, with large nodes now featuring multiple accelerators. As a
result, maximizing concurrency at the application level in a scalable
manner has become a crucial priority. To effectively hide latencies, it
is essential to achieve the highest level of asynchrony possible.

CUDASTF introduces a tasking model that automates data transfers while
enforcing implicit data-driven dependencies.

Implemented as a header-only C++ library, CUDASTF builds on top of CUDA
APIs to simplify the development of multi-GPU applications.

CUDASTF is currently capable of generating parallel applications using
either the CUDA stream API or the CUDA graph API.

---------

Co-authored-by: Cédric Augonnet <[email protected]>
Co-authored-by: Andrei Alexandrescu <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

Successfully merging this pull request may close these issues.

10 participants