Skip to content

Conversation

@github-actions
Copy link
Contributor

Description

Backport of #7239 to branch/3.2.x.

* Fix extracting CUDA stream in cub::DeviceTransform

#6204 changed cub::DeviceTransform APIs from taking cudaStream_t to environments. Special handling preserved support for cudaStream_t. However, user-provided stream types with conversion operators to cudaStream_t were now queried as environments, failing to return a stream.

* Properly use `get_stream` in device transform

We should not have a special overload for the `get_stream` overload but should ensure that we can pass types that are convertible to `::cudaStream_t` and extract that stream

I have expanded the get_stream CPO to accept a `::cudaStream_t __stream` so that we can work with those types

Co-authored-by: Michael Schellenberger Costa <[email protected]>
(cherry picked from commit 31f8a13)
@copy-pr-bot
Copy link
Contributor

copy-pr-bot bot commented Jan 16, 2026

This pull request requires additional validation before any workflows can run on NVIDIA's runners.

Pull request vetters can view their responsibilities here.

Contributors can view more details about this message here.

@miscco
Copy link
Contributor

miscco commented Jan 16, 2026

Has already been backported manually with #7263

@miscco miscco closed this Jan 16, 2026
@github-project-automation github-project-automation bot moved this from Todo to Done in CCCL Jan 16, 2026
@miscco miscco deleted the backport-7239-to-branch/3.2.x branch January 16, 2026 16:19
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Status: Done

Development

Successfully merging this pull request may close these issues.

3 participants