Skip to content

A bunch of questions in cufinufft with respect to ntransf and streams #323

Answered by blackwer
chaithyagr asked this question in Q&A
Discussion options

You must be logged in to vote
1. When we launch NUFFT with `ntransf>1`, I see we launch all kernels on same stream on GPU in a loop. I was wondering why don't we launch them on a new stream so that they are executed in parallel, is it currently executed in parallel or am I missing something?

Launching on multiple streams is generally helpful for overlapping host <==> device transfers with kernel execution, which is on the agenda of improvements to consider. It can be used for concurrent kernel execution, but, unless you're working with very small transforms (where you probably shouldn't be using cufinufft, or even finufft for...) it's unlikely that concurrent kernel execution will be particularly helpful. In most c…

Replies: 2 comments 7 replies

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
7 replies
@blackwer
Comment options

@chaithyagr
Comment options

@chaithyagr
Comment options

@chaithyagr
Comment options

@blackwer
Comment options

Answer selected by blackwer
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants