NaderAlAwar
Is this a duplicate?
Area
cuda.parallel (Python)
Is your feature request related to a problem? Please describe.
Consider the reduce algorithm. The underlying C library exposes a stream argument. On the Python side, however, we don't accept a corresponding stream argument and always pass None (nullptr, or "default stream") to the C API. Thus, Python users are unable to take advantage of concurrency via CUDA streams.
Describe the solution you'd like
We should change the Python API to accept a stream argument, and pass that to the underlying C library.
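As a rough illustration of the forwarding step, here is a minimal sketch of how a Python wrapper might turn a user-supplied stream into the raw handle the C API expects. The helper name resolve_stream_handle and the FakeStream class are hypothetical and not part of cuda.parallel. The sketch also assumes that __cuda_stream__ yields a (version, handle) pair, either as a method or as a plain attribute; that convention should be checked against the cuda.core documentation before relying on it.

```python
from typing import Any, Optional


def resolve_stream_handle(stream: Optional[Any]) -> int:
    """Hypothetical helper: convert a user-supplied stream to a raw handle.

    None maps to 0, i.e. the null pointer / default stream, which matches
    what the Python layer passes to the C API today.
    """
    if stream is None:
        return 0

    attr = getattr(stream, "__cuda_stream__", None)
    if attr is None:
        raise TypeError(
            f"{type(stream).__name__} does not implement __cuda_stream__"
        )

    # Assumption: the protocol yields (protocol_version, handle).
    # Accept both a method and a plain attribute/property.
    info = attr() if callable(attr) else attr
    try:
        _version, handle = info
    except (TypeError, ValueError) as exc:
        raise TypeError(
            "__cuda_stream__ must yield a (version, handle) pair"
        ) from exc

    if not isinstance(handle, int):
        raise TypeError("stream handle must be an int")
    return handle


class FakeStream:
    """Stand-in stream used only to exercise the helper without a GPU."""

    def __init__(self, handle: int) -> None:
        self._handle = handle

    def __cuda_stream__(self):
        return (0, self._handle)
```

With something like this in place, an algorithm entry point could take stream=None as a keyword argument and pass resolve_stream_handle(stream) to the C library, so existing callers keep the default-stream behavior.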
What should the argument type be?
There are a few options here, which I'm ordering from least to most preferred (in my opinion):
cuda.core.Stream
object.__cuda_stream__ protocol - this would automatically include (3) without an explicit dependency on cuda.core.
Describe alternatives you've considered
No response
Additional context
No response