Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[EPIC] Implement VSMem abstraction #548

Closed
3 tasks done
gevtushenko opened this issue May 22, 2023 · 1 comment
Closed
3 tasks done

[EPIC] Implement VSMem abstraction #548

gevtushenko opened this issue May 22, 2023 · 1 comment
Assignees

Comments

@gevtushenko
Copy link
Collaborator

gevtushenko commented May 22, 2023

Porting of Thrust algorithms to CUB is blocked by a lack of VSMem abstraction. In Thrust, when a thread block can't fit temporary storage in the default shared memory size, it'll switch to using per-CTA global memory tiles. Introducing a pointer to shared memory in CUB kernels leads to performance regressions because generic loads are used instead of shared or global ones.

To avoid regressions, we must implement functionality allowing the processing of user-defined types of any size. It doesn't have to match the agent approach from Thrust or use global memory as long as the requirements are satisfied.

Tasks

Preview Give feedback
  1. 7 of 7
    elstehle
  2. elstehle
@elstehle
Copy link
Collaborator

Closing this EPIC, as the last task of this EPIC was completed with PR #1197

@github-project-automation github-project-automation bot moved this from Todo to Done in CCCL Jan 24, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Archived in project
Development

No branches or pull requests

2 participants