[EPIC] Implement VSMem abstraction #548

gevtushenko · 2023-05-22T11:34:06Z

Porting of Thrust algorithms to CUB is blocked by a lack of VSMem abstraction. In Thrust, when a thread block can't fit temporary storage in the default shared memory size, it'll switch to using per-CTA global memory tiles. Introducing a pointer to shared memory in CUB kernels leads to performance regressions because generic loads are used instead of shared or global ones.

To avoid regressions, we must implement functionality allowing the processing of user-defined types of any size. It doesn't have to match the agent approach from Thrust or use global memory as long as the requirements are satisfied.

Tasks

Give feedback

[EPIC] Design a scheme allowing CUB to process user-defined types of any size #612

7 of 7
Integrate new VShmem facility into DeviceMergeSort #549
Integrate this approach to unique by key ([BUG]: Unique by key doesn't use allocated vsmem #159)
Options

The text was updated successfully, but these errors were encountered:

elstehle · 2024-01-24T18:07:06Z

Closing this EPIC, as the last task of this EPIC was completed with PR #1197

jrhemstad mentioned this issue Oct 25, 2023

[EPIC] Design a scheme allowing CUB to process user-defined types of any size #612

Closed

gevtushenko mentioned this issue Jul 3, 2023

[BUG]: Unique by key doesn't use allocated vsmem #159

Closed

1 task

jrhemstad changed the title ~~Implement VSMem abstraction~~ [EPIC] Implement VSMem abstraction Aug 9, 2023

jrhemstad assigned elstehle Aug 9, 2023

gevtushenko mentioned this issue Aug 15, 2023

[BUG]: thrust::min_element / max_element fail with numItems close to int_max #330

Open

1 task

github-project-automation bot added this to CCCL Oct 11, 2023

github-project-automation bot moved this to Todo in CCCL Oct 11, 2023

jrhemstad transferred this issue from NVIDIA/cub Oct 11, 2023

This was referenced Oct 16, 2023

[EPIC] Consolidate kernels between Thrust and CUB #26

Open

Port triple_chevron #568

Open

elstehle closed this as completed Jan 24, 2024

github-project-automation bot moved this from Todo to Done in CCCL Jan 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[EPIC] Implement VSMem abstraction #548

[EPIC] Implement VSMem abstraction #548

gevtushenko commented May 22, 2023 •

edited by elstehle

Loading

Tasks

elstehle commented Jan 24, 2024

[EPIC] Implement VSMem abstraction #548

[EPIC] Implement VSMem abstraction #548

Comments

gevtushenko commented May 22, 2023 • edited by elstehle Loading

Tasks

elstehle commented Jan 24, 2024

gevtushenko commented May 22, 2023 •

edited by elstehle

Loading