This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
-
Notifications
You must be signed in to change notification settings - Fork 447
Pull requests: NVIDIA/cub
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Draft of segmented reduce optimization
P2: nice to have
Desired, but not necessary.
#578
opened Sep 30, 2022 by
gevtushenko
Loading…
Wrap launch bounds
testing: gpuCI in progress
Started gpuCI testing.
type: bug: compiler
Bug in a compiler, not this library.
add support FutureValue for reduce
P2: nice to have
Desired, but not necessary.
type: enhancement
New feature or request.
[WIP] Allow cub::DeviceRadixSort and cub::DeviceSegmentedRadixSort to use iterator as input
helps: pytorch
Helps or needed by PyTorch.
P3: backlog
Unprioritized
Add assignment operator to the TestBar test util class.
P2: nice to have
Desired, but not necessary.
triage
Needs investigation and classification.
fix 'invalid arguments' warp sync error on Volta
info needed
Cannot make progress without more information.
P1: should have
Necessary, but not critical.
repro: missing
Missing a complete example that reproduces the issue.
type: bug: functional
Does not work as intended.
ProTip!
Follow long discussions with comments:>50.