CUB
The operations exposed by WarpReduce require a temporary memory allocation of this nested type for thread communication. This opaque storage can be allocated directly using the __shared__ keyword. Alternatively, it can be aliased to externally allocated memory (shared or global) or union'd with other storage allocation types to facilitate memory reuse.
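
The first option, direct allocation with __shared__, might look like the following minimal sketch. The kernel name WarpSumKernel, the host helper RunWarpSums, and the constants kWarpsPerBlock and kThreadsPerBlock are hypothetical names chosen for illustration; the sketch also assumes the default 32-thread logical warp and omits CUDA error checking for brevity.

```cuda
#include <cub/cub.cuh>
#include <cuda_runtime.h>
#include <vector>

// Illustrative launch shape (assumption): 4 warps of 32 threads per block.
constexpr int kWarpsPerBlock   = 4;
constexpr int kThreadsPerBlock = kWarpsPerBlock * 32;

__global__ void WarpSumKernel(const int* in, int* out)
{
    // Specialize WarpReduce for int (default logical warp size of 32).
    using WarpReduce = cub::WarpReduce<int>;

    // One opaque TempStorage per warp, allocated directly in shared memory.
    __shared__ typename WarpReduce::TempStorage temp_storage[kWarpsPerBlock];

    const int warp_id = threadIdx.x / 32;
    const int lane_id = threadIdx.x % 32;
    const int thread_data = in[blockIdx.x * blockDim.x + threadIdx.x];

    // Each warp reduces using its own slice of temporary storage.
    // The returned aggregate is only valid in lane 0 of the warp.
    const int aggregate = WarpReduce(temp_storage[warp_id]).Sum(thread_data);

    if (lane_id == 0)
        out[blockIdx.x * kWarpsPerBlock + warp_id] = aggregate;
}

// Hypothetical host helper: launches one block over an all-ones input
// and returns the per-warp sums.
std::vector<int> RunWarpSums()
{
    std::vector<int> h_in(kThreadsPerBlock, 1);
    std::vector<int> h_out(kWarpsPerBlock, 0);

    int* d_in  = nullptr;
    int* d_out = nullptr;
    cudaMalloc(&d_in,  kThreadsPerBlock * sizeof(int));
    cudaMalloc(&d_out, kWarpsPerBlock * sizeof(int));
    cudaMemcpy(d_in, h_in.data(), kThreadsPerBlock * sizeof(int),
               cudaMemcpyHostToDevice);

    WarpSumKernel<<<1, kThreadsPerBlock>>>(d_in, d_out);

    // This blocking copy also waits for the kernel to finish.
    cudaMemcpy(h_out.data(), d_out, kWarpsPerBlock * sizeof(int),
               cudaMemcpyDeviceToHost);

    cudaFree(d_in);
    cudaFree(d_out);
    return h_out;
}
```

With an all-ones input, each of the four entries returned by RunWarpSums should equal 32, the number of threads in a warp. Indexing temp_storage by warp_id gives every warp a private allocation, so the warps do not need to synchronize with each other around the reduction.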