This repository was archived by the owner on Mar 21, 2024. It is now read-only.
File tree Expand file tree Collapse file tree 2 files changed +10
-8
lines changed Expand file tree Collapse file tree 2 files changed +10
-8
lines changed Original file line number Diff line number Diff line change 1
- # CUB 1.15.0 (NVIDIA HPC SDK 21.11)
1
+ # CUB 1.15.0
2
2
3
3
## Summary
4
4
5
- CUB 1.15.0 accompanies the NVIDIA HPC SDK 21.11 release. It includes a
6
- new ` cub::DeviceSegmentedSort ` algorithm, which demonstrates up to 5000x speedup
7
- compared to ` cub::DeviceSegmentedRadixSort ` when sorting a large number of small
8
- segments. A new ` cub::FutureValue<T> ` helper allows the ` cub::DeviceScan `
9
- algorithms to lazily load the ` initial_value ` from a pointer. ` cub::DeviceScan `
10
- also added ` ScanByKey ` functionality.
5
+ CUB 1.15.0 includes a new ` cub::DeviceSegmentedSort ` algorithm, which
6
+ demonstrates up to 5000x speedup compared to ` cub::DeviceSegmentedRadixSort `
7
+ when sorting a large number of small segments. A new ` cub::FutureValue<T> `
8
+ helper allows the ` cub::DeviceScan ` algorithms to lazily load the
9
+ ` initial_value ` from a pointer. ` cub::DeviceScan ` also added ` ScanByKey `
10
+ functionality.
11
11
12
12
The new ` DeviceSegmentedSort ` algorithm partitions segments into size groups.
13
13
Each group is processed with specialized kernels using a variety of sorting
@@ -84,6 +84,8 @@ kernel launches.
84
84
(@matt-stack) for this contribution.
85
85
- NVIDIA/cub#388: Fix debug assertion on MSVC when using
86
86
`cub::CachingDeviceAllocator`.
87
+ - NVIDIA/cub#395: Support building with `__CUDA_NO_HALF_CONVERSIONS__`. Thanks
88
+ to Xiang Gao (@zasdfgbnm) for this contribution.
87
89
88
90
# CUB 1.14.0 (NVIDIA HPC SDK 21.9)
89
91
Original file line number Diff line number Diff line change @@ -100,7 +100,7 @@ See the [changelog](CHANGELOG.md) for details about specific releases.
100
100
101
101
| CUB Release | Included In |
102
102
| ------------------------- | --------------------------------------- |
103
- | 1.15.0 | NVIDIA HPC SDK 21.11 |
103
+ | 1.15.0 | TBD |
104
104
| 1.14.0 | NVIDIA HPC SDK 21.9 |
105
105
| 1.13.1 | CUDA Toolkit 11.5 |
106
106
| 1.13.0 | NVIDIA HPC SDK 21.7 |
You can’t perform that action at this time.
0 commit comments