This repository has been archived by the owner on Mar 21, 2024. It is now read-only.
Commit
Update DeviceScan to pass Thrust's scan tests.
Thrust will be switching to `cub::DeviceScan` to replace its custom scan implementation. This patch addresses some issues found by the Thrust tests:

- Initialize unused `BlockLoad` items with values known to be in the input set. This fixes the `TestInclusiveScanWithIndirection` Thrust test by keeping the `plus_mod3` functor indices valid.
- Use `OffsetT` instead of `int` to hold indices in `AgentScan`. This fixes the `Test*ScanWithBigIndexes` Thrust tests by not truncating the input problem size.
- Use `BLOCK_[STORE|LOAD]_WARP_TRANSPOSED_TIMESLICED` instead of `BLOCK_[STORE|LOAD]_WARP_TRANSPOSED` when the intermediate type is larger than 128 bytes. This keeps shared memory buffers from growing too large in the `TestScanWithLargeTypes` Thrust test.
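The three fixes can be illustrated with a small host-side C++ sketch. This is not the code from the patch: `LoadAlgorithm`, `select_load_algorithm`, `num_tiles`, `load_partial_tile`, and `Large` are hypothetical names introduced here for illustration, and the real changes live in CUB's device code. The sketch only shows the idea behind each bullet: padding a partial tile with a value taken from the input, keeping index arithmetic in the caller's offset type, and choosing a timesliced algorithm at compile time for types larger than 128 bytes.

```cpp
#include <array>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-ins for the two load/store algorithm families
// named in the commit message; these are not CUB's own enumerators.
enum class LoadAlgorithm { WarpTransposed, WarpTransposedTimesliced };

// Pick the timesliced variant when the intermediate type is larger
// than 128 bytes, so staging buffers stay small (assumed threshold
// taken from the commit message).
template <typename T>
constexpr LoadAlgorithm select_load_algorithm() {
    return sizeof(T) > 128 ? LoadAlgorithm::WarpTransposedTimesliced
                           : LoadAlgorithm::WarpTransposed;
}

// Compute the number of tiles in the caller's offset type. Doing this
// in a 32-bit type would drop the high bits of a large problem size.
template <typename OffsetT>
constexpr OffsetT num_tiles(OffsetT num_items, OffsetT tile_items) {
    return (num_items + tile_items - 1) / tile_items;
}

// Load a partial tile. Slots past valid_items are filled with the
// first input element, so every slot holds a value that really occurs
// in the input, even if a functor later touches the padding.
template <typename T, std::size_t TileItems>
std::array<T, TileItems> load_partial_tile(const std::vector<T>& input,
                                           std::size_t valid_items) {
    assert(valid_items > 0 && valid_items <= TileItems &&
           valid_items <= input.size());
    std::array<T, TileItems> tile;
    tile.fill(input[0]);  // padding comes from the input set
    for (std::size_t i = 0; i < valid_items; ++i) {
        tile[i] = input[i];
    }
    return tile;
}

// Hypothetical "large" intermediate type, bigger than 128 bytes.
struct Large { char bytes[256]; };
```

For example, under these assumptions `num_tiles<std::int64_t>` applied to a problem of 2^32 + 5 items with 128-item tiles yields 33554433 tiles, whereas the same item count narrowed to 32 bits would look like a 5-item problem.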
1 parent 200cf19
commit 7e6f33b
Showing 2 changed files with 34 additions and 6 deletions.