Skip to content

Commit

Permalink
Adds support for large number of items to DeviceScan (NVIDIA#2171)
Browse files Browse the repository at this point in the history
* make DeviceScan offset type a template parameter

* updates tests to use device interface

* moves thrust scan to unsigned offset types

* adjusts benchmarks to account for used offset types

* uses dynamic dispatch to unsigned type

* adds tparam docs for NumItemsT

* fixes warning about different signedness comparison

* adds check for negative num_items in thrust::scan

* fixes unused param in is_negative
  • Loading branch information
elstehle authored and pciolkosz committed Aug 21, 2024
1 parent 0a27a16 commit 0be218f
Show file tree
Hide file tree
Showing 6 changed files with 159 additions and 92 deletions.
2 changes: 1 addition & 1 deletion cub/benchmarks/bench/scan/exclusive/base.cuh
Original file line number Diff line number Diff line change
Expand Up @@ -129,7 +129,7 @@ static void basic(nvbench::state& state, nvbench::type_list<T, OffsetT>)
});
}

using some_offset_types = nvbench::type_list<nvbench::int32_t>;
using some_offset_types = nvbench::type_list<nvbench::uint32_t, nvbench::uint64_t>;

NVBENCH_BENCH_TYPES(basic, NVBENCH_TYPE_AXES(all_types, some_offset_types))
.set_name("base")
Expand Down
Loading

0 comments on commit 0be218f

Please sign in to comment.