Skip to content

Commit

Permalink
fixed global memory read calculation for ast benchmark
Browse files Browse the repository at this point in the history
Co-authored-by: Yunsong Wang <yunsongw@nvidia.com>
  • Loading branch information
lamarrr and PointKernel committed Sep 30, 2024
1 parent 00c8189 commit ca95756
Showing 1 changed file with 1 addition and 2 deletions.
3 changes: 1 addition & 2 deletions cpp/benchmarks/ast/transform.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -89,8 +89,7 @@ static void BM_ast_transform(nvbench::state& state)
[&](nvbench::launch&) { cudf::compute_column(table, expression_tree_root); });

// Use the number of bytes read from global memory
state.add_global_memory_reads(static_cast<int64_t>(state.get_summaries().size()) * table_size *
(tree_levels + 1) * sizeof(key_type));
state.add_global_memory_reads<key_type>(table_size * (tree_levels + 1));
}

#define AST_TRANSFORM_BENCHMARK_DEFINE(name, key_type, tree_type, reuse_columns, nullable) \
Expand Down

0 comments on commit ca95756

Please sign in to comment.