Enable fractional null probability for hashing benchmark #13967
In the past, the `HASHING_NVBENCH` benchmark treated the `nulls` parameter as a boolean: any value other than 0.0 resulted in a null probability of 100% for the generated data. Now, the `nulls` parameter directly determines the null probability. For instance, a value of 0.1 generates data in which 10% of the rows are null. Moreover, setting `nulls` to 0.0 produces data without a null bitmask. Additionally, `bytes_per_second` is added to the benchmark. This patch relates to rapidsai#13735.
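To illustrate the new semantics, here is a minimal, standard-library-only sketch. It is not the code from this PR: `null_probability_from_axis` and `make_validity_mask` are hypothetical names, and the mask is modeled as a plain `std::vector<bool>` rather than a device bitmask. It only shows the mapping described above: 0.0 means no mask at all, and any other value is used directly as the per-row null probability.

```cpp
#include <algorithm>
#include <cstddef>
#include <optional>
#include <random>
#include <vector>

// Hypothetical helper: maps the benchmark's `nulls` axis value to an optional
// null probability. std::nullopt stands for "do not create a null bitmask".
std::optional<double> null_probability_from_axis(double nulls)
{
  if (nulls > 0.0) { return std::min(nulls, 1.0); }  // clamp to a valid probability
  return std::nullopt;
}

// Hypothetical generator: returns a validity mask (true = valid row) with
// `num_rows` entries, or std::nullopt when no mask should be allocated.
std::optional<std::vector<bool>> make_validity_mask(std::size_t num_rows,
                                                    double nulls,
                                                    unsigned seed = 42)
{
  auto const probability = null_probability_from_axis(nulls);
  if (!probability.has_value()) { return std::nullopt; }

  std::mt19937 engine{seed};
  std::bernoulli_distribution is_null{*probability};

  std::vector<bool> mask(num_rows);
  for (std::size_t i = 0; i < num_rows; ++i) {
    mask[i] = !is_null(engine);  // a row is valid when it was not drawn as null
  }
  return mask;
}
```

Under these assumptions, `make_validity_mask(n, 0.0)` returns no mask, `make_validity_mask(n, 1.0)` marks every row null, and `make_validity_mask(n, 0.1)` marks roughly 10% of the rows null.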
Hi, I wasn't sure about this one, thus the draft. For me, it looked like the Hope that makes sense, if not I will just remove my changes and just add the |
Yes. It looks like you fixed a bug here. |
/ok to test |
Several typos; otherwise this looks great. @Blonck, can you please also paste the console output of those benchmarks in the PR, just for reference?
Co-authored-by: Yunsong Wang <yunsongw@nvidia.com>
Sure @PointKernel, here is the log. Please note that I'm currently using WSL to compile and run the code. Therefore, the performance metrics might not be fully representative. I'm uncertain about the extent to which WSL/Windows affects performance. At least it significantly impacts compile time :).
/ok to test |
LGTM
/ok to test |
Code review suggestions. Co-authored-by: David Wendt <45795991+davidwendt@users.noreply.github.com>
/ok to test |
/ok to test |
/merge |