Bugfix for 2767 - fix rf path trying to sample 0 columns #2788
Please update the changelog in order to start CI tests. View the gpuCI docs here.
Glad to see this is so clean!
Just curious why this only came up with the RF regressor, rather than both the RF classifier and regressor. Does the classifier segment have a guard against this? It seems like this is used downstream in both the `//regression` and `//classification` sections.
@beckernick |
Got it, makes sense. A silent failure is just as devious. It looks like a FIL test is now failing, but that seems unrelated to this code path. The same test is also failing in another PR (#2789), which further suggests it is unrelated.
cc @dantegd is this possibly an expected failure?
rerun tests
@beckernick PR #2787 disabled that test temporarily because there seems to be a small issue with that test and LightGBM 3.0.
Change lgtm
Ah, perfect. Thanks for the quick explanation 👍
In some situations, the number of columns multiplied by `max_features` produced a value that rounded to 0. That caused a zero-size memory block to be allocated, and `cudaMemsetAsync` then threw an invalid argument exception.
This PR sets a floor of 1 for `ncols_sampled`, which resolves the issue.
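
For illustration, a minimal sketch of the clamp is shown here. It is not the actual cuML code: the helper name `sampled_column_count` and its parameters are hypothetical, and it assumes the sampled count comes from an integer cast that truncates toward zero.

```cpp
#include <algorithm>

// Hypothetical helper, not the actual cuML implementation: computes how many
// columns to sample from the total column count and the max_features fraction.
inline int sampled_column_count(int n_cols, float max_features) {
  // Assumption: the count is obtained by an int cast, which truncates toward
  // zero, so 3 columns * 0.3f = 0.9f becomes 0.
  int n_sampled = static_cast<int>(max_features * n_cols);
  // Floor of 1 so that downstream buffers are never sized zero.
  return std::max(1, n_sampled);
}
```

With this clamp, 3 columns and a fraction of 0.3 yield 1 sampled column instead of 0, while inputs that already produce a positive count are unchanged.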
Closes #2767