-
Notifications
You must be signed in to change notification settings - Fork 541
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
PowerTransformer
, QuantileTransformer
and KernelCenterer
#4755
PowerTransformer
, QuantileTransformer
and KernelCenterer
#4755
Conversation
PowerTransformer
, QuantileTransformer
and KernelCenterer
Can you rebase or merge the latest changes? It seems that we're getting tagged to review code changes that were merged in previous PRs. |
Sorry for that. |
@viclafargue, no problem. I've just resolved the conflicts in that PR and merged it. The branches should be in sync now. |
rerun tests |
This will also close #2035 , right? |
Thanks for noticing me of this issue. The version of the |
a574900
to
27dab01
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, can you add them to docs/source/api.rst
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
This PR has been labeled |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
codeowner approval
self.quantiles_.append( | ||
cpu_np.nanpercentile(np.asnumpy(col), references) | ||
) | ||
self.quantiles_ = cpu_np.transpose(self.quantiles_) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why are we doing the transpose with NumPy and not CuPy?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
self.quantiles_
data is stored on host, the result of the transpose
operation is then again used on host by numpy.maximum.accumulate
. I considered that the transposition on device wasn't worth the transfer. Which might arguably be wrong on very large datasets... Do you think we should be using CuPy there knowing this information?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@viclafargue it might be good to do benchmarking later to be sure when it could be a bottleneck. In the meantime, I’ll go ahead and merge this PR
Codecov ReportBase: 78.02% // Head: 79.68% // Increases project coverage by
Additional details and impacted files@@ Coverage Diff @@
## branch-22.10 #4755 +/- ##
================================================
+ Coverage 78.02% 79.68% +1.65%
================================================
Files 180 180
Lines 11385 11457 +72
================================================
+ Hits 8883 9129 +246
+ Misses 2502 2328 -174
Flags with carried forward coverage won't be shown. Click here to find out more.
Help us with your feedback. Take ten seconds to tell us how you rate us. Have a feature suggestion? Share it here. ☔ View full report at Codecov. |
@gpucibot merge |
…sai#4755) Closes rapidsai#4622 Authors: - Victor Lafargue (https://github.com/viclafargue) Approvers: - Micka (https://github.com/lowener) - Dante Gama Dessavre (https://github.com/dantegd) URL: rapidsai#4755
Closes #4622