Avoid extra memory copy when using cp.concatenate in cuml.dask kmeans #5937

Merged: 4 commits into rapidsai:branch-24.08 on Jul 8, 2024

Conversation

@dantegd (Member) commented on Jun 15, 2024

Partial solution for #5936

The issue was that concatenating when a worker held only a single array was causing a memory copy (not always, but often enough). This PR skips the concatenation when a worker has a single partition of data, as sketched below.
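Roughly, the change amounts to a guard like the following (a minimal sketch of the idea, not the actual diff; the function and variable names are illustrative):

import cupy as cp

def _to_single_array(parts):
    # parts: the list of CuPy array partitions held by one worker.
    # With exactly one partition, return it as-is instead of calling
    # cp.concatenate on a one-element list, which can trigger a copy.
    if len(parts) == 1:
        return parts[0]
    return cp.concatenate(parts)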

This stems from CuPy behavior: some testing reveals that it sometimes creates an extra allocation when concatenating a list that consists of a single array:

>>> import cupy as cp
>>> a = cp.random.rand(2000000, 250).astype(cp.float32)  # memory occupied: 5936 MB
>>> b = [a]
>>> c = cp.concatenate(b)  # memory occupied: 5936 MB <- no memory copy

>>> import cupy as cp
>>> a = cp.random.rand(1000000, 250)  # memory occupied: 2120 MB
>>> b = [a]
>>> c = cp.concatenate(b)  # memory occupied: 4028 MB <- a memory copy was performed!

I'm not sure what exact rules CuPy follows here (we could check), but avoiding the concatenate when we have a single partition is an easy fix that does not depend on behavior outside of cuML's code.
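If we ever want to verify CuPy's behavior directly, one quick way to tell whether concatenate reused the buffer or made a copy is to compare device pointers (a sketch, assuming a recent CuPy that provides cp.shares_memory):

>>> import cupy as cp
>>> a = cp.random.rand(1000, 250).astype(cp.float32)
>>> c = cp.concatenate([a])
>>> a.data.ptr == c.data.ptr  # True means the same device buffer was reused
>>> cp.shares_memory(a, c)    # False would indicate a copy was made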

cc @tfeher @cjnolet

@github-actions bot added the Cython / Python label on Jun 15, 2024
@dantegd changed the title from "void extra memory copy when using cp.concatenate" to "Avoid extra memory copy when using cp.concatenate in cuml.dask kmeans" on Jun 15, 2024
@dantegd added the bug and non-breaking labels on Jun 15, 2024
@dantegd marked this pull request as ready for review on Jun 15, 2024 at 23:07
@dantegd requested a review from a team as a code owner on Jun 15, 2024 at 23:07
@achirkin (Contributor) left a comment
Thanks, @dantegd, LGTM. This fixes one problem on the cuml side, but I believe it does not fully close #5936, because a double allocation could still happen when the data is being distributed among the workers. That, however, is potentially a problem in dask / zict rather than in cuml.

@achirkin (Contributor) left a follow-up comment

(I meant to approve, not comment, in my previous message.)

@tfeher (Contributor) left a comment

Thanks @dantegd for the fix, LGTM!

@dantegd (Member, Author) commented on Jun 24, 2024

/merge

@dantegd (Member, Author) commented on Jul 8, 2024

/merge

@rapids-bot merged commit 50ec050 into rapidsai:branch-24.08 on Jul 8, 2024
60 of 62 checks passed
Labels: bug (Something isn't working), Cython / Python (Cython or Python issue), non-breaking (Non-breaking change)
Projects: None yet
4 participants