[Grouped Matmul] Fix PyTorch memory leak when tensors are not contiguous #290

Jokeren · 2024-01-06T01:55:39Z

For example, if other[i] was transposed, a new tensor is created in the loop of for (size_t i = 0; i < num_matrices; ++i) {.
The data_ptr of the tensor newly created will be used by a kernel following, but this tensor itself may get released before the kernel launch.

codecov · 2024-01-06T02:01:17Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (160d5c3) 86.47% compared to head (84b4324) 86.47%.

❗ Current head 84b4324 differs from pull request most recent head 11be60f. Consider uploading reports for the commit 11be60f to get more accurate results

Additional details and impacted files

@@           Coverage Diff           @@
##           master     #290   +/-   ##
=======================================
  Coverage   86.47%   86.47%           
=======================================
  Files          35       35           
  Lines        1213     1213           
=======================================
  Hits         1049     1049           
  Misses        164      164

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Jokeren · 2024-01-06T02:03:32Z

@rusty1s

Partial credit goes to my student @Karthikg99 who reported the problem.

rusty1s

Thank you :)

CHANGELOG.md

Jokeren added 2 commits January 5, 2024 20:23

Update matmul_kernel.cu

af92b88

Update test_matmul.py

bbc43f6

Update CHANGELOG.md

eecdf2e

Update CHANGELOG.md

84b4324

Jokeren force-pushed the patch-1 branch from 2b24aec to 84b4324 Compare January 7, 2024 00:00

rusty1s approved these changes Jan 7, 2024

View reviewed changes

rusty1s assigned Jokeren Jan 7, 2024

rusty1s added 0 - Priority P0 bug ops labels Jan 7, 2024

rusty1s reviewed Jan 7, 2024

View reviewed changes

CHANGELOG.md Outdated Show resolved Hide resolved

Update CHANGELOG.md

11be60f

rusty1s merged commit 92c99d9 into pyg-team:master Jan 7, 2024
8 of 10 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Grouped Matmul] Fix PyTorch memory leak when tensors are not contiguous #290

[Grouped Matmul] Fix PyTorch memory leak when tensors are not contiguous #290

Jokeren commented Jan 6, 2024

codecov bot commented Jan 6, 2024 •

edited

Loading

Jokeren commented Jan 6, 2024

rusty1s left a comment

[Grouped Matmul] Fix PyTorch memory leak when tensors are not contiguous #290

[Grouped Matmul] Fix PyTorch memory leak when tensors are not contiguous #290

Conversation

Jokeren commented Jan 6, 2024

codecov bot commented Jan 6, 2024 • edited Loading

Codecov Report

Jokeren commented Jan 6, 2024

rusty1s left a comment

Choose a reason for hiding this comment

codecov bot commented Jan 6, 2024 •

edited

Loading