Argsort performance improvement #1859
Conversation
Instead of implementing argsort as a sort over (index, value) structures with a subsequent projection to the index, it is now implemented as a sort over the linear indices themselves, using a dereferencing comparator, followed by a mapping from linear index to row-wise index. On Iris Xe, an argsort call took 215 ms to argsort 5670000 elements of type int32, and it now takes 117 ms. The new implementation also has a smaller temporary allocation footprint: previously it would allocate 2*(sizeof(ValueT) + sizeof(IndexT)), now it only allocates sizeof(IndexT) for storing linear indices.
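The core idea can be sketched on the host with the standard library (a minimal illustration only; the actual dpctl kernels are SYCL device code, and the names here are illustrative): rather than building an array of (index, value) pairs and sorting those, we sort an array of linear indices with a comparator that dereferences into the value array.

```cpp
#include <algorithm>
#include <cstdint>
#include <numeric>
#include <vector>

// Sketch of argsort as a sort over linear indices with a dereferencing
// comparator. Only the index buffer is allocated; values are never moved.
std::vector<std::int64_t> argsort_1d(const std::vector<std::int32_t> &v)
{
    std::vector<std::int64_t> idx(v.size());
    std::iota(idx.begin(), idx.end(), 0); // linear indices 0..n-1

    // Dereferencing comparator: compares the values the indices point at,
    // so the sort permutes indices, not (index, value) structures.
    std::stable_sort(idx.begin(), idx.end(),
                     [&v](std::int64_t i, std::int64_t j) { return v[i] < v[j]; });
    return idx;
}
```

Because only `sizeof(IndexT)` per element is materialized (the index buffer), the temporary footprint shrinks relative to the pair-based scheme, which carried both a value and an index per element through the sort.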
Eliminate the use of temporary allocations altogether, cutting argsort execution time from 116 ms to 110 ms for a 5670000-element array of type int32_t.
Deleted rendered PR docs from intelpython.github.com/dpctl, latest should be updated shortly. 🤞
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_118 ran successfully.
Array API standard conformance tests for dpctl=0.19.0dev0=py310hdf72452_119 ran successfully.
The drop in coverage must be related to changes in the Coveralls analysis code. This PR did not change any of the files for which Coveralls reports reduced coverage.
This LGTM, the performance improvement is very nice. Thank you @oleksandr-pavlyk !
This change modifies the implementation of the tensor.argsort function, making it about 2x faster. Instead of implementing argsort as a sort over (index, value) structures with a subsequent projection to the index, it is now implemented as a sort over the linear indices themselves, using a dereferencing comparator, followed by a mapping from linear index to row-wise index.

On Iris Xe, a tensor.argsort call took 215 ms to find the sorting permutation for a vector of 5670000 elements of type int32, and it now takes 106 ms.

The new implementation no longer makes temporary allocations for storing indices. Previously, it would allocate 2*(sizeof(ValueT) + sizeof(IndexT)); now it just uses the output allocation.
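The final mapping step mentioned above can be illustrated with a small host-side sketch (an assumed layout for illustration, not the dpctl kernel itself): for a C-contiguous m-by-n array argsorted along the last axis, each linear index `li` reduces to the row-wise index `li % n`.

```cpp
#include <algorithm>
#include <cstdint>
#include <numeric>
#include <vector>

// Sketch: argsort each row of a C-contiguous m x n array by sorting linear
// indices per row with a dereferencing comparator, then map each linear
// index to its row-wise index via modulo by the row length.
std::vector<std::int64_t> argsort_rows(const std::vector<std::int32_t> &a,
                                       std::size_t m, std::size_t n)
{
    std::vector<std::int64_t> idx(m * n);
    std::iota(idx.begin(), idx.end(), 0); // linear indices 0..m*n-1

    for (std::size_t r = 0; r < m; ++r) {
        auto first = idx.begin() + r * n;
        std::stable_sort(first, first + n,
                         [&a](std::int64_t i, std::int64_t j) { return a[i] < a[j]; });
    }

    // Map linear index -> row-wise index in place: no extra buffer is needed,
    // which is why the result can live directly in the output allocation.
    for (auto &li : idx)
        li %= static_cast<std::int64_t>(n);
    return idx;
}
```

Performing the mapping in place over the already-sorted index buffer is what allows the implementation to reuse the output allocation rather than a temporary one.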
Tests exist.