[SYCLomatic] Bugfix in non-trivial run length encode's usage of oneDPL's reduce-by-segment #2596

mmichel11 · 2025-01-03T16:53:13Z

The current non-trivial run length encode implementation is dependent on an internal implementation detail in oneDPL. The defined operator (named op) in the reduce_by_segment call is dependent on segments being processed serially by a single work-item. In particular, the get<2>(lhs) += get<0>(rhs); update is dependent on get<2>(rhs) having no run length information that needs to be propagated through the reduction. When the segment is reduced serially, this is always the case.

oneDPL's new reduce-by-segment performance improvements does not process segments serially but rather distributes work evenly throughout work items. Information regarding lengths of runs are lost in get<2>(rhs) when oneDPL's new sub-group scan is performed. To resolve this issue, the flag element in the tuple is changed from a bool to an integral type and is used to compute the length of the run instead of separating the flag from the run-length count. As a result, partial computations of the run-length in get<0>(rhs) are propagated through the reduction. Additional logic is required when defining the mask to ensure that all elements of the run are flagged, and the padded end case is properly handled.

Please note that this PR is dependent on uxlfoundation/oneDPL#1987 to compile.

The previous implementation was dependent on implementation details of oneDPL's reduce_by_segment. These adjustments fix this. Signed-off-by: Matthew Michel <matthew.michel@intel.com>

Signed-off-by: Matthew Michel <matthew.michel@intel.com>

tomflinda

LGMT

mmichel11 added 2 commits January 3, 2025 08:55

Hotfix non-trivial RLE

696927c

The previous implementation was dependent on implementation details of oneDPL's reduce_by_segment. These adjustments fix this. Signed-off-by: Matthew Michel <matthew.michel@intel.com>

Cleanup

d9455ef

Signed-off-by: Matthew Michel <matthew.michel@intel.com>

mmichel11 requested a review from a team as a code owner January 3, 2025 16:53

mmichel11 requested review from tomflinda and zhiweij1 January 3, 2025 16:53

zhiweij1 approved these changes Jan 6, 2025

View reviewed changes

tomflinda approved these changes Jan 6, 2025

View reviewed changes

zhimingwang36 approved these changes Jan 6, 2025

View reviewed changes

zhimingwang36 merged commit eb7646a into oneapi-src:SYCLomatic Jan 6, 2025
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCLomatic] Bugfix in non-trivial run length encode's usage of oneDPL's reduce-by-segment #2596

[SYCLomatic] Bugfix in non-trivial run length encode's usage of oneDPL's reduce-by-segment #2596

mmichel11 commented Jan 3, 2025

tomflinda left a comment

[SYCLomatic] Bugfix in non-trivial run length encode's usage of oneDPL's reduce-by-segment #2596

[SYCLomatic] Bugfix in non-trivial run length encode's usage of oneDPL's reduce-by-segment #2596

Conversation

mmichel11 commented Jan 3, 2025

tomflinda left a comment

Choose a reason for hiding this comment