[OpenCL] Modify fill emulation to work for patterns which are not powers of 2 #1603

keyradical · 2024-05-14T10:32:24Z

This is a follow-up of #1412 which added the isPowerOf2 condition to the OpenCL fill function. This is correct since clEnqueueMemFillINTEL_fn only accepts such patterns.

What was not correct was the later logic for emulating filling on the host and copying it to the destination ptr. It assumed that the pattern is greater than 128 bytes but after adding the above isPowerOf2 condition, it could also execute for smaller patterns which are simply not powers of 2.

So this PR fixes my introduced bugs, intel/llvm CI: intel/llvm#13779

… of 2

source/adapters/opencl/usm.cpp

…of 2 (#13779) oneapi-src/unified-runtime#1603

Modified host fill emulation to include patterns which are not powers…

b8e15e2

… of 2

keyradical requested a review from a team as a code owner May 14, 2024 10:32

keyradical mentioned this pull request May 14, 2024

[UR] Modify fill emulation to work for patterns which are not powers of 2 intel/llvm#13779

Merged

kbenzie reviewed May 14, 2024

View reviewed changes

source/adapters/opencl/usm.cpp Outdated Show resolved Hide resolved

source/adapters/opencl/usm.cpp Outdated Show resolved Hide resolved

Changed to uint8_t and improved fix as suggested

483a632

kbenzie approved these changes May 14, 2024

View reviewed changes

keyradical mentioned this pull request May 15, 2024

[SYCL][ABI-Break] Improve Queue fill intel/llvm#13788

Merged

keyradical added ready to merge Added to PR's which are ready to merge opencl OpenCL adapter specific issues labels May 17, 2024

kbenzie merged commit e16d01c into oneapi-src:main May 30, 2024

steffenlarsen pushed a commit to intel/llvm that referenced this pull request May 30, 2024

[UR] Modify fill emulation to work for patterns which are not powers …

e147f36

…of 2 (#13779) oneapi-src/unified-runtime#1603

keyradical mentioned this pull request Jun 4, 2024

Q.fill() improvements fail on gen12 intel/llvm#13787

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[OpenCL] Modify fill emulation to work for patterns which are not powers of 2 #1603

[OpenCL] Modify fill emulation to work for patterns which are not powers of 2 #1603

Uh oh!

keyradical commented May 14, 2024 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[OpenCL] Modify fill emulation to work for patterns which are not powers of 2 #1603

[OpenCL] Modify fill emulation to work for patterns which are not powers of 2 #1603

Uh oh!

Conversation

keyradical commented May 14, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

keyradical commented May 14, 2024 •

edited

Loading