@keyradical
Contributor

@keyradical keyradical commented May 14, 2024

This is a follow-up to #1412, which added the isPowerOf2 condition to the OpenCL fill function. That condition is correct, since clEnqueueMemFillINTEL_fn only accepts patterns whose size is a power of 2.

What was not correct was the later logic that emulates the fill on the host and copies the result to the destination ptr. It assumed that the pattern is larger than 128 bytes, but once the isPowerOf2 condition above was added, it could also run for smaller patterns whose size is simply not a power of 2.
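To illustrate the two paths described above, here is a minimal sketch, not the adapter's actual code. The helper names (`canUseNativeFill`, `buildHostFill`, `kMaxNativePatternSize`) are hypothetical, and the 128-byte limit is taken from the description above. The point is that the host emulation must tile the pattern correctly for any pattern size, including small sizes that are not powers of 2.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <vector>

// Hypothetical helper: true when n is a non-zero power of two.
inline bool isPowerOf2(std::size_t n) { return n != 0 && (n & (n - 1)) == 0; }

// Assumed limit, taken from the PR description (128 bytes).
constexpr std::size_t kMaxNativePatternSize = 128;

// Hypothetical predicate: the native fill is only usable for
// power-of-2 pattern sizes up to the assumed limit.
inline bool canUseNativeFill(std::size_t patternSize) {
  return isPowerOf2(patternSize) && patternSize <= kMaxNativePatternSize;
}

// Host-side emulation: tile `pattern` into a buffer of `size` bytes.
// Makes no assumption about patternSize beyond it dividing `size`,
// so it works for e.g. 3-byte patterns as well as 256-byte ones.
inline std::vector<unsigned char> buildHostFill(const void *pattern,
                                                std::size_t patternSize,
                                                std::size_t size) {
  assert(pattern != nullptr && patternSize > 0);
  assert(size % patternSize == 0);
  std::vector<unsigned char> host(size);
  for (std::size_t offset = 0; offset < size; offset += patternSize) {
    std::memcpy(host.data() + offset, pattern, patternSize);
  }
  return host;
}
```

In this sketch, a caller would check `canUseNativeFill(patternSize)` first and otherwise copy the buffer returned by `buildHostFill` to the destination pointer.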

This PR fixes the bugs I introduced. intel/llvm CI: intel/llvm#13779

@keyradical keyradical added ready to merge Added to PR's which are ready to merge opencl OpenCL adapter specific issues labels May 17, 2024
@kbenzie kbenzie merged commit e16d01c into oneapi-src:main May 30, 2024