fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

MengAiDev · 2025-08-26T04:21:51Z

Add stream->wait() to ensure all kernels finish execution before proceeding
This resolves potential race conditions in the argsort operation

- Add `stream->wait()` to ensure all kernels finish execution before proceeding - This resolves potential race conditions in the argsort operation

simonlui · 2025-08-26T05:31:32Z

@MengAiDev The closing brace for the function is missing so it fails to compile when I tried to check out the branch. I added an extra line to close it with } and it works.

NeoZhangJianyu · 2025-08-27T01:59:03Z

#15580 support on iGPU.
Could you check if dGPU has this issue?
if no, maybe add the condition to check the iGPU and add wait() for iGPU only.

It could reduce the protentional risk to dGPU.

simonlui · 2025-08-27T02:06:57Z

@NeoZhangJianyu I have an Intel Arc A770 16GB and can confirm the issue existed on my dGPU too. This is a snippet from the backtrace I posted in the issue.
/home/simonlui/Code_Repositories/llama-cpp-python/vendor/llama.cpp/ggml/src/ggml-sycl/ggml-sycl.cpp:3380: GGML_ASSERT(row_id_i >= 0 && row_id_i < n_as) failed
Same assert error as iGPU.

NeoZhangJianyu · 2025-08-27T03:06:32Z

@NeoZhangJianyu I have an Intel Arc A770 16GB and can confirm the issue existed on my dGPU too. This is a snippet from the backtrace I posted in the issue. /home/simonlui/Code_Repositories/llama-cpp-python/vendor/llama.cpp/ggml/src/ggml-sycl/ggml-sycl.cpp:3380: GGML_ASSERT(row_id_i >= 0 && row_id_i < n_as) failed Same assert error as iGPU.

OK! Thank you for your feedback!
It's OK to me!

MengAiDev · 2025-08-28T00:28:56Z

I have fix the }

NeoZhangJianyu · 2025-10-17T02:22:47Z

@MengAiDev
Sorry for delayed reply!
I have thought other maintainer will merge this PR.
But some maintainers won't focus on SYCL backend now.

I will continue to support SYCL backend.

I test this PR, it's passed.
But as my experiment, adding wait() will reduce a little performance.
And it will break the SYCL graph feature. (This feature is pending for other issue).

I have created a PR to fix argsort OP too: #16521.
Could it fix the issue #15580?

If the merged PR (#16521) could resolved the issue, I suggest not merging this PR.

How do you think?

Thank you!

fix(ggml-sycl): add synchronization before exiting argsort kernel

ce79ded

- Add `stream->wait()` to ensure all kernels finish execution before proceeding - This resolves potential race conditions in the argsort operation

github-actions bot added ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language labels Aug 26, 2025

NeoZhangJianyu approved these changes Aug 27, 2025

View reviewed changes

fix

afb6f45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

Uh oh!

MengAiDev commented Aug 26, 2025 •

edited

Loading

Uh oh!

simonlui commented Aug 26, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

simonlui commented Aug 27, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

MengAiDev commented Aug 28, 2025

Uh oh!

NeoZhangJianyu commented Oct 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

Are you sure you want to change the base?

fix(ggml-sycl): add synchronization before exiting argsort kernel #15582

Uh oh!

Conversation

MengAiDev commented Aug 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

simonlui commented Aug 26, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

simonlui commented Aug 27, 2025

Uh oh!

NeoZhangJianyu commented Aug 27, 2025

Uh oh!

MengAiDev commented Aug 28, 2025

Uh oh!

NeoZhangJianyu commented Oct 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

MengAiDev commented Aug 26, 2025 •

edited

Loading