[SYCL][CUDA][HIP][PI] Fix barrier #6490

t4c1 · 2022-07-29T09:15:59Z

Fixes a bug in barrier implementation in CUDA and HIP plugins that often caused barrier not to work. The new implementation is also faster.

Tests in: intel/llvm-test-suite#1122

t4c1 · 2022-08-17T10:59:15Z

I think none of these failures are actually related to the changes in this PR.

steffenlarsen · 2022-09-05T20:41:56Z

sycl/plugins/cuda/pi_cuda.cpp

@@ -381,8 +381,25 @@ pi_result cuda_piEventRetain(pi_event event);

 /// \endcond

+void _pi_queue::compute_stream_wait_for_barrier_if_needed(CUstream stream,
+                                                          pi_uint32 stream_i) {
+  if (barrier_event_ && !compute_applied_barrier_[stream_i]) {


Would it make sense to check if the barrier event has finished prior to this so we can potentially clear it and skip the cuStreamWaitEvent call?

You are proposing an additional CUDA API call for potentially every operation that might let us eliminate additional calls in the future. Whether this makes sense to do or not depends on whether we expect most streams will have work enqueued to them before or after all the work before the barrier is finished. If the answer is before, the current code will be more performant. If the answer is after, we want to do what you suggest.

My gut feeling says current implementation will be better for most use cases. However the reasoning from the previous paragraph assumes that cuStreamWaitEvent and cuEventQuery take roughly the same time. Looking into that could give us more information that could inform this decision. However, for now I would leave this as it is.

I agree, this is potentially more work up-front, though I would expect cuEventQuery to be somewhat lightweight, but doing some benchmarking for it may make sense before a final decision is made on this. I am okay to keep as-is. 😄

sycl/plugins/cuda/pi_cuda.cpp

steffenlarsen

LGTM!

AerialMantis · 2022-10-06T13:03:37Z

@steffenlarsen @smaslov-intel is this okay to merge now or are there further reviews we should request? I also see there is a failure for the ESIMD job, I assume this is unrelated.

steffenlarsen · 2022-10-06T13:14:24Z

ESIMD failures seem to happen on other PRs as well. Lets merge this.

Improves the test for barrier to make it actually fail if the barrier implementation does not work. Tests intel/llvm#6490

…m-test-suite#1122) Improves the test for barrier to make it actually fail if the barrier implementation does not work. Tests intel#6490

t4c1 added 3 commits July 28, 2022 13:37

fixed barrier

f26d6aa

Merge branch 'sycl' into fix_barrier

8228e30

added hip chenges

ee10465

t4c1 requested review from a team as code owners July 29, 2022 09:16

t4c1 requested a review from smaslov-intel July 29, 2022 09:16

t4c1 mentioned this pull request Jul 29, 2022

[SYCL][HIP] Improve test for barrier and enable it for HIP intel/llvm-test-suite#1122

Merged

t4c1 added 3 commits July 29, 2022 05:20

bugfix HIP implementation

ae97b36

format

f1a3ad5

remove accidental changes to CMake

e30c582

t4c1 added 2 commits August 8, 2022 10:13

Merge branch 'sycl' into fix_barrier

9b825bc

Merge branch 'sycl' into fix_barrier

7e31805

steffenlarsen reviewed Sep 6, 2022

View reviewed changes

steffenlarsen reviewed Sep 16, 2022

View reviewed changes

sycl/plugins/cuda/pi_cuda.cpp Show resolved Hide resolved

add comments about how barrier now works

5cb944e

steffenlarsen approved these changes Oct 6, 2022

View reviewed changes

steffenlarsen merged commit 1c3d598 into intel:sycl Oct 6, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][CUDA][HIP][PI] Fix barrier #6490

[SYCL][CUDA][HIP][PI] Fix barrier #6490

Uh oh!

t4c1 commented Jul 29, 2022 •

edited

Loading

Uh oh!

t4c1 commented Aug 17, 2022

Uh oh!

steffenlarsen Sep 5, 2022

Uh oh!

t4c1 Sep 7, 2022

Uh oh!

steffenlarsen Sep 14, 2022

Uh oh!

Uh oh!

Uh oh!

steffenlarsen left a comment

Uh oh!

AerialMantis commented Oct 6, 2022

Uh oh!

steffenlarsen commented Oct 6, 2022

Uh oh!

Uh oh!

[SYCL][CUDA][HIP][PI] Fix barrier #6490

[SYCL][CUDA][HIP][PI] Fix barrier #6490

Uh oh!

Conversation

t4c1 commented Jul 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

t4c1 commented Aug 17, 2022

Uh oh!

steffenlarsen Sep 5, 2022

Choose a reason for hiding this comment

Uh oh!

t4c1 Sep 7, 2022

Choose a reason for hiding this comment

Uh oh!

steffenlarsen Sep 14, 2022

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

steffenlarsen left a comment

Choose a reason for hiding this comment

Uh oh!

AerialMantis commented Oct 6, 2022

Uh oh!

steffenlarsen commented Oct 6, 2022

Uh oh!

Uh oh!

t4c1 commented Jul 29, 2022 •

edited

Loading