[SYCL] [DOC] Prepare design-document for assert feature #3461

s-kanaev · 2021-04-01T08:24:56Z

See extension document for SYCL describing assert behaviour

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

HoppeMateusz · 2021-04-06T07:28:23Z

sycl/doc/extensions/Assert/level-zero.md

+ze_result Result = zeEventQueryStatus(Event);
+```
+
+If kernel failed an assertion `zeEventQueryStatus` should return


I don't think this can be achieved in an asynchronous, non-blocking way in L0.

We don't have any communication between a kernel and its event, so we can't signal an event with "assert happened" information.

If we use a global, program-wide assert buffer, every kernel uses the same "assert happened" flag. We have no fine-grained control to determine which kernel, and therefore which associated event, fired the assert.

Fences could be used instead. They synchronize at the command-queue level rather than the kernel level, so any kernel in the command queue that triggers an assert can make the fence synchronization return an error: https://spec.oneapi.com/level-zero/latest/core/PROG.html#fences

Is it still possible in OpenCL?
Can the OpenCL approach be reused in Level-Zero?

Could you please provide more details about using fences?

Fences are described in the L0 spec. They are similar to events but are tied directly to command queues: https://spec.oneapi.com/level-zero/latest/core/PROG.html#fences

In OpenCL the submission model is different: each enqueue is independent, and a single kernel is submitted (queued) at a time. L0 operates on command lists that may contain multiple kernels. Once a command list is submitted to the HW, we can't control when a given kernel in the sequence starts or completes.

OpenCL handles kernels with printf in a blocking way: enqueueNDRangeKernel with printf becomes a blocking call, so we know exactly when a specific kernel has completed. We can do the same for the assert() message, creating the output event only after the kernel has already finished. In L0 this is not possible, because we would have to synchronize the whole command list.
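The attribution problem raised in this thread can be illustrated with a small host-only C++ simulation. It makes no Level Zero or OpenCL calls; every name in it (`AssertFlag`, `Kernel`, `run_blocking_per_kernel`, `run_as_command_list`) is hypothetical. It models one program-wide "assert happened" flag and contrasts inspecting it after every kernel with inspecting it once after a whole batch.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical stand-in for a program-wide assert buffer:
// a single flag shared by every kernel.
struct AssertFlag {
  bool failed = false;
};

// A "kernel" is modelled as a host callable that may set the shared flag.
using Kernel = std::function<void(AssertFlag &)>;

// Blocking, one-kernel-at-a-time model: the flag is inspected right after
// each kernel, so the failing kernel can be identified.
// Returns the index of the first failing kernel, or -1 if none failed.
int run_blocking_per_kernel(const std::vector<Kernel> &kernels) {
  AssertFlag flag;
  for (std::size_t i = 0; i < kernels.size(); ++i) {
    kernels[i](flag);
    if (flag.failed)
      return static_cast<int>(i);
  }
  return -1;
}

// Command-list model: the whole batch runs before the host can look at the
// flag (comparable to synchronizing at queue level). Only "some kernel
// failed" is known afterwards, not which one.
bool run_as_command_list(const std::vector<Kernel> &kernels) {
  AssertFlag flag;
  for (const Kernel &k : kernels)
    k(flag);
  return flag.failed;
}
```

With three kernels where only the second one sets the flag, `run_blocking_per_kernel` returns 1, while `run_as_command_list` can only report `true` for the batch as a whole.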

AlexeySachkov · 2021-04-07T16:12:20Z

sycl/doc/Assert.md

+`sycl::event_error` exception. Otherwise, SYCL Runtime should trigger abort.
+Even though multiple failures of the same or different assertions can happen in
+multiple workitems, implementation is required to deliver only one. The
+assertion failure message is printed to `stderr` by SYCL Runtime.


Does it happen always or only without async_handler set?

I think it should always print the assertion message because:

This would be consistent with the "safe" implementation (the one that depends on hardware support), which is defined to print the message even before notifying the host.

This is also consistent with the way assert works on the host, which prints the assertion message even before raising SIGABRT.

So even if the user sets an async_handler in order to react gracefully to an assert failure, we still print something to stderr? What for? It is not as bad as printing to stdout, but it still seems a bit unnecessary.

I thought it was weird when I first read this spec also. But then I tried the following test:

    #include <cassert>
    #include <csignal>
    #include <cstdlib>

    void handle(int sig) {
      std::exit(0); // Exit silently
    }

    int main() {
      std::signal(SIGABRT, handle);
      assert(false);
    }

The results:

    $ clang -std=c++17 -pedantic -o t t.cpp
    $ ./t
    t: t.cpp:11: int main(): Assertion `false' failed.
    $ echo $?
    0

Despite the fact that I catch the SIGABRT and exit without printing anything, I still get a message printed to stderr.

Therefore, it seems like the behavior defined in this spec is consistent with the way assert works on the host.

That's fairly obvious, given that assert(expr) on the host expands to

    if (!(expr)) {
      fprintf(stderr, ...);
      abort();
    }

In device code, assert(expr) expands to:

    if (!(expr)) {
      __devicelib_assert_fail(#expr, __FILE__, __LINE__, __PRETTY_FUNCTION__, global ID, local ID);
    }
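The ordering discussed in this thread (message first, abort second) can be sketched in portable C++. This is a minimal illustration, not the libc or DPC++ implementation: `AssertSink`, `sketch_assert_fail`, and `SKETCH_ASSERT` are hypothetical names, a string stands in for stderr, and a callback stands in for `abort()` and whatever handler the user installed.

```cpp
#include <functional>
#include <sstream>
#include <string>

// Hypothetical sink: `printed` stands in for stderr, `on_abort` stands in
// for abort() and any user-installed handler.
struct AssertSink {
  std::string printed;
  std::function<void()> on_abort;
};

inline void sketch_assert_fail(AssertSink &sink, const char *expr,
                               const char *file, int line, const char *func) {
  std::ostringstream msg;
  msg << file << ":" << line << ": " << func << ": Assertion `" << expr
      << "' failed.";
  sink.printed = msg.str(); // Step 1: the message is emitted unconditionally.
  if (sink.on_abort)
    sink.on_abort();        // Step 2: only then is the abort path taken.
}

// The failure function is called only when the condition is false.
#define SKETCH_ASSERT(sink, expr)                                          \
  do {                                                                     \
    if (!(expr))                                                           \
      sketch_assert_fail((sink), #expr, __FILE__, __LINE__, __func__);     \
  } while (0)
```

Because the message is produced before the callback runs, a handler that exits silently cannot suppress it, which matches the behaviour observed with `SIGABRT` above.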

s-kanaev · 2021-05-25T16:05:49Z

@kbobrovs , a friendly ping

bader

LGTM, just a few nits.

gmlueck · 2021-05-27T17:39:46Z

I'd suggest changing this paragraph in the extension specification now that we have the new aspect:

It is unspecified whether a failing assert() returns to its caller before the kernel terminates. If a failing call returns, the device code may need to continue execution without deadlocking for the assertion message to be printed or for std::abort() to be called.

Maybe something like this:

Some devices implement assert() natively while others use a fallback implementation, and the two implementations provide different guarantees. The native implementation is most similar to the way assert() works on the host. If an assertion fails in the native implementation, the assertion message is immediately printed to stderr and the program terminates by calling std::abort(). If an assertion fails with the fallback implementation, the failing assert() returns back to its caller and the device code must continue executing (without deadlocking) until the kernel completes. The implementation prints the assertion message to stderr and terminates with std::abort() only after the kernel completes execution. An application can determine which of the two mechanisms a device uses by testing the device aspect aspect::ext_oneapi_native_assert.

Note that this also defines the terms "native support" and "fallback implementation", which you use later in the description of ext_oneapi_native_assert.
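The fallback semantics in the suggested paragraph (the failing assert() returns, the kernel runs to completion, and only then is the failure reported) can be sketched as a host-only simulation. This is not DPC++ runtime code; `FallbackAssertState`, `fallback_assert`, and `run_kernel_with_fallback` are hypothetical names, and the "kernel" is a plain sequential loop over work-items. The compare-exchange reflects the requirement quoted earlier that only one failure is delivered even if several work-items fail.

```cpp
#include <atomic>
#include <string>

// Hypothetical state written by a failing assert in fallback mode.
struct FallbackAssertState {
  std::atomic<int> first_failed_item{-1}; // -1 means "no failure recorded"
  int items_completed = 0;
};

// Fallback-style assert: records only the first failure, then returns to
// its caller instead of terminating.
inline void fallback_assert(FallbackAssertState &st, bool cond, int item) {
  if (cond)
    return;
  int expected = -1;
  // Succeeds only for the first failing work-item; later failures are dropped.
  st.first_failed_item.compare_exchange_strong(expected, item);
}

// Simulated kernel: every work-item asserts `item < limit` and keeps going.
// Returns the message a runtime might print after the kernel completes,
// or an empty string if no assertion failed.
inline std::string run_kernel_with_fallback(int n_items, int limit,
                                            FallbackAssertState &st) {
  for (int item = 0; item < n_items; ++item) {
    fallback_assert(st, item < limit, item);
    ++st.items_completed; // Execution continues past the failed assert.
  }
  int failed = st.first_failed_item.load();
  if (failed < 0)
    return {};
  return "assertion failed in work-item " + std::to_string(failed);
}
```

Running 8 work-items with a limit of 5 completes all 8 items and reports work-item 5 only. In a real application, the choice between this behaviour and the native one would be discovered through the `aspect::ext_oneapi_native_assert` device aspect named above.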

s-kanaev · 2021-05-27T17:49:04Z

> I'd suggest changing this paragraph in the extension specification now that we have the new aspect:

Done.

gmlueck

LGTM

s-kanaev · 2021-05-31T07:00:41Z

The failure in Jenkins/Precommit doesn't relate to the changes here. Reported the failure.

bader · 2021-05-31T08:43:23Z

> The failure in Jenkins/Precommit doesn't relate to the changes here. Reported the failure.

Please disable the related tests and re-run the job.

s-kanaev · 2021-05-31T09:22:16Z

Created PR to disable the test: intel/llvm-test-suite#303

olegmaslovatintel · 2021-10-20T14:55:39Z

sycl/doc/Assert.md

+performed only when assertion is enabled and Device-side Runtime doesn't provide
+implementation of `__devicelib_assert_fail`.
+
+In DPCPP headers one can see if assert is enabled with status of `NDEBUG` macro


We have had many user-reported issues since this functionality was merged (#3767), which seem to be caused by the fallback design.
@s-kanaev, is there a possibility to NOT enable/define/link `__devicelib_assert_fail` by default?

tagging @AlexeySachkov @gmlueck @kbobrovs
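For context on the `NDEBUG` gating mentioned in the quoted hunk, here is a minimal sketch of how an assert-like macro can avoid referencing its failure function entirely when `NDEBUG` is defined. It mirrors the standard C `assert` convention and is not taken from the DPC++ headers; `sketch_assert_enabled`, `sketch_fallback_fail`, `sketch_fail_calls`, and `SKETCH_DEVICE_ASSERT` are hypothetical names.

```cpp
// Hypothetical counter standing in for calls into a fallback failure routine.
static int sketch_fail_calls = 0;

inline void sketch_fallback_fail(const char * /*expr*/) { ++sketch_fail_calls; }

// Reports whether assertions are active in this translation unit.
constexpr bool sketch_assert_enabled() {
#ifdef NDEBUG
  return false;
#else
  return true;
#endif
}

#ifdef NDEBUG
// Assertions disabled: the macro expands to nothing, so the failure
// function is never referenced from the asserting code.
#define SKETCH_DEVICE_ASSERT(expr) ((void)0)
#else
// Assertions enabled: a false condition calls the failure function.
#define SKETCH_DEVICE_ASSERT(expr) \
  ((expr) ? (void)0 : sketch_fallback_fail(#expr))
#endif
```

Compiling with `-DNDEBUG` removes every call to the failure function, which is the usual opt-out under this convention. Whether a fallback should be linked by default when `NDEBUG` is not defined is the separate question raised in the comment above.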

Sergey Kanaev added 2 commits March 31, 2021 17:07

[SYCL] [DOC] Prepare design-document for assert feature

2911ea7

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Remove redundant file

b69a1cd

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

s-kanaev requested review from kbobrovs, pvchupin and a team as code owners April 1, 2021 08:24

s-kanaev commented Apr 1, 2021

sycl/doc/extensions/Assert/level-zero.md

Fix typo

15ea88e

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

romanovvlad reviewed Apr 2, 2021

sycl/doc/extensions/Assert/opencl.md

intel deleted a comment from gmlueck Apr 2, 2021

gmlueck reviewed Apr 2, 2021

sycl/doc/Assert.md

Sergey Kanaev added 3 commits April 5, 2021 16:16

Address some review comments. Add description of built-ins.

ca08fec

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Fix links

1f8d9a9

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Clarify that assertion failure message is printed by DPCPP Runtime

2ee590c

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

HoppeMateusz reviewed Apr 6, 2021

sycl/doc/extensions/Assert/level-zero.md

Sergey Kanaev added 2 commits April 6, 2021 17:31

Clarify that fallback assert impl is synchronous

77699a2

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Fix typo in level-zero ext draft

001a573

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

kbobrovs reviewed Apr 6, 2021

sycl/doc/Assert.md

kbobrovs reviewed Apr 6, 2021

Address some review comments.

32b6479

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

AlexeySachkov reviewed Apr 7, 2021

gmlueck reviewed Apr 7, 2021

sycl/doc/Assert.md

Sergey Kanaev added 8 commits April 8, 2021 16:30

Add exception extension

b8637c2

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Use error-code instead of distinct exception.

b0cd85f

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

[SYCL] Add OpenCL extension for assert error code

8c03648

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

[SYCL] Add Level-Zero extension for assert error code

121c945

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Merge branch 'private/s-kanaev/assert-ocl-l0' into private/s-kanaev/assert-abort

13b40fd

Remove draft files

a4b4884

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Remove unwanted part

c06db5f

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Merge branch 'private/s-kanaev/assert-ocl-l0' into private/s-kanaev/assert-abort

823124a

Sergey Kanaev added 2 commits May 19, 2021 16:35

Remove use of NDEBUG from suggested changes

a5461f3

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Reorder text to increase readability

32a32f4

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

kbobrovs suggested changes May 19, 2021

sycl/doc/Assert.md

Address review comment

641d071

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com> Co-authored-by: kbobrovs <konstantin.s.bobrovsky@intel.com>

s-kanaev requested a review from kbobrovs May 20, 2021 09:03

bader requested a review from AlexeySachkov May 24, 2021 09:12

kbobrovs previously approved these changes May 25, 2021

bader reviewed May 26, 2021

Sergey Kanaev and others added 2 commits May 27, 2021 13:23

Address review comments

dc058a9

Co-authored-by: bader <alexey.bader@intel.com> Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

Add aspect

16fd8f0

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

s-kanaev dismissed kbobrovs’s stale review via 16fd8f0 May 27, 2021 14:57

s-kanaev requested review from kbobrovs and bader May 27, 2021 14:58

bader previously approved these changes May 27, 2021

kbobrovs previously approved these changes May 27, 2021

Update extension with suggestion

fbca768

Signed-off-by: Sergey Kanaev <sergey.kanaev@intel.com>

s-kanaev dismissed stale reviews from kbobrovs and bader via fbca768 May 27, 2021 17:48

s-kanaev requested review from bader and kbobrovs May 27, 2021 17:49

gmlueck approved these changes May 27, 2021

bader approved these changes May 27, 2021

bader merged commit 69fc6dc into intel:sycl May 31, 2021

olegmaslovatintel reviewed Oct 20, 2021
