-
Notifications
You must be signed in to change notification settings - Fork 29
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add support for raw_kernel_arg extension #2038
base: master
Are you sure you want to change the base?
Conversation
@@ -53,4 +53,22 @@ void DPCTLWorkGroupMemory_Delete(__dpctl_take DPCTLSyclWorkGroupMemoryRef Ref); | |||
DPCTL_API | |||
bool DPCTLWorkGroupMemory_Available(); | |||
|
|||
typedef struct RawKernelArgDataTy | |||
{ | |||
void *bytes; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
RawKernelArgDataTy doesn't owns bytes
which rises a lot of questions.
Who's owning bytes
?
Who's responsible for deleting bytes
?
How long do we need to keep bytes
alive?
It is probably better to make RawKernelArgDataTy
to own bytes
and allocate and copy data on construction of RawKernelArgDataTy
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I had considered having RawKernelArgDataTy
own the money and copy on construction. My reasoning to not do that was that the current approach allows the modification of the data in the time between construction of the raw kernel arg and the kernel launch. If you prefer to have raw kernel arg own the data, I can change that, no problem.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think I favor copying as well, especially because, while the overload from an object exposing buffer is pretty unambiguous (though the GC is a concern), the nbytes and pointer constructor or using the C API directly have a lot of ownership concerns.
{ | ||
DPCTLSyclRawKernelArgRef rka = nullptr; | ||
try { | ||
auto RawKernelArg = new RawKernelArgData{bytes, count}; |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Could you please use unique_ptr
here?
auto RawKernelArg = std::make_unique(new RawKernelArgData{bytes, count});
rka = wrap<RawKernelArgData>(RawKernelArg.release());
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure I understand the advantage here. We're still allocating with new
and we're immediately releasing the unique_ptr
in the second line. Is there any benefit of this, e.g., exception handling?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It also deviates from the code style used in the rest of libsyclinterface
, e.g., here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yes, biggest advantage would be not having to delete it in case an exception is caught.
If an exception did occur here, we would need delete rka
below in the catch block.
Add support for the DPC++ SYCL extension
raw_kernel_arg
that allows to pass a binary blob as kernel arguments.This is for example useful if the kernel uses a
struct
as kernel parameter, but is loaded from SPIR-V and the argument type is unknown forset_arg
.The implementation follows #1984.