Program with device code in multiple translation units fails on CUDA #4156

sergey-semenov · 2021-07-21T16:52:29Z

Describe the bug
A simple program with device code in multiple translation units fails in runtime with CUDA_ERROR_INVALID_IMAGE as of #3735

To Reproduce
h.hpp:

#include <CL/sycl.hpp>

void submit_kernelB();

b.cpp

#include "h.hpp"

class KernelNameB;

void submit_kernelB() {
  sycl::queue q;
  q.submit([&](sycl::handler &cgh) { cgh.single_task<KernelNameB>([]() {}); });
}

main.cpp:

#include "h.hpp"
#include <CL/sycl.hpp>

class KernelNameA;
void submit_kernelA() {
  sycl::queue q;
  q.submit([&](sycl::handler &cgh) { cgh.single_task<KernelNameA>([]() {}); });
}

int main() { submit_kernelA(); }

clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda-sycldevice main.cpp b.cpp
./a.out

This reproducer fails with CUDA_ERROR_INVALID_IMAGE, note that compiling this results in 2 device images as of #3735, but in only one with it reverted. The error disappears once the number of device images in the application is reduced to 1 (either by moving submit_kernelB to the same translation unit as submit_kernelA, by using -fsycl-device-code-split=off or by reverting #3735).

Environment:

OS: Linux
Target device and vendor: CUDA, Titan RTX.
DPC++ version: e9d308e
Dependencies version: CUDA 10.1

The text was updated successfully, but these errors were encountered:

bader · 2021-07-21T16:55:09Z

@sergey-semenov, I think this issue is already fixed by 351af24. Could you check with newer version of the compiler?

bader · 2021-07-21T16:57:21Z

#4088 and #4079 seems to be about the same problem.

sergey-semenov · 2021-07-21T17:02:04Z

Ah, I specified the guilty commit rather than the version of the compiler I reproduced this on. This problem is still reproducible on e9d308e, so it seems 351af24 didn't address this.

bader · 2021-07-21T17:07:39Z

Okay, thanks for the update.
Maybe #4107 will help this case.

Michoumichmich · 2021-07-21T18:07:34Z

@sergey-semenov can you try that? Reverting only the driver is enough to make cuda work again (temporarily)

sergey-semenov · 2021-07-22T12:31:12Z

@Michoumichmich Reverting the driver part of f7ce532 didn't help, the reproducer is still failing as before.

bader · 2021-07-26T11:37:00Z

A simple program with device code in multiple translation units fails in runtime with CUDA_ERROR_INVALID_IMAGE as of #3735

@steffenlarsen, could you take a look, please? It looks like #3735 introduced a significant functional regression.

steffenlarsen · 2021-07-26T11:41:12Z

#4107 introduces a better solution for the driver changes in #3735. @sergey-semenov would you be able to check if that solves this issue?

sergey-semenov · 2021-07-26T12:36:32Z

This is indeed resolved by #4107

sergey-semenov added the bug label Jul 21, 2021

bader added the cuda label Jul 21, 2021

sergey-semenov linked a pull request Jul 26, 2021 that will close this issue

[SYCL] Add splitting module capabilities when compiling for NVPTX and AMDGCN #4107

Merged

bader closed this as completed in #4107 Jul 29, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Program with device code in multiple translation units fails on CUDA #4156

Program with device code in multiple translation units fails on CUDA #4156

sergey-semenov commented Jul 21, 2021 •

edited

Loading

bader commented Jul 21, 2021

bader commented Jul 21, 2021

sergey-semenov commented Jul 21, 2021

bader commented Jul 21, 2021

Michoumichmich commented Jul 21, 2021 •

edited

Loading

sergey-semenov commented Jul 22, 2021

bader commented Jul 26, 2021

steffenlarsen commented Jul 26, 2021

sergey-semenov commented Jul 26, 2021

Program with device code in multiple translation units fails on CUDA #4156

Program with device code in multiple translation units fails on CUDA #4156

Comments

sergey-semenov commented Jul 21, 2021 • edited Loading

bader commented Jul 21, 2021

bader commented Jul 21, 2021

sergey-semenov commented Jul 21, 2021

bader commented Jul 21, 2021

Michoumichmich commented Jul 21, 2021 • edited Loading

sergey-semenov commented Jul 22, 2021

bader commented Jul 26, 2021

steffenlarsen commented Jul 26, 2021

sergey-semenov commented Jul 26, 2021

sergey-semenov commented Jul 21, 2021 •

edited

Loading

Michoumichmich commented Jul 21, 2021 •

edited

Loading