-
I was just wondering if there is a way to create arrays in shared memory. In the Windows 10 Task Manager > Performance tab, it shows that the Intel GPU has only shared memory, while the NVIDIA GPU has both dedicated and shared memory, and I can monitor the usage of each. When I run my code on the Intel GPU, the arrays are created in shared memory. When I run my code on the NVIDIA GPU, the arrays are created in dedicated GPU memory, which on some computers is actually smaller than the shared memory, so large arrays may throw a MEM_OBJECT_ALLOCATION_FAILURE error.

I have looked into SVM, but that appears to be something different (and a bit too complex), and I don't think it is a solution here. How can I specify which memory to use when creating/copying buffers, please? A simple example of elementwise multiplication of two 2D arrays would be a super bonus.
-
To be unhelpful, this question is technically out of scope of PyOpenCL, which merely exposes the abstraction offered by OpenCL; what physical memory is used to back allocations is up to the actual ICD ("driver"/"implementation"). So the right thing to do is to look at the documentation for the ICD. To be a bit more helpful, there are a few useful questions here.

One question is, "how does Windows report this memory usage?". I unfortunately can't help with that, since I know next to nothing about Windows. The other two I can try.

For (current-gen) Intel, this is easy: there is only DRAM on the mainboard. Some of it may be earmarked specifically for GPU use (and therefore reported as "Dedicated"), but in general there is only one type of physical memory. While the non-dedicated memory may have some overheads in access due to page tables and such, generally both should be equivalent bandwidth- and latency-wise.

For Nvidia, there is (typically) DRAM directly attached to the GPU, and that's what you get when you allocate memory. This is typically somewhat limited in size (a few gigs). Nvidia GPUs can also access host memory directly, but this has sufficiently high cost that it is not often an appealing option.

So overall, there is potentially little reason to want anything but the default mode of memory allocation. Could you clarify what you're trying to accomplish?
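For what it's worth, here is a sketch of the requested elementwise multiplication of two 2D arrays with PyOpenCL. Note the hedge: `ALLOC_HOST_PTR` is only a *hint* that the buffer should be backed by host-accessible memory; whether that ends up in what Task Manager calls "shared" memory is entirely up to the ICD, per the point above.

```python
import numpy as np

try:
    import pyopencl as cl
    HAVE_CL = True
except ImportError:
    HAVE_CL = False  # fall back to a NumPy-only reference run

# Two 2D float32 arrays and the NumPy reference result.
a = np.random.rand(256, 256).astype(np.float32)
b = np.random.rand(256, 256).astype(np.float32)
expected = a * b

if HAVE_CL:
    ctx = cl.create_some_context()
    queue = cl.CommandQueue(ctx)
    mf = cl.mem_flags

    # ALLOC_HOST_PTR *requests* host-accessible backing memory; the ICD
    # decides what physical memory actually backs the allocation.
    flags = mf.READ_ONLY | mf.COPY_HOST_PTR | mf.ALLOC_HOST_PTR
    a_buf = cl.Buffer(ctx, flags, hostbuf=a)
    b_buf = cl.Buffer(ctx, flags, hostbuf=b)
    out_buf = cl.Buffer(ctx, mf.WRITE_ONLY | mf.ALLOC_HOST_PTR, a.nbytes)

    # 1D kernel over the flattened (contiguous) arrays.
    prg = cl.Program(ctx, """
    __kernel void mul(__global const float *a,
                      __global const float *b,
                      __global float *out)
    {
        int gid = get_global_id(0);
        out[gid] = a[gid] * b[gid];
    }
    """).build()

    prg.mul(queue, (a.size,), None, a_buf, b_buf, out_buf)

    result = np.empty_like(a)
    cl.enqueue_copy(queue, result, out_buf)
    assert np.allclose(result, expected)
```

Dropping `ALLOC_HOST_PTR` (keeping just `COPY_HOST_PTR`) gives the default behavior described above, which on Nvidia will normally land in the on-GPU DRAM and is usually what you want for performance.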