Getting "cudaMallocAsync not supported with this CUDA driver/runtime version" #7636
-
I am trying to run Spark 3.2.2 with rapids with rapids-4-spark_2.12-22.10.0.jar in local. |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 7 replies
-
I can provide other necessary information if required |
Beta Was this translation helpful? Give feedback.
-
@pedastrian57 I am not sure we have seen this before, honestly it's the first time I see usage with vGPU. I took a look at some internal threads on this and it looks like for a vGPU setup you need to enable Unified Memory for this allocator to work ( In our project we normally fallback to another allocator if we detect that the driver/kernel are too old. In this case we detected everything was fine, based on version numbers, but then the async feature failed to start up after that. We should be able to make this less painful and retry with our fallback allocator, but I am kind of glad it broke since we've learned something new here. If you are not able to try the Unified Memory suggestion, please set |
Beta Was this translation helpful? Give feedback.
-
@abellina Thanks for the reply, with the arena allocator, its working on local now. |
Beta Was this translation helpful? Give feedback.
-
@abellina |
Beta Was this translation helpful? Give feedback.
@pedastrian57 I am not sure we have seen this before, honestly it's the first time I see usage with vGPU. I took a look at some internal threads on this and it looks like for a vGPU setup you need to enable Unified Memory for this allocator to work (
cudaMallocAsync
). If you are able to, you could try this: https://docs.nvidia.com/grid/latest/grid-vgpu-user-guide/index.html#enabling-unified-memory-vgpu.In our project we normally fallback to another allocator if we detect that the driver/kernel are too old. In this case we detected everything was fine, based on version numbers, but then the async feature failed to start up after that. We should be able to make this less painful and retry w…