Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GPU utilization plot reports repeated GPUs #4117

Closed
mrocklin opened this issue Sep 19, 2020 · 5 comments
Closed

GPU utilization plot reports repeated GPUs #4117

mrocklin opened this issue Sep 19, 2020 · 5 comments

Comments

@mrocklin
Copy link
Member

@quasiben and I were playing around with his 2-gpu system, and were surprised to see four GPUs in his dashboard plots. I suspect that each worker was separately reporting the metrics for both GPUs on the system.

The relevant code for this is here:

def _pynvml_handles():
global handles
if handles is None:
pynvml.nvmlInit()
count = pynvml.nvmlDeviceGetCount()
handles = [pynvml.nvmlDeviceGetHandleByIndex(i) for i in range(count)]
return handles

Perhaps pynvml does not respect CUDA_VISIBLE_DEVICES? Should we filter this on our own?

cc @rjzamora @jacobtomlinson

@trivialfis
Copy link

trivialfis commented Sep 27, 2020

Hi @mrocklin , I'm using LocalCUDACluster from dask-cuda and can't find the utilization information on dashboard. Is there any document on how to enable it?

@quasiben
Copy link
Member

@trivialfis you may need to upgrade the dask lab extension if you are using jupyter lab (recommended).

Screen Shot 2020-09-27 at 9 23 01 AM

You can also find it directly on the dashboard at :8787/individual-gpu-utilization

@trivialfis
Copy link

@quasiben Thanks for the reply. The direct way works perfectly for me.

@mrocklin

Perhaps pynvml does not respect CUDA_VISIBLE_DEVICES?

If memory serves, you are right that nvml doesn't respect CUDA_VISIBLE_DEVICES, so pynvml won't be affected by this env.

@trivialfis
Copy link

Related #3808 .

@jacobtomlinson
Copy link
Member

I think this has been resolved by #3810

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants