
Massive memory (100GB) used by dask-scheduler #4243

Open
pseudotensor opened this issue Nov 13, 2020 · 1 comment
pseudotensor commented Nov 13, 2020

Cross-posting, since this now seems to be mostly a dask.distributed problem.

Maybe related:
#3898
dask/dask#3530
dask/dask#6762

See dmlc/xgboost#6388 (comment) for code and a reproducer.

In very short order, the workers and the scheduler hit the OOM killer because they keep accumulating memory, even across Python client code that completes cleanly.
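This is not the original reproducer (that is linked above in dmlc/xgboost#6388); it is a minimal sketch of the pattern, assuming a LocalCluster stands in for the real multi-GPU setup. The `scheduler_rss_mb` helper is something I'm adding here for illustration, not part of any existing repro:

```python
# Minimal sketch: repeatedly run a small graph to completion and watch
# scheduler resident memory. If scheduler RSS keeps growing across
# cleanly completed rounds, that matches the behavior reported here.
import psutil

from dask.distributed import Client, LocalCluster


def scheduler_rss_mb(dask_scheduler=None):
    # Executed on the scheduler process via run_on_scheduler;
    # returns its resident set size in MB.
    return psutil.Process().memory_info().rss / 1e6


if __name__ == "__main__":
    cluster = LocalCluster(n_workers=2, threads_per_worker=1)
    client = Client(cluster)

    for i in range(100):
        # A fresh lambda each iteration gives the scheduler new task
        # keys every round, mimicking repeated independent client runs.
        futures = client.map(lambda x: x + 1, range(10_000))
        client.gather(futures)
        del futures  # drop references so the scheduler can forget the tasks

        if i % 10 == 0:
            rss = client.run_on_scheduler(scheduler_rss_mb)
            print(f"iteration {i}: scheduler RSS ~{rss:.0f} MB")

    client.close()
    cluster.close()
```

With everything released after each round, scheduler RSS should stay roughly flat; steady growth across iterations would be the symptom described above.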

This issue effectively makes it impossible to use dask with, e.g., NVIDIA RAPIDS/xgboost as a multi-GPU or multi-node solution.

pseudotensor (Author) commented

I'm not confident it is purely a dask.distributed issue, so I may continue the discussion in the xgboost repo.
