Python predict() does not work with multiprocessing #4246

Closed

hcho3 opened this issue Mar 11, 2019 · 15 comments

@hcho3 (Collaborator) commented Mar 11, 2019

It has been reported that the predict() function in the Python interface does not work well with multiprocessing. We should find a way to allow multiple processes to predict with the same model simultaneously.
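For context, a minimal sketch of the reported failure mode. This is illustrative only, not from the issue itself: the model, data, and predict_chunk helper are hypothetical, and the sketch assumes Linux's fork start method, where children inherit the parent's XGBoost state. In affected setups the pool call hangs or raises dmlc::Error.

import pickle
from multiprocessing import Pool

import numpy as np
import xgboost as xgb

# Train a small model in the parent process, before any fork, then round-trip
# it through pickle to mimic loading a saved model.
X = np.random.rand(100, 10)
y = np.random.randint(2, size=100)
booster = xgb.train({"objective": "binary:logistic"}, xgb.DMatrix(X, label=y))
booster = pickle.loads(pickle.dumps(booster))

def predict_chunk(features):
    # Runs in a forked child that inherited the parent's booster (and, on GPU
    # builds, its CUDA context) -- this is where the reported failure shows up.
    return booster.predict(xgb.DMatrix(features))

if __name__ == "__main__":
    chunks = np.array_split(np.random.rand(1000, 10), 4)
    with Pool(processes=4) as pool:
        preds = pool.map(predict_chunk, chunks)  # may hang or raise dmlc::Error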

@hcho3 hcho3 changed the title Python predict() should work with multiprocessing Python predict() does not work with multiprocessing Mar 11, 2019
@andreieuganox

Is there any update on this? It seems that this is a complete blocker to using XGBoost in production...

@xEcEz commented Jul 25, 2019

Any update? I am just discovering this now. This is indeed a problem...

> It has been reported that the predict() function in the Python interface does not work well with multiprocessing. We should find a way to allow multiple processes to predict with the same model simultaneously.

What do you mean exactly?

In my context, I have a pool of processes that each load a pickled model and then try to make predictions, which is where I get the dmlc::Error.
Note that I also tried with a single process in the pool and still got the same error.

Here is the error stack:

terminate called after throwing an instance of 'dmlc::Error'
  what():  [13:08:08] /workspace/include/xgboost/./../../src/common/common.h:41: /workspace/src/common/host_device_vector.cu: 150: initialization error

Stack trace returned 10 entries:
[bt] (0) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(dmlc::StackTrace(unsigned long)+0x47) [0x7f14b4c0ffc7]
[bt] (1) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(dmlc::LogMessageFatal::~LogMessageFatal()+0x1d) [0x7f14b4c1042d]
[bt] (2) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(dh::ThrowOnCudaError(cudaError, char const*, int)+0x123) [0x7f14b4de2153]
[bt] (3) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(xgboost::HostDeviceVectorImpl<float>::DeviceShard::Init(xgboost::HostDeviceVectorImpl<float>*, int)+0x278) [0x7f14b4e3fb68]
[bt] (4) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(+0x33b261) [0x7f14b4e17261]
[bt] (5) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(xgboost::HostDeviceVectorImpl<float>::Reshard(xgboost::GPUDistribution const&)+0x1b6) [0x7f14b4e40d26]
[bt] (6) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(xgboost::obj::RegLossObj<xgboost::obj::LinearSquareLoss>::PredTransform(xgboost::HostDeviceVector<float>*)+0xf9) [0x7f14b4e0d239]
[bt] (7) /home/.../.local/lib/python3.6/site-packages/xgboost/./lib/libxgboost.so(XGBoosterPredict+0x107) [0x7f14b4c08be7]
[bt] (8) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_call_unix64+0x4c) [0x7f14f3b21dae]
[bt] (9) /usr/lib/x86_64-linux-gnu/libffi.so.6(ffi_call+0x22f) [0x7f14f3b2171f]

It seems that CUDA is somehow involved in this. If that helps, I have CUDA v10.0.130 installed on my machine.

I tried to run it on a machine in the cloud that doesn't have any GPU and it seems to work as intended.

@teopapad92

I ran into the same problem recently.

I noticed that if you use an older version of XGBoost (0.72.1), the problem of it hanging and not doing anything seems to disappear, but the process takes far too long.

Just for comparison, I used multithreading (which is slower than multiprocessing) on the latest version (0.90).
Results:
- Multiprocessing on v0.72.1: 672 sec
- Multithreading on v0.90: 164 sec

@trivialfis (Member)

Some related thoughts: nthread is a runtime parameter, so pickling (which is what Python does when spawning a new process) cannot include nthread in the pickle. This can be resolved once #4855 materializes.
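To illustrate the consequence, a minimal sketch (an assumption for illustration, not from the comment): since the pickle carries no runtime parameters, nthread can be re-set on the unpickled Booster. The path model.pkl is hypothetical.

import pickle
import xgboost as xgb  # needed so the pickled Booster class can be reconstructed

with open("model.pkl", "rb") as f:  # hypothetical path
    booster = pickle.load(f)
# Runtime parameters such as nthread are not part of the pickle, so set
# them again explicitly after loading.
booster.set_param({"nthread": 4})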

@mayanxin commented Sep 27, 2019

I had the same problem when I tried to run it on a machine that has GPUs

@owenljn commented Nov 15, 2019

Any update on this? I have the same issue here

@trivialfis (Member)

Thanks for the reminder. Let's see if I can get to this over the weekend.

@owenljn commented Nov 15, 2019

I implemented a workaround using a ZMQ load balancer.

I cut the code where the XGBoost models are initialized and loaded out of my master script, put it into an independent Python script, and implemented a worker routine that uses ZMQ load-balancing techniques to serve the XGBoost models in the backend.

Due to system memory limits, I only started 4 workers, so there are 4 independent XGBoost models as backend workers. The frontend is still the multiprocessing part of the original master script, but instead of using the XGBoost models to make predictions directly, it now sends requests to the backend XGBoost workers and receives the predictions from them. No more dmlc errors.

Still, it would be awesome if XGBoost eventually made predict() work with multiprocessing.
Link to the ZMQ load balancer that inspired my workaround
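A minimal sketch of the worker side of this approach, under assumptions not stated in the comment: pyzmq is installed, a broker runs zmq.proxy() with a ROUTER frontend and a DEALER backend on tcp://localhost:5560, requests arrive as pickled NumPy matrices, and model.pkl and prediction_worker are hypothetical names. The key property is that each worker owns its own Booster, so no XGBoost state ever crosses a fork.

import pickle

import xgboost as xgb
import zmq

def prediction_worker(model_path="model.pkl", backend="tcp://localhost:5560"):
    # Load the pickled model inside this worker process, so no XGBoost/CUDA
    # state is shared with the frontend processes.
    with open(model_path, "rb") as f:
        booster = pickle.load(f)

    ctx = zmq.Context()
    sock = ctx.socket(zmq.REP)
    sock.connect(backend)  # the broker's DEALER side hands out requests

    while True:
        features = pickle.loads(sock.recv())        # pickled NumPy feature matrix
        preds = booster.predict(xgb.DMatrix(features))
        sock.send(pickle.dumps(preds))              # reply with the predictions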

@owenljn commented Nov 18, 2019

Hi, I implemented a demo that shows how a ZMQ load balancer can help with this issue:
Link to the demo

@trivialfis (Member)

Right now another workaround is to not initialize XGBoost before forking (e.g., load the pickle only after the fork; a sketch follows below). Maybe we could use a low-level driver API to maintain the CUDA context ourselves, but simply using a distributed framework like Dask seems much simpler.
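A minimal sketch of that fork-then-load workaround, assuming a hypothetical pickled model at model.pkl: the pickle is loaded inside the worker function, after the fork, so each child builds its own fresh XGBoost (and CUDA) state.

import pickle
from multiprocessing import Pool

import numpy as np
import xgboost as xgb

MODEL_PATH = "model.pkl"  # hypothetical path

def predict_chunk(features):
    # Load the model here, in the worker, not in the parent before the fork.
    with open(MODEL_PATH, "rb") as f:
        booster = pickle.load(f)
    return booster.predict(xgb.DMatrix(features))

if __name__ == "__main__":
    chunks = np.array_split(np.random.rand(1000, 10), 4)
    with Pool(processes=4) as pool:
        preds = pool.map(predict_chunk, chunks)

Loading the pickle once per chunk is wasteful; a Pool initializer that loads the model once per worker process would follow the same principle.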

@trivialfis (Member)

A quick update on this: thread-safe prediction and in-place prediction are now supported.

marco-c added a commit to mozilla/bugbug that referenced this issue Jun 15, 2020
… in parallel with multiple processes"

This reverts commit 34de11c.

We can't run the evaluation right after training otherwise, because
of dmlc/xgboost#4246.
Performing test selection in parallel doesn't buy us that much anyway
as XGBoost already works in parallel (only the generation of the
elements to pass to XGBoost would be parallel).
@hcho3 hcho3 closed this as completed Jun 18, 2020
@colin-zhou

Hi @trivialfis, has this problem been fixed now or not?

@hcho3 (Collaborator, Author) commented Dec 21, 2020

@colin-zhou You can now use inplace_predict() for thread-safe prediction.
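A minimal sketch of what that looks like, with a toy model trained in-process (the names and data here are illustrative, not from the thread): inplace_predict() accepts a NumPy array directly, with no intermediate DMatrix, and can be called concurrently from multiple threads.

from concurrent.futures import ThreadPoolExecutor

import numpy as np
import xgboost as xgb

# Train a small model; inplace_predict then takes raw NumPy input and is
# safe to call from several threads sharing the same Booster.
X = np.random.rand(100, 10)
y = np.random.randint(2, size=100)
booster = xgb.train({"objective": "binary:logistic"}, xgb.DMatrix(X, label=y))

chunks = np.array_split(np.random.rand(1000, 10), 8)
with ThreadPoolExecutor(max_workers=8) as pool:
    preds = list(pool.map(booster.inplace_predict, chunks))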

@sangaline

> You can now use inplace_predict() for thread-safe prediction.

@trivialfis @hcho3 I'm still experiencing this issue with the latest v1.7.1 release and model.inplace_predict(). When loading a pickled model before forking, any call to XGBClassifier.predict() after forking will hang. The predictor is set to auto on a machine with no GPU or CUDA installed, and model._can_use_inplace_predict() returns True. The hang occurs here:

predts = self.get_booster().inplace_predict(
    data=X,
    iteration_range=iteration_range,
    predict_type="margin" if output_margin else "value",
    missing=self.missing,
    base_margin=base_margin,
    validate_features=validate_features,
)

@trivialfis (Member) commented Nov 20, 2022 via email
