Latency high after loading a new model. #385

Closed
xiaop1987 opened this issue Mar 30, 2017 · 8 comments
Labels
type:performance Performance Issue

Comments

@xiaop1987

I'm using TensorFlow Serving to serve a wide-and-deep model as an online prediction service, and the model is updated every 10 minutes. We found that the latency of the first few requests is high right after a new model is loaded. Is this a known issue, or is there any suggestion for how to track down this problem?

@chrisolston
Contributor

Some TensorFlow graphs perform lazy initialization, which makes the first request (or first few requests) to a newly-loaded model slow. The best way to handle that is to add initialization or dummy "warm-up requests" to the init op, which tf-serving calls while loading the model.
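
Here is a minimal sketch of that approach using the TF 1.x SavedModelBuilder API; the toy one-layer model, tensor names, feature size, and export path below are placeholders, not taken from this thread. Grouping a dummy forward pass into the main_op makes tf-serving run that work once at load time, before real traffic arrives.

```python
import tensorflow as tf

export_dir = "/tmp/warmup_model/1"   # placeholder; must not already exist
num_features = 10                    # placeholder feature size

with tf.Graph().as_default() as graph, tf.Session(graph=graph) as sess:
    # Toy serving graph: one dense layer over a float placeholder.
    x = tf.placeholder(tf.float32, [None, num_features], name="x")
    w = tf.get_variable("w", [num_features, 1])
    y = tf.matmul(x, w, name="y")

    # Warm-up path: push a constant dummy input through the same weights, and
    # group it into the main op so the work happens when the model is loaded.
    dummy = tf.zeros([1, num_features])
    warmup = tf.matmul(dummy, w)
    main_op = tf.group(tf.tables_initializer(), warmup)

    sess.run(tf.global_variables_initializer())

    signature = tf.saved_model.signature_def_utils.predict_signature_def(
        inputs={"x": x}, outputs={"y": y})
    builder = tf.saved_model.builder.SavedModelBuilder(export_dir)
    builder.add_meta_graph_and_variables(
        sess, [tf.saved_model.tag_constants.SERVING],
        signature_def_map={"serving_default": signature},
        main_op=main_op)
    builder.save()
```

On older TF releases the same hook is exposed as legacy_init_op; either way the cost of lazy initialization is paid during load rather than on the first live request.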

@xiaop1987
Author

xiaop1987 commented Apr 3, 2017

@chrisolston Thanks very much for your explanation and suggestion; the problem is clear to me now.
Here are some suggestions for tf-serving model loading.
a) tf-serving could add a warm-up option:
1. Store a request for each model when the first request arrives.
2. When a new version of the model is loaded, do not mark it ready until it has been warmed up with the stored request.

b) Add staggered (lazy) model loading:
1. We may start hundreds of tf-serving processes, and they all begin loading and updating the new model version at almost the same time. This can make the cluster's network and disks quite busy (the model is stored on HDFS) and make the cluster unstable.
2. Instead, each process could load/update the model at a random time within a specified period to smooth out the network and disk load.

@chrisolston
Contributor

For (a), the recommended approach is to do it within the tf graph, triggered from tf-serving calling the init op during load.

For (b), interesting idea. I would expect various I/O queues to smooth it out anyway, but maybe you are hitting timeouts? You could write a custom SourceAdapter that acts as the identity function but adds a random delay -- that would do the trick. Feel free to contribute the SourceAdapter via a PR.

@eldonaldo

Hi, I have the exact same problem. However, I do not understand how one can add initialization or dummy "warm-up requests" to the init op (I used Keras for training and the SavedModelBuilder for exporting the model). Can you please explain it in more detail, e.g. with a code example?

Thanks!

@eldonaldo

Ping @chrisolston

@ydp
Contributor

ydp commented May 28, 2018

same problem

@weberxie

Hi @chrisolston, I have the same problem. Can you provide an example of how to call the init op during load?

@tianyapiaozi
Contributor

Hi @chrisolston, the current version of TF Serving tries to load warm-up requests from the tf_serving_warmup_requests file. Does TensorFlow provide a common API for exporting requests to that location, or should we write the requests there manually?
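
For anyone looking for the concrete shape of that file, here is a minimal sketch of writing it manually; the model name "widendeep", signature name, input key "x", feature size, and paths are placeholders, not from this thread. The records are PredictionLog protos written as a TFRecord file named tf_serving_warmup_requests under assets.extra/ inside the SavedModel version directory, and tf-serving replays them after loading the model, before marking the new version ready.

```python
import os

import tensorflow as tf
from tensorflow_serving.apis import predict_pb2, prediction_log_pb2

export_dir = "/tmp/warmup_model/1"                       # SavedModel version directory
warmup_dir = os.path.join(export_dir, "assets.extra")
os.makedirs(warmup_dir, exist_ok=True)

# Build one representative request; in practice, record a real production
# request for the model being exported.
request = predict_pb2.PredictRequest()
request.model_spec.name = "widendeep"
request.model_spec.signature_name = "serving_default"
request.inputs["x"].CopyFrom(
    tf.make_tensor_proto([[0.0] * 10], dtype=tf.float32))

# Each record in the file is a PredictionLog wrapping one warm-up request.
warmup_path = os.path.join(warmup_dir, "tf_serving_warmup_requests")
with tf.io.TFRecordWriter(warmup_path) as writer:
    log = prediction_log_pb2.PredictionLog(
        predict_log=prediction_log_pb2.PredictLog(request=request))
    writer.write(log.SerializeToString())
```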
