
Call __init__() from request thread pool #1146


Merged

deliahu merged 3 commits into master on Jun 24, 2020

Conversation

@deliahu (Member) commented Jun 20, 2020

@RobertLucian @vishalbollu what do you think about this? Are there any issues you can think of with this approach?

@vishalbollu (Contributor) commented Jun 20, 2020

This looks like an interesting idea. I have a few points:

If each thread runs its own predictor constructor, does this mean that each thread loads a separate copy of the model into memory? At that point, it may be safer to use processes instead of threads. If so, we could also only run the per-thread initialization when the threadpool is of size 1 (to avoid OOM). At least that way, running TensorFlow sessions should work by default when threads_per_worker is set to 1.

Another consideration, which isn't mutually exclusive with the above, is to import the threadpool from serve.py in the predictor implementation and run the initializations in the predictor's constructor. We could document this pattern in a guide if it works (and I'm not sure that it does).
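For context, the idea being discussed can be sketched roughly as follows. This is a hypothetical illustration, not Cortex's actual internals: `PythonPredictor`, `init_predictor`, and `handle_request` are stand-in names, and the key mechanism is `ThreadPoolExecutor`'s `initializer` hook, which runs once in each worker thread before it serves requests, so each thread constructs (and pays the memory cost of) its own predictor instance.

```python
# Hypothetical sketch: run the predictor's __init__() once per worker
# thread via ThreadPoolExecutor's `initializer` hook. PythonPredictor
# and the helper names below are illustrative stand-ins, not Cortex code.
import threading
from concurrent.futures import ThreadPoolExecutor

local_cache = threading.local()  # per-thread storage for the predictor

class PythonPredictor:
    def __init__(self, config):
        # in a real predictor this is where a model would be loaded,
        # so each thread holds its own copy (hence the OOM concern)
        self.config = config
        self.owner_thread = threading.get_ident()

    def predict(self, payload):
        return {"thread": self.owner_thread, "payload": payload}

def init_predictor(config):
    # runs exactly once in each thread of the pool, before any requests
    local_cache.predictor = PythonPredictor(config)

def handle_request(payload):
    # each request is served by whichever thread picked it up,
    # using that thread's own predictor instance
    return local_cache.predictor.predict(payload)

threads_per_worker = 2
executor = ThreadPoolExecutor(
    max_workers=threads_per_worker,
    initializer=init_predictor,
    initargs=({"example": True},),
)
results = list(executor.map(handle_request, ["a", "b", "c", "d"]))
executor.shutdown()
```

Because thread-bound state (such as a TensorFlow session) is created by the same thread that later uses it, this sidesteps the cross-thread reinitialization problem, at the cost of duplicating whatever the constructor loads.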

@RobertLucian (Member) left a comment

Cool that it works! I think this is a good idea and solves the immediate issue that TensorFlow presents (namely, having to reinitialize things when moving from thread to thread).

The downside is that a user who isn't aware of this can still hit that TF bug/limitation when threads_per_worker > 1, which can lead to a confusing experience with Cortex; it could take some time before they ask us for support. It can make Cortex's functionality look fragmented, as opposed to it simply not working regardless of how many threads there are per worker, even though this is a TF limitation.

I'm thinking we could mitigate this confusion by adding a prominent warning in the docs about the behavioral difference when threads_per_worker > 1.

Also, I'm wondering how much production deployments would benefit from this, since those will probably set threads_per_worker higher than 1 and thus require a different implementation of the predict method. Simple deployments with threads_per_worker set to 1 and workers_per_replica >= 1 will definitely benefit.

I don't see a way of initializing the constructor in each thread that avoids data duplication, so this may be the closest we can get.
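The "different implementation of the predict method" mentioned above might look something like the following hedged sketch: instead of relying on the framework to call `__init__()` per thread, the user's predictor lazily creates its thread-bound resource inside `predict()` using `threading.local`. All names here (`Predictor`, `make_session`) are hypothetical.

```python
# Hypothetical sketch of user-side lazy initialization for
# threads_per_worker > 1: the thread-bound resource (e.g. a TF session)
# is created on first use in each thread, inside predict() itself.
import threading

class Predictor:
    def __init__(self, config):
        self.config = config
        self._local = threading.local()  # holds per-thread state

    def make_session(self):
        # stand-in for real thread-bound setup (e.g. creating a TF session)
        return {"owner_thread": threading.get_ident()}

    def _session(self):
        # lazily create the resource once per thread
        if not hasattr(self._local, "session"):
            self._local.session = self.make_session()
        return self._local.session

    def predict(self, payload):
        session = self._session()
        return {"thread": session["owner_thread"], "payload": payload}

predictor = Predictor({"example": True})
result = predictor.predict("x")
```

The duplication trade-off is the same as with per-thread `__init__()`: each thread still ends up with its own copy of whatever the session holds, but the setup cost is deferred to first use rather than paid upfront in every thread.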

@RobertLucian (Member) left a comment

LGTM!

@deliahu deliahu merged commit 7a4ac70 into master Jun 24, 2020
@deliahu deliahu deleted the thread-pool branch June 24, 2020 03:52
RobertLucian added a commit that referenced this pull request Jun 24, 2020