[Question] Scaling the Query Frontend #745
Comments
I don't really understand your answer, @withsmilo. (1) We agree that the query frontend is stateful, right? So do we have a 1:1 relationship between a model and a query frontend? (2) If you add more instances of the query frontend while each model instance connects to only one query frontend instance, how do you ensure that every query frontend has at least one connection to an instance of the model?
Hi, this is a two-part question. 1. Do you have a Slack or a similar channel for posting questions? 2. The second part is about scaling out the query-frontend container. Regarding this section of the docs: http://clipper.ai/tutorials/container_managers/#known-limitation-query-frontend-scaleout, would a rebalancing method mean that each query frontend needs at least one model container behind it? I'm guessing that if I spin up a new query-frontend, I would need to launch it in such a way that it either terminates the connection to one of the pre-existing model containers or spins up a fresh model container. I'm also guessing I would need something to orchestrate that, perhaps a Flask server or some other service running as a separate pod from the query-frontend (or baked into the mgmt frontend).
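To make the rebalancing idea concrete, here is a minimal sketch of the assignment step such an orchestrator might perform. It is not part of Clipper: the function name `rebalance`, the frontend and replica names, and the round-robin policy are all assumptions chosen for illustration. It only computes which model replica should connect to which query frontend, and it refuses any plan that would leave a frontend without a replica of some model.

```python
from collections import defaultdict
from typing import Dict, List


def rebalance(
    frontends: List[str], replicas: Dict[str, List[str]]
) -> Dict[str, Dict[str, List[str]]]:
    """Hypothetical helper: spread each model's replicas over frontends.

    frontends: names of query frontend instances (illustrative).
    replicas:  model name -> names of its running containers (illustrative).
    Returns {frontend: {model: [replica, ...]}}.
    """
    if not frontends:
        raise ValueError("need at least one query frontend")

    plan = {fe: defaultdict(list) for fe in frontends}
    for model, containers in replicas.items():
        # Every frontend must end up with at least one replica of the model,
        # so there must be at least as many replicas as frontends.
        if len(containers) < len(frontends):
            raise ValueError(
                f"model {model!r} has {len(containers)} replica(s) "
                f"for {len(frontends)} frontend(s)"
            )
        # Round-robin: replica i goes to frontend i mod N.
        for i, container in enumerate(containers):
            plan[frontends[i % len(frontends)]][model].append(container)

    return {fe: dict(models) for fe, models in plan.items()}


if __name__ == "__main__":
    print(rebalance(["qf-0", "qf-1"], {"m": ["m-0", "m-1", "m-2"]}))
```

Under these assumptions, adding a query frontend means recomputing the plan and either reconnecting an existing replica or starting a new one whenever the function raises because a model has too few replicas.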