Prevent scaling immediately after API creation #222

deliahu · 2019-07-09T22:58:46Z

Description

Deploying a model and then updating it (e.g. setting compute.cpu to 400m) causes the horizontal pod autoscaler to scale the deployment to 2+ replicas. The same thing happens when running cortex delete and cortex deploy shortly after.

This may be due to the high CPU utilization during pod initialization, although the algorithm seems to try to account for this.

Possible flags:
--horizontal-pod-autoscaler-cpu-initialization-period
--horizontal-pod-autoscaler-initial-readiness-delay

It may also be due to a bug in the metrics server

Potentially related issues (although the problem here doesn't seem to be related to certs):

The text was updated successfully, but these errors were encountered:

deliahu added the enhancement New feature or request label Jul 9, 2019

deliahu changed the title ~~Ignore pod startup in horizontal pod autoscaler~~ Prevent scaling immediately after API creation Jul 10, 2019

deliahu self-assigned this Jul 19, 2019

deliahu added the v0.8 label Jul 19, 2019

deliahu mentioned this issue Jul 23, 2019

Add HPA workload #255

Merged

9 tasks

deliahu closed this as completed in #255 Jul 23, 2019

deliahu added this to the v0.8 milestone Nov 26, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent scaling immediately after API creation #222

Prevent scaling immediately after API creation #222

deliahu commented Jul 9, 2019 •

edited

Loading

Prevent scaling immediately after API creation #222

Prevent scaling immediately after API creation #222

Comments

deliahu commented Jul 9, 2019 • edited Loading

Description

deliahu commented Jul 9, 2019 •

edited

Loading