Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Request-based horizontal pod autoscaling #573

Closed
deliahu opened this issue Nov 11, 2019 · 0 comments · Fixed by #815
Closed

Request-based horizontal pod autoscaling #573

deliahu opened this issue Nov 11, 2019 · 0 comments · Fixed by #815
Labels
enhancement New feature or request
Milestone

Comments

@deliahu
Copy link
Member

deliahu commented Nov 11, 2019

Description

Currently, the user must tune an API's CPU request for horizontal pod autoscaling to behave as expected. An approach based on concurrent requests per container may be better (similar to what Knative uses).

This would also make autoscaling for GPU workloads behave more as expected

It may make sense to have both request-based and CPU/GPU-based autoscaling active at the same time, i.e. it will scale when either of the thresholds are met, and won't scale back until both metrics have backed off.

@deliahu deliahu added the enhancement New feature or request label Nov 11, 2019
@deliahu deliahu changed the title Revisit pod autoscaling Request-based horizontal pod autoscaling Nov 25, 2019
@deliahu deliahu added the v0.13 label Nov 25, 2019
@deliahu deliahu removed the v0.13 label Dec 20, 2019
@deliahu deliahu assigned vishalbollu and unassigned vishalbollu Dec 20, 2019
@deliahu deliahu added v0.14 and removed v0.13 labels Jan 2, 2020
@deliahu deliahu mentioned this issue Feb 14, 2020
4 tasks
@deliahu deliahu added this to the v0.14 milestone Nov 26, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

3 participants