Upstream core Kubernetes HPA loop is single-threaded #2382

bpinske · 2021-12-04T03:21:12Z

Report

I am leveraging Keda to query Prometheus for metrics to use in scaling decisions. Core Kubernetes appears to have some poor performance when it comes to operation with many external metrics based queries. The core HPA loop is single-threaded and acts on a single HPA object at a time in a blocking fashion on every loop iteration.

This core HPA loop should be concurrent and process every HPA object of the cluster at once as there is no mutability of the HPA objects themselves caused by other HPA objects.

This has an associated upstream issue opened describing the issuehere

The single threaded nature can be found here

Expected Behavior

Kubernetes' core HPA loop to be performant

Actual Behavior

It's possible for scaling decisions to be greatly delayed due to the sequential blocking nature of the core HPA loop. HPA simply cannot process all HPA objects sequentially in a timely manner.

Steps to Reproduce the Problem

Create and install the https://github.com/kubernetes-sigs/custom-metrics-apiserver and alter its code to add a delay in the GetExternalMetric method of the Testing Provider
Also add logging to the method with timestamps logrus supports timestamps, e.g.
Create 1,000 HPA objects referring to that test metric - possibly 1,000 deployments as well, but one will work fine too even if not a real use case.
Pick one of your HPA's and watch for the time between its call to GetExternalMetric.

Logs from KEDA operator

.

KEDA Version

2.5.0

Kubernetes Version

1.20

Platform

Any

Scaler Details

All of them!

Anything else?

No response

zroubalik · 2021-12-07T09:44:39Z

Thanks for documenting this, we should try to work with SIG Autoscaling to tackle this.

stale · 2022-02-05T10:46:52Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 7 days if no further activity occurs. Thank you for your contributions.

stale · 2022-02-12T11:11:36Z

This issue has been automatically closed due to inactivity.

zroubalik · 2023-10-21T18:53:10Z

This has been fixed upstream.

bpinske added the bug Something isn't working label Dec 4, 2021

tomkerkhove added the upstream-integration All issues related to upstream Kubernetes/community label Dec 7, 2021

zroubalik mentioned this issue Dec 20, 2021

KEDA capacity is very limited with Kafka scaler #911

Closed

stale bot added the stale All issues that are marked as stale due to inactivity label Feb 5, 2022

tomkerkhove added this to Roadmap - KEDA Core Feb 10, 2022

tomkerkhove moved this to Backlog in Roadmap - KEDA Core Feb 10, 2022

stale bot closed this as completed Feb 12, 2022

Repository owner moved this from To Do to Ready To Ship in Roadmap - KEDA Core Feb 12, 2022

zroubalik added the stale-bot-ignore All issues that should not be automatically closed by our stale bot label Feb 12, 2022

zroubalik reopened this Feb 12, 2022

stale bot removed the stale All issues that are marked as stale due to inactivity label Feb 12, 2022

Repository owner moved this from Ready To Ship to Proposed in Roadmap - KEDA Core Feb 12, 2022

zroubalik mentioned this issue Apr 13, 2022

add --concurrent-horizontal-pod-autoscaler-syncs flag to kube-controller-manager kubernetes/kubernetes#108501

Merged

JorTurFer mentioned this issue Oct 19, 2022

Timeout or abort while handling GET external metrics with 1500 scaledobjects #3670

Closed

zroubalik closed this as completed Oct 21, 2023

github-project-automation bot moved this from Proposed to Ready To Ship in Roadmap - KEDA Core Oct 21, 2023

zroubalik moved this from Ready To Ship to Done in Roadmap - KEDA Core Oct 21, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upstream core Kubernetes HPA loop is single-threaded #2382

Upstream core Kubernetes HPA loop is single-threaded #2382

bpinske commented Dec 4, 2021

zroubalik commented Dec 7, 2021

stale bot commented Feb 5, 2022

stale bot commented Feb 12, 2022

zroubalik commented Oct 21, 2023

Upstream core Kubernetes HPA loop is single-threaded #2382

Upstream core Kubernetes HPA loop is single-threaded #2382

Comments

bpinske commented Dec 4, 2021

Report

Expected Behavior

Actual Behavior

Steps to Reproduce the Problem

Logs from KEDA operator

KEDA Version

Kubernetes Version

Platform

Scaler Details

Anything else?

zroubalik commented Dec 7, 2021

stale bot commented Feb 5, 2022

stale bot commented Feb 12, 2022

zroubalik commented Oct 21, 2023