Kafka scaler should concurrently query brokers and partitions for their message offsets #2377

bpinske · 2021-12-01T22:25:02Z

Proposal

Currently, the kafka scaler serially queries kafka brokers for partitions one at a time. For very large topics with many hundreds of partitions, this can be quite slow. During this slow query, the controller-manager hpa loop blocks preventing other HPA actions from taking place.

This has previously been optimized with only querying each broker once for the entire list of all partitions held by the broker, but ultimately this process remains serial.

All brokers should be concurrently queried to minimize time to calculate the total number of unprocessed messages in a topic.

Use-Case

As of now, the kafka scaler is non-performant when multiple kafka-based scaledObjects are querying large topics.

This change would make the kafka scaler usable.

Anything else?

VerstraeteBert · 2021-12-17T00:35:56Z

Hey @bpinske

I've tried my hand at implementing this. Haven't had the chance to properly test this myself, feel free to try it out :-). Note that I'm quite a golang novice still, especially its concurrency patterns.

bpinske added feature-request All issues for new features that have not been committed to needs-discussion labels Dec 1, 2021

bpinske changed the title ~~Kafka scaler should concurrently query partitions for~~ Kafka scaler should concurrently query partitions for their message offsets Dec 1, 2021

bpinske changed the title ~~Kafka scaler should concurrently query partitions for their message offsets~~ Kafka scaler should concurrently query brokers and partitions for their message offsets Dec 1, 2021

VerstraeteBert mentioned this issue Dec 17, 2021

Kafka scaler: concurrent offset fetches #2405

Merged

5 tasks

zroubalik mentioned this issue Dec 20, 2021

KEDA capacity is very limited with Kafka scaler #911

Closed

zroubalik closed this as completed in #2405 Jan 3, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kafka scaler should concurrently query brokers and partitions for their message offsets #2377

Kafka scaler should concurrently query brokers and partitions for their message offsets #2377

bpinske commented Dec 1, 2021 •

edited

Loading

VerstraeteBert commented Dec 17, 2021

Kafka scaler should concurrently query brokers and partitions for their message offsets #2377

Kafka scaler should concurrently query brokers and partitions for their message offsets #2377

Comments

bpinske commented Dec 1, 2021 • edited Loading

Proposal

Use-Case

Anything else?

VerstraeteBert commented Dec 17, 2021

bpinske commented Dec 1, 2021 •

edited

Loading