
Should scalers reuse opened connection/clients? #1121

Closed
zroubalik opened this issue Sep 9, 2020 · 6 comments · Fixed by #2187

@zroubalik
Member

I have been thinking about this for some time. Currently we are creating a new Scaler (creating a client and opening a connection) in every cycle of the Scale Loop (after pollingInterval has passed), and we are doing the same in the Metrics Server whenever the HPA requests the metrics.

This has some benefits, e.g. if the pollingInterval is long, we don't need to keep the connection open, use memory resources, etc. And if I am not mistaken, ENV variables on the Target Deployment are re-evaluated in each cycle.

On the other hand, if the pollingInterval is not that long, we keep opening and closing the clients very often. This is not ideal from a performance perspective, and we can also flood the Event Source (e.g. a Kafka Broker) with many opened and closed connections.

What if we create the Scaler just once and keep a reference to it (and if there is an error while accessing this scaler, we can recreate it)? We should do this on both sides (in the Operator's Scale Loop and in the Metrics Server).

Are there any downsides or problems? Or any other ideas?
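To make the idea concrete, here is a minimal sketch in Go of caching one scaler and only rebuilding it after a failure. The Scaler interface, the cachedScaler type, and the method names below are illustrative stand-ins, not KEDA's actual implementation:

```go
// Illustrative sketch only: cache one scaler per ScaledObject and rebuild it
// only when a call fails. The Scaler interface here is a simplified stand-in
// for KEDA's real interface.
package scalercache

import "context"

type Scaler interface {
	IsActive(ctx context.Context) (bool, error)
	Close() error
}

type cachedScaler struct {
	build   func() (Scaler, error) // factory, e.g. something like buildScalers()
	current Scaler
}

// getOrBuild returns the cached scaler, creating it lazily on first use.
func (c *cachedScaler) getOrBuild() (Scaler, error) {
	if c.current != nil {
		return c.current, nil
	}
	s, err := c.build()
	if err != nil {
		return nil, err
	}
	c.current = s
	return s, nil
}

// IsActive reuses the cached scaler; on error the scaler is closed and dropped
// so the next polling cycle rebuilds it with a fresh client/connection.
func (c *cachedScaler) IsActive(ctx context.Context) (bool, error) {
	s, err := c.getOrBuild()
	if err != nil {
		return false, err
	}
	active, err := s.IsActive(ctx)
	if err != nil {
		_ = s.Close()
		c.current = nil
	}
	return active, err
}
```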

@zroubalik added the needs-discussion and feature-request labels Sep 9, 2020
@zroubalik
Member Author

CC @ahmelsayed @anirudhgarg

@arschles
Contributor

Would #1133 be a step toward this? The PR for it, #1251, creates a long-lived HTTP client that has an internal connection pool.
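For reference, a minimal sketch of that pattern (illustrative only, not the actual code from #1251): a single long-lived http.Client, whose Transport keeps an internal keep-alive connection pool, is reused across polling cycles instead of building a new client each time.

```go
// Illustrative sketch only, not the code from #1251: one long-lived HTTP
// client shared across polling cycles so its Transport can pool connections.
package httppool

import (
	"net/http"
	"time"
)

// sharedClient is created once and reused; its Transport keeps idle
// keep-alive connections around, so repeated polls do not pay for a new
// TCP/TLS handshake every pollingInterval.
var sharedClient = &http.Client{
	Timeout: 10 * time.Second,
	Transport: &http.Transport{
		MaxIdleConns:        100,
		MaxIdleConnsPerHost: 10,
		IdleConnTimeout:     90 * time.Second,
	},
}

// fetchMetric is a hypothetical helper: every call goes through sharedClient
// instead of constructing a new client (and new connections) per poll.
func fetchMetric(url string) (*http.Response, error) {
	return sharedClient.Get(url)
}
```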

@zroubalik
Member Author

@arschles yeah, that's definitely a step forward 👍

@ahmelsayed
Contributor

Sorry for not replying to this earlier @zroubalik. I know I promised I would, but I wanted to take a look at all the scalers and how they implement connections, to get a better feel for what the best option here would be.

The following scalers are the ones that create a client/connection on New*() and implement a Close() method.

azure_eventhub scaler
azure_log_analytics scaler
gcp_pub_sub scaler
influxdb scaler
kafka scaler
liiklus scaler
mongo scaler
mysql scaler
postgresql scaler
rabbitmq scaler
redis scaler
redis_streams scaler

All other scalers don't use connections, or use HTTP-style clients that don't need to open/close connections.

Initially I thought a Scaler's lifetime could be long, and that the Scaler interface should make it easier for scaler authors to handle that scenario, hence the Close() method on the interface. However, as you mentioned, scalers are actually recreated on every pollingInterval now, and we always create and close them right away.

The reason to recreate the scalers on every pollingInterval was, I think, to make sure we always use the latest connection secrets/auth parameters, since we don't get notified of Secret or other changes. I think we can hash those values, though, and only recreate the scaler if they have changed. I did something similar for the external streaming gRPC scaler, so that if there are 1000 ScaledObjects pointing to the same gRPC server, they can all reuse the same connection instead of opening 1000 connections for no reason.
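A rough sketch of the hashing idea in Go, assuming the scaler's resolved configuration (trigger metadata plus resolved secrets/auth parameters) is available as a map of strings; the function name and shape are illustrative, not the actual KEDA code:

```go
// Illustrative sketch only: fingerprint the resolved scaler config so the
// cached scaler is rebuilt only when the config actually changes.
package scalercache

import (
	"crypto/sha256"
	"encoding/hex"
	"sort"
)

// configHash hashes the resolved trigger metadata plus secrets/auth
// parameters. Comparing the hash from the previous polling cycle with the
// current one tells us whether the cached scaler (and its connection) can be
// reused or must be closed and recreated.
func configHash(resolved map[string]string) string {
	keys := make([]string, 0, len(resolved))
	for k := range resolved {
		keys = append(keys, k)
	}
	sort.Strings(keys) // stable order so equal maps always hash equally

	h := sha256.New()
	for _, k := range keys {
		h.Write([]byte(k))
		h.Write([]byte{0}) // separator to avoid ambiguous concatenations
		h.Write([]byte(resolved[k]))
		h.Write([]byte{0})
	}
	return hex.EncodeToString(h.Sum(nil))
}
```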

We have had a few memory leaks (1, 2, 3) when those scalers were not closed, and there is actually one code path now that also misses closing them (#1608), so I'd also like to improve that and make sure the interface of buildScalers() forces the caller to deal with closing scalers once they are done.

It'd be a slightly large change, but I'll work on it.
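One way to make the caller responsible for cleanup, sketched here with hypothetical names (not the actual design that landed in #2187): have the build step return a cache object whose single Close() tears down every scaler it created.

```go
// Illustrative sketch only, not the design that landed in #2187: a cache
// object that owns all scalers built for one ScaledObject, so the caller has
// exactly one Close() to call.
package scalercache

// closer is the minimal slice of KEDA's Scaler interface needed here.
type closer interface {
	Close() error
}

// ScalersCache holds every scaler produced by a build step such as
// buildScalers(); returning it instead of a bare slice forces callers to
// think about its lifetime.
type ScalersCache struct {
	scalers []closer
}

// Close closes all cached scalers and reports the first error encountered,
// so no individual scaler can be forgotten on any code path.
func (c *ScalersCache) Close() error {
	var firstErr error
	for _, s := range c.scalers {
		if err := s.Close(); err != nil && firstErr == nil {
			firstErr = err
		}
	}
	c.scalers = nil
	return firstErr
}
```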

@mboutet
Contributor

mboutet commented Feb 19, 2021

@ahmelsayed, you can also add 1543642 to your memory leaks list.

@tomkerkhove added this to the v2.5.0 milestone Aug 17, 2021
@tomkerkhove
Member

@ahmelsayed is working on this one and will aim for v2.5 🎉

ahmelsayed added a commit that referenced this issue Oct 12, 2021
Closes #1121

Signed-off-by: Ahmed ElSayed <ahmels@microsoft.com>
ahmelsayed added a commit that referenced this issue Nov 9, 2021
* Add ScalersCache to reuse scalers unless they need changing

Closes #1121

Signed-off-by: Ahmed ElSayed <ahmels@microsoft.com>