ADR: Changing semantics of min runners to be min idle runners #3040

nikola-jokic · 2023-11-02T10:33:18Z

Proposing design for having two ways of handling minRunners field

Addresses #2707

TingluoHuang · 2023-11-02T13:04:40Z

docs/adrs/2023-11-02-min-runners-strategy.md

+
+With the "lazy" strategy, the current behavior of `minRunners` will be preserved. The `minRunners` field will specify the minimum number of runners running in a cluster regardless of their state ("running" or "idle").
+
+With the "eager" strategy, `minRunners` will be treated as `minIdleRunners`. This strategy will calculate the number of runners based on the number of workflows acquired plus the number of minRunners. If the `maxRunners` field is specified, it will be respected, so the number of idle runners can be less than the number of idle `minRunners` when the number of acquired jobs plus the number of min runners is greater than `maxRunners`.


since we don't stop acquire jobs after we reaching maxRunners, would this cause we creating too many runners?
also since 2 runner scale sets can acquire the same job, but only one of them will get run, will we cause too much wasting resource?

It should not because the patch from the listener should only be up to maxRunners regardless of how many are acquired

Here we can over-provision them if other scale set "steals" the job, but eventually we will scale down on the next message. The unlucky scenario here can be that no events are created after the job is stolen, so we may not scale down to the desired number of min runners. Good point...

We might also want to fix the scale down process if we start doing this since it might make the extra scale up/down even worse.

TingluoHuang · 2023-11-02T13:06:07Z

docs/adrs/2023-11-02-min-runners-strategy.md

+
+## Context
+
+Current implementation treats the `minRunners` field as the number of runners that should be running on your cluster. They can be busy running the job, or they can be idle. This ensures faster cold startup time when workflows are acquired as well as trying to use the minimum amount of runners needed to fulfill the scaling requirement.


i think runner pod kind of having 3 stage:

Running jobs

Idle, runner online to the service

Runner pod starting, runner is trying to be online to the service.

Good point! I'll add it (I merged this case with the idle one as it is not busy, but I should have been more precise)

nikola-jokic · 2023-11-28T11:38:35Z

Hey everyone,

The ADR has been changed not to introduce multiple scaling strategies, but rather to change the semantics of the field.

Link-

Let's ship it

ADR: Introducing min runner strategies

9b3ed94

nikola-jokic requested review from mumoshu, toast-gear and a team as code owners November 2, 2023 10:33

TingluoHuang reviewed Nov 2, 2023

View reviewed changes

nikola-jokic added 2 commits November 2, 2023 16:20

Include the third state as 'starting up'

e5fee14

Change semantics of min runners to min idle runners completely

b7f3097

nikola-jokic changed the title ~~ADR: Introducing min runner strategies~~ ADR: Changing semantics of min runners to be min idle runners Nov 28, 2023

Merge branch 'master' into nikola-jokic/adr-min-runner-strategy

2d01617

Link- approved these changes Nov 30, 2023

View reviewed changes

nikola-jokic merged commit 5347e2c into master Nov 30, 2023
12 checks passed

nikola-jokic deleted the nikola-jokic/adr-min-runner-strategy branch November 30, 2023 10:59

nikola-jokic added this to the gha-runner-scale-set-0.8.0 milestone Dec 18, 2023

nikola-jokic mentioned this pull request Dec 19, 2023

Prepare 0.8.0 release #3175

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ADR: Changing semantics of min runners to be min idle runners #3040

ADR: Changing semantics of min runners to be min idle runners #3040

nikola-jokic commented Nov 2, 2023

TingluoHuang Nov 2, 2023

nikola-jokic Nov 2, 2023

TingluoHuang Nov 2, 2023

TingluoHuang Nov 2, 2023

nikola-jokic Nov 2, 2023

nikola-jokic commented Nov 28, 2023

Link- left a comment


		With the "lazy" strategy, the current behavior of `minRunners` will be preserved. The `minRunners` field will specify the minimum number of runners running in a cluster regardless of their state ("running" or "idle").

		With the "eager" strategy, `minRunners` will be treated as `minIdleRunners`. This strategy will calculate the number of runners based on the number of workflows acquired plus the number of minRunners. If the `maxRunners` field is specified, it will be respected, so the number of idle runners can be less than the number of idle `minRunners` when the number of acquired jobs plus the number of min runners is greater than `maxRunners`.


		## Context

		Current implementation treats the `minRunners` field as the number of runners that should be running on your cluster. They can be busy running the job, or they can be idle. This ensures faster cold startup time when workflows are acquired as well as trying to use the minimum amount of runners needed to fulfill the scaling requirement.

ADR: Changing semantics of min runners to be min idle runners #3040

ADR: Changing semantics of min runners to be min idle runners #3040

Conversation

nikola-jokic commented Nov 2, 2023

TingluoHuang Nov 2, 2023

Choose a reason for hiding this comment

nikola-jokic Nov 2, 2023

Choose a reason for hiding this comment

TingluoHuang Nov 2, 2023

Choose a reason for hiding this comment

TingluoHuang Nov 2, 2023

Choose a reason for hiding this comment

nikola-jokic Nov 2, 2023

Choose a reason for hiding this comment

nikola-jokic commented Nov 28, 2023

Link- left a comment

Choose a reason for hiding this comment