
Adding support for capacity buffer #39

Merged
merged 18 commits into drone:master on Jul 30, 2019

Conversation

jones2026
Contributor

This is to enable the drone autoscaler to have standby capacity ready so you can have warm instances before scaling is needed. This will help avoid builds waiting on nodes to be provisioned and should not affect the normal operation of the autoscaler if you do not want to use this feature.

@tboerger or @bradrydzewski let me know if you have any concerns or suggestions

Contributor

@tboerger left a comment

I don't get why you changed the drone config, but at first glance that LGTM

@bradrydzewski
Member

bradrydzewski commented Jul 4, 2019

I would be ok with changing the default behavior so that when the autoscaler starts, it immediately creates the min number of servers set via DRONE_POOL_MIN. This seems to be how most people expect it to work anyway and it slightly simplifies the implementation by not adding a new configuration parameter.

engine/planner.go (outdated review thread, resolved)
@jones2026
Contributor Author

I would be ok with changing the default behavior so that when the autoscaler starts, it immediately creates the min number of servers set via DRONE_POOL_MIN. This seems to be how most people expect it to work and it (slightly) simplifies the implementation.

While this change might accomplish that as a side effect (I wasn't actually thinking of that when I made it), it is really meant to prevent builds from queuing by proactively spinning up a new instance ahead of time, so it is ready before builds are actually waiting. It won't stop all queuing, but it adds a buffer that I'm hoping removes the majority of it.

As I think this over, though, I could accomplish a similar outcome by increasing DRONE_POOL_MIN_AGE above the default; most of the queuing would then be concentrated in the morning, when builds historically start to ramp up, and should be reduced for the remainder of the day. Really, this change just enables the choice between being purely reactive to demand and being slightly proactive, with the fallback of still reacting when demand exceeds DRONE_STANDBY_CAPACITY.

@jones2026
Contributor Author

Switched the variable from DRONE_STANDBY_CAPACITY to DRONE_POOL_STANDBY_CAPACITY.

Thinking that might make it clearer what this is used for?

@jones2026
Contributor Author

I don't get why you changed the drone config, but at first glance that LGTM

@tboerger, sorry, I have a habit of running go fmt and drone fmt whenever I work on things, and that added the extra unwanted changes to this PR. I have reverted the files I didn't actually want to change.

@bradrydzewski
Member

@jones2026 ok I think I get it. Is the goal to enable always having a little extra capacity instead of the exact capacity? For example:

  • warm count is 1, min server count is 2, current demand is for 0 servers. 2 servers are provisioned
  • warm count is 1, min server count is 2, current demand is for 3 servers. 4 servers are provisioned (3 servers to meet current demand + 1 warm instance)
  • warm count is 1, max server count is 5, current demand is for 5 servers, 5 servers are provisioned (warm instance is ignored to prevent exceeding max)

Am I understanding correctly? And does this cover all the permutations? Let's make sure we have a unit test for each permutation as well.
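
For illustration only, here is a minimal sketch of how a capacity buffer could produce the counts in the examples above. The function name, parameters, and the concurrency-of-1 assumption are hypothetical and are not the actual engine/planner.go implementation.

```go
package main

import (
	"fmt"
	"math"
)

// planServers is an illustrative sketch, not the actual engine/planner.go
// logic: it turns pending demand plus a warm buffer into a server count,
// clamped to the configured pool minimum and maximum.
func planServers(demand, buffer, concurrency, min, max int) int {
	// Round the desired capacity (pending builds + buffer) up to whole servers.
	want := int(math.Ceil(float64(demand+buffer) / float64(concurrency)))

	// Never fall below the pool minimum.
	if want < min {
		want = min
	}
	// Never exceed the pool maximum, even if that drops the warm buffer.
	if want > max {
		want = max
	}
	return want
}

func main() {
	// Concurrency of 1 keeps capacity and server counts aligned with the
	// server-count examples above.
	fmt.Println(planServers(0, 1, 1, 2, 5)) // demand 0 + buffer 1, min 2 -> 2
	fmt.Println(planServers(3, 1, 1, 2, 5)) // demand 3 + buffer 1       -> 4
	fmt.Println(planServers(5, 1, 1, 2, 5)) // demand 5 + buffer 1, max 5 -> 5
}
```

With a concurrency greater than 1, the same calculation would apply to build capacity rather than raw server counts, which is the distinction discussed below.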

Thanks for the pull request, and just to let you know, I'm going to be traveling today and tomorrow so my replies may be delayed.

@jones2026
Contributor Author

jones2026 commented Jul 4, 2019

@bradrydzewski no worries, I honestly didn't expect any replies today!

That is exactly what I was going for, except the current implementation in my PR is around server capacity (i.e. concurrency * number of servers) and not the number of actual servers.

Do you think it would be clearer to switch it to number of warm server instances instead of spare capacity? (Once we decide whether we think it's best for standby servers or standby capacity I will add all the permutations to the test)
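
As a rough illustration of that distinction (the numbers and names below are hypothetical, not taken from this PR), a capacity-based buffer only rounds up to whole servers at planning time:

```go
package main

import (
	"fmt"
	"math"
)

func main() {
	// Hypothetical values: each server runs 2 concurrent builds and the
	// buffer is expressed as 3 units of spare build capacity.
	concurrency := 2
	capacityBuffer := 3

	// A capacity buffer is more granular than a warm-server count: the 3
	// spare build slots only round up to whole servers when planning.
	warmServers := int(math.Ceil(float64(capacityBuffer) / float64(concurrency)))
	fmt.Println(warmServers) // prints 2
}
```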

@bradrydzewski
Member

do you think it would be clearer to switch it to number of warm server instances instead of spare capacity

Sorry for the delayed reply. I went back and forth on this. I think both approaches could work just fine. I think capacity is more granular and therefore makes more sense.

In terms of variable names, I think using something like DRONE_CAPACITY_BUFFER could be a good option. The DRONE_POOL_ variables deal with instance counts as opposed to capacity, which could cause some confusion.

I think overall this looks good. Once we have the additional unit tests in place we should be all set :)
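
For reference, here is a minimal sketch of how the new variable might be read, assuming the feature should stay off when DRONE_CAPACITY_BUFFER is unset; the helper below is hypothetical and is not the config loading merged in this PR.

```go
package main

import (
	"fmt"
	"os"
	"strconv"
)

// capacityBuffer is an illustrative sketch of reading DRONE_CAPACITY_BUFFER
// from the environment, defaulting to 0 so the buffer has no effect unless
// it is explicitly configured. It is not the autoscaler's actual config code.
func capacityBuffer() int {
	raw := os.Getenv("DRONE_CAPACITY_BUFFER")
	if raw == "" {
		return 0
	}
	n, err := strconv.Atoi(raw)
	if err != nil || n < 0 {
		return 0
	}
	return n
}

func main() {
	os.Setenv("DRONE_CAPACITY_BUFFER", "2")
	fmt.Println(capacityBuffer()) // prints 2
}
```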

@jones2026 changed the title from "Adding support for standby capacity" to "Adding support for capacity buffer" on Jul 28, 2019
@jones2026
Contributor Author

@bradrydzewski I updated the variable name and added tests for the other permutations you mentioned above. Let me know if you see any other issues.

@bradrydzewski merged commit e5184ab into drone:master on Jul 30, 2019
@bradrydzewski
Member

Thanks for this. I also updated the documentation accordingly:
https://autoscale.drone.io/reference/drone-capacity-buffer/
