feat(runners): add configurable eviction strategy to idle config #3375

maschwenk · 2023-07-20T19:50:45Z

We do some on-instance caching so when we scale down we'd prefer to keep the older instances around instead of the new ones (because they will have a hotter cache). This adds a configurable setting to the idleConfig to pick a sorting strategy. Never contributed to this repo, so please tell me if I'm doing something wrong!
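A minimal sketch of the idea described above, for readers who want to see the ordering concretely. This is not the code from the PR: the type names, the strategy values `oldest_first` / `newest_first`, and the `launchTime` field are assumptions made for illustration.

```typescript
// Hypothetical names; the real types live in the control-plane lambda.
type EvictionStrategy = 'oldest_first' | 'newest_first';

interface IdleRunner {
  instanceId: string;
  launchTime: Date;
}

// Returns a copy ordered so that the runners at the front are terminated first.
function sortByEvictionStrategy(runners: IdleRunner[], strategy: EvictionStrategy): IdleRunner[] {
  const direction = strategy === 'oldest_first' ? 1 : -1;
  return [...runners].sort((a, b) => direction * (a.launchTime.getTime() - b.launchTime.getTime()));
}

const idleRunners: IdleRunner[] = [
  { instanceId: 'i-warm-cache', launchTime: new Date('2023-07-20T08:00:00Z') },
  { instanceId: 'i-cold-cache', launchTime: new Date('2023-07-20T12:00:00Z') },
];

// With 'newest_first' the most recently launched instance is evicted first,
// so the older instance with the hotter cache survives longer.
const evictionOrder = sortByEvictionStrategy(idleRunners, 'newest_first').map((r) => r.instanceId);
console.log(evictionOrder);
```

Sorting a copy rather than the input keeps the helper free of side effects, which makes it easier to unit test.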

lambdas/functions/control-plane/src/scale-runners/scale-down.ts

maschwenk · 2023-07-21T17:48:17Z

@npalm I rebased and fixed the prettier failures as well as one typecheck failure. I'm running into failures for tests because of missing type coverage. Will wait to make sure you approve of the approach before proceeding to write tests.

npalm · 2023-07-23T09:45:37Z

> @npalm I rebased and fixed the prettier failures as well as one typecheck failure. I'm running into failures for tests because of missing type coverage. Will wait to make sure you approve of the approach before proceeding to write tests.

Thanks for the PR. Will check the approach ASAP.

GuptaNavdeep1983 · 2023-07-24T07:03:26Z

@maschwenk one of the test cases is failing for this PR. Can you please fix that?

lambdas/functions/control-plane/src/scale-runners/scale-down.ts

npalm

The approach looks good to me. I understand the idea of keeping caches around, but we see this as a risk as well. Supporting this eviction strategy is fine as long as you ensure you add a test case.

maschwenk · 2023-07-24T14:25:07Z

@npalm 👍🏼 sounds good. Will fix.

maschwenk · 2023-07-24T17:47:41Z

@npalm I've added a test to make sure the default config is extracted correctly

@GuptaNavdeep1983 took on your suggestion as well 👍🏼
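As a rough illustration of what "the default config is extracted correctly" could mean, here is a sketch that reads a JSON idle config and falls back to a default strategy. The function name, the `idleCount` field, and the choice of `oldest_first` as the default are assumptions, not taken from the PR.

```typescript
// Hypothetical shapes and names, for illustration only.
type EvictionStrategy = 'oldest_first' | 'newest_first';

interface IdleConfigEntry {
  idleCount: number;
  evictionStrategy?: EvictionStrategy;
}

// Parses the raw JSON config and returns the first configured strategy,
// or the fallback when no entry sets one (or no config is given at all).
function resolveEvictionStrategy(
  rawConfig: string | undefined,
  fallback: EvictionStrategy = 'oldest_first',
): EvictionStrategy {
  const entries: IdleConfigEntry[] = rawConfig ? JSON.parse(rawConfig) : [];
  const configured = entries.find((entry) => entry.evictionStrategy !== undefined);
  return configured?.evictionStrategy ?? fallback;
}

console.log(resolveEvictionStrategy(undefined));
console.log(resolveEvictionStrategy('[{"idleCount":1,"evictionStrategy":"newest_first"}]'));
```

A test for the default case then only needs to assert the fallback value for an empty or missing config.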

lambdas/functions/control-plane/src/scale-runners/scale-down-config.ts

maschwenk · 2023-07-25T15:16:22Z

@npalm Ahh I just realized it's failing due to the test coverage? I couldn't figure out why before. Happy to add a bit of coverage.

maschwenk · 2023-07-26T01:45:36Z

@npalm Added a test in lambdas/functions/control-plane/src/scale-runners/scale-down.test.ts to bring coverage up to an acceptable level

maschwenk · 2023-07-26T19:02:33Z

@npalm 😅 the Code health thing did not like some duplication I had, cleaned that up. This should be ready for review! 🙏🏼

maschwenk · 2023-07-28T12:21:42Z

@npalm should be ready for another look 🙏🏼 sorry for the churn

README.md

npalm

Looks good in general, a few remarks. Will test ASAP.

maschwenk · 2023-08-02T15:03:11Z

@npalm I think my comment above the eviction strategy was more confusing so I just moved it up into the paragraph above to give a more thoughtful explanation. Completely aside from the point of this PR, your note of:

> This helps keep your environment up-to-date and reduce problems like running out of disk space or RAM

The disk space clause makes sense to me, but I'd be curious how often your instances are carrying around "dead" RAM after jobs run on them? We definitely have RAM issues but because we don't run things containerized it often just toasts the box completely and we take it out of rotation versus leaking over time. Just curious about your experience there. Thanks for reviewing!

maschwenk · 2023-08-04T14:59:15Z

@npalm were you able to take a look?

npalm · 2023-08-04T16:08:22Z

> @npalm were you able to take a look?

Sorry, not yet. It also seems the coverage has dropped slightly. I will have a look early next week.

npalm · 2023-08-04T20:32:07Z

Just ran the test suite locally, but see no coverage error. Will have a check later.

npalm · 2023-08-05T11:20:28Z

> Just ran the test suite locally, but see no coverage error. Will have a check later.

Re-ran the build, seems fine now.

npalm

@maschwenk thx for your contribution

🤖 I have created a release *beep* *boop*

---

## [4.1.0](v4.0.2...v4.1.0) (2023-08-08)

### Features

* **runners:** add configurable eviction strategy to idle config ([#3375](#3375)) ([896f473](896f473))

### Bug Fixes

* **lambda:** bump the aws group in /lambdas with 5 updates ([#3413](#3413)) ([1acc8ba](1acc8ba))
* **runners:** retry aws metadata token download on Linux ([#3408](#3408)) ([ef46827](ef46827))

---

This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

Co-authored-by: forest-releaser[bot] <80285352+forest-releaser[bot]@users.noreply.github.com>

maschwenk · 2023-08-08T18:15:12Z

@npalm I'm seeing those failures you are getting on master, I think I have an idea of why they are happening. Will upstream a change if I figure it out.

npalm · 2023-08-08T20:40:13Z

Do you mean the release build on main? For some reason the build is randomly failing. I have no clue why (yet). The documentation build is another problem, and a less serious one. Do you have any clue why the release build is failing when building all lambdas?

maschwenk · 2023-08-08T21:16:14Z

> Do you mean the release build on main? For some reason the build is randomly failing. I have no clue why (yet). The documentation build is another problem, and a less serious one. Do you have any clue why the release build is failing when building all lambdas?

@npalm I don't yet, but my test was mutating process.env, so I was thinking perhaps the mutation was polluting global state. Haven't been able to repro locally. It's also odd because the stack-trace is pretty useless, not clear what the actual error is, just shows where it's coming from.
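One common way to rule out this kind of pollution is to scope environment changes to a single test and restore them afterwards. The helper below is a hypothetical sketch (the name `withEnv` is not from the repo) and only covers synchronous callbacks; `SCALE_DOWN_CONFIG` is used as the example variable.

```typescript
// Runs `fn` with temporary environment overrides and restores the previous
// values afterwards, even if `fn` throws. Synchronous callbacks only.
function withEnv<T>(overrides: Record<string, string | undefined>, fn: () => T): T {
  const previous: Record<string, string | undefined> = {};
  for (const key of Object.keys(overrides)) {
    previous[key] = process.env[key];
    const value = overrides[key];
    if (value === undefined) {
      delete process.env[key];
    } else {
      process.env[key] = value;
    }
  }
  try {
    return fn();
  } finally {
    // Put back exactly what was there before, including "unset".
    for (const key of Object.keys(previous)) {
      const value = previous[key];
      if (value === undefined) {
        delete process.env[key];
      } else {
        process.env[key] = value;
      }
    }
  }
}

const seenInside = withEnv({ SCALE_DOWN_CONFIG: '[]' }, () => process.env.SCALE_DOWN_CONFIG);
console.log(seenInside);
```

Because the restore step runs in `finally`, a failing assertion inside the callback does not leak the override into later tests.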

lambdas/functions/control-plane/src/scale-runners/scale-down.test.ts

- fix for randomly failing unit test after merging #3375
- add workspace config for vscode
- increase coverage

npalm · 2023-08-09T08:55:17Z

I think I have solved the issue in #3418

npalm · 2023-08-09T08:58:29Z

Not yet, still failing

npalm · 2023-08-09T21:13:55Z

@maschwenk I tried earlier today to fix the test. I am not able to reproduce the failing test locally. I assume it has something to do with the order. I added the SCALE_DOWN_CONFIG to the other tests to ensure the value is set correctly. At some point I got an error that the coverage was not meeting our settings. Fixed that as well. Thought it was all good. But now main is failing again. So I am wondering if you still have any idea.

maschwenk · 2023-08-10T14:52:09Z

@npalm still at a little bit of a loss. Is there any way we can get better logs out of the failing tests perhaps? Or maybe try forcing the Jest test to run in an order that will deterministically fail?

npalm · 2023-08-10T20:12:37Z

Yes we can set a few things in jest (at least)

- verbose -> true
- silent -> false
- define reporters and save the output as an artifact (retention 1 day)

I will also dig in a bit more next week.
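The settings listed above could look roughly like the following Jest configuration object. This is a sketch, not the repo's actual config: the `jest-junit` reporter and its output paths are assumptions, and the object is left un-exported here so the snippet stands alone.

```typescript
// Sketch of Jest options for more informative CI logs. In a real project this
// object would be the default export of jest.config.ts.
const jestDebugConfig = {
  verbose: true, // report each individual test together with its result
  silent: false, // keep console output from the tests visible in the log
  reporters: [
    'default',
    // Assumption: the jest-junit package is installed and writes an XML report to disk.
    ['jest-junit', { outputDirectory: 'reports', outputName: 'junit.xml' }],
  ],
};

console.log(JSON.stringify(jestDebugConfig, null, 2));
```

The report directory could then be uploaded from the CI job as a build artifact with a short retention, for example via the `retention-days` input of `actions/upload-artifact`.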

npalm · 2023-08-15T14:55:25Z

This PR should fix the issue: #3432

maschwenk · 2023-08-15T14:57:40Z

@npalm ❤️

maschwenk changed the title ~~Add eviction strategy to idle config~~ [feature] Add configurable eviction strategy to idle config Jul 20, 2023

maschwenk commented Jul 20, 2023

lambdas/functions/control-plane/src/scale-runners/scale-down.ts

maschwenk force-pushed the main branch from e38f9ca to df58e66 Compare July 20, 2023 19:55

maschwenk changed the title ~~[feature] Add configurable eviction strategy to idle config~~ [feat] Add configurable eviction strategy to idle config Jul 20, 2023

maschwenk force-pushed the main branch from df58e66 to 90c040d Compare July 20, 2023 19:59

maschwenk changed the title ~~[feat] Add configurable eviction strategy to idle config~~ feat: Add configurable eviction strategy to idle config Jul 20, 2023

maschwenk force-pushed the main branch from 90c040d to 6319dd1 Compare July 20, 2023 20:02

npalm self-requested a review July 21, 2023 04:19

maschwenk force-pushed the main branch 2 times, most recently from 2274baf to 0019895 Compare July 21, 2023 17:47

npalm reviewed Jul 24, 2023

lambdas/functions/control-plane/src/scale-runners/scale-down.ts

npalm reviewed Jul 24, 2023


maschwenk force-pushed the main branch from 0019895 to b26e628 Compare July 24, 2023 17:45

maschwenk commented Jul 24, 2023

lambdas/functions/control-plane/src/scale-runners/scale-down-config.ts

maschwenk commented Jul 24, 2023

lambdas/functions/control-plane/src/scale-runners/scale-down-config.ts

maschwenk force-pushed the main branch 2 times, most recently from ff3cbf0 to b24497b Compare July 26, 2023 01:45

maschwenk force-pushed the main branch from b24497b to 312165c Compare July 26, 2023 16:34

feat: Add eviction strategy for idle config

507b12e

maschwenk force-pushed the main branch from 312165c to 507b12e Compare July 26, 2023 16:37

Merge branch 'main' into main

bf1781d

npalm reviewed Aug 2, 2023

README.md

npalm reviewed Aug 2, 2023

README.md

npalm reviewed Aug 2, 2023


Fixup README to explain use-cases of different evictions strategies

af54495

Merge branch 'main' into main

46ebffe

npalm changed the title ~~feat: Add configurable eviction strategy to idle config~~ feat(runners): Add configurable eviction strategy to idle config Aug 5, 2023

Merge branch 'main' into main

6f479fd

npalm changed the title ~~feat(runners): Add configurable eviction strategy to idle config~~ feat(runners): add configurable eviction strategy to idle config Aug 8, 2023

npalm approved these changes Aug 8, 2023


npalm merged commit 896f473 into philips-labs:main Aug 8, 2023

forest-releaser bot mentioned this pull request Aug 8, 2023

chore(main): release 4.1.0 #3417

Merged

npalm reviewed Aug 9, 2023

lambdas/functions/control-plane/src/scale-runners/scale-down.test.ts

npalm mentioned this pull request Aug 9, 2023

chore: fix test and add vscode workspace config #3418

Merged

npalm added a commit that referenced this pull request Aug 9, 2023

chore: fix test and add vscode workspace config (#3418)

a5f58ae

- fix for randomly failing unit test after merging #3375
- add workspace config for vscode
- increase coverage
