This repository has been archived by the owner on Nov 1, 2023. It is now read-only.

Delete nodes on decommission instead of just releasing scale in protection #2586

Merged Nov 1, 2022 (1 commit)


tevoinea
Member

Summary of the Pull Request

What is this about?

We want to discard nodes as soon as they finish their assigned tasks. This differs from the current behavior, where we try to reuse nodes by reimaging them. Originally, the discard process worked by removing scale-in protection from the node and letting autoscale take care of the rest. The problem with that approach is that scale-in actions only happen when the work queue for the pool is empty, so relying on autoscale to delete the nodes won't work for a system under load.

This PR changes our approach to deleting nodes right away instead of reimaging them or waiting for scale-in actions. The new behavior stays behind the "NodeDisposalStrategies" env var until we decide to enable it by default.
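
For readers who want the decision spelled out, here is a rough sketch of the logic described above. It is not the code from this PR. The comma-separated format of the env var, the "decommission" value (borrowed from the linked issue title), and every function and enum name are assumptions made for illustration only.

```python
import os
from enum import Enum
from typing import Mapping, Optional


class Action(Enum):
    """Hypothetical outcomes for a node that has finished its tasks."""
    REIMAGE = "reimage"  # current default: reuse the node
    DELETE = "delete"    # behavior described in this PR: remove it right away


def strategy_enabled(name: str, env: Optional[Mapping[str, str]] = None) -> bool:
    # Assumption: the env var holds a comma-separated list of strategy names.
    env = os.environ if env is None else env
    raw = env.get("NodeDisposalStrategies", "")
    enabled = {part.strip().lower() for part in raw.split(",") if part.strip()}
    return name.lower() in enabled


def disposal_action(node_done: bool,
                    env: Optional[Mapping[str, str]] = None) -> Optional[Action]:
    """Pick what to do with a node; None means it is still working."""
    if not node_done:
        return None
    if strategy_enabled("decommission", env):
        return Action.DELETE
    return Action.REIMAGE


def old_approach_removes_node(queue_length: int) -> bool:
    # Previous approach: the unprotected node only goes away when autoscale
    # scales in, which requires an empty work queue for the pool.
    return queue_length == 0
```

With a non-empty queue, `old_approach_removes_node(5)` returns `False`, which is the under-load case where finished nodes would linger. `disposal_action` does not look at the queue at all.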

@tevoinea tevoinea linked an issue Oct 31, 2022 that may be closed by this pull request
@tevoinea tevoinea merged commit 33e2d5c into microsoft:main Nov 1, 2022
@AdamL-Microsoft AdamL-Microsoft mentioned this pull request Nov 3, 2022
@ghost ghost locked as resolved and limited conversation to collaborators Dec 1, 2022
Development

Successfully merging this pull request may close these issues.

Decommission node strategy should delete nodes
4 participants