Initial support for CRDs (upgrade) policies #250

alex-berger · 2021-04-19T11:43:17Z

This PR adds preliminary support for Upgrading CRDs which are part of Helm Charts managed via HelmRelease objects.

See also fluxcd/flux2#1071 for more information.

Background & Motivation

Helm still does not provide any built-in solution to the CRD upgrade problem, which is well-known and
documented:

There is no support at this time for upgrading or deleting CRDs using Helm. This was an explicit decision after much community discussion due to the danger for unintentional data loss. Furthermore, there is currently no community consensus around how to handle CRDs and their lifecycle. As this evolves, Helm will add support for those use cases.

This limitation of Helm makes GitOps style "day 2 operations" of Kubernetes resources managed by
HelmRelease objects very difficult if not to say impossible. Currently, we have to manually
upgrade CRDs from HelmCharts referenced by HelmReleases, which is cumbersome and needs very
tight (timing) coordination between the commit to the GitOps repository and the manual CRD
upgrade on all systems that observe/apply that commit. This apporach is not only very error
prone it also might cause unnecessary long downtime of services.

Note, most of the Helm Charts that we install and operate are created and maintained by
3rd parties and are not under our control. Extracting all CRDs from every Helm Chart
upon each new release and manually applying those CRD resources is, as mentioned above,
very time intensive, error prone and cumbersome.

Our observation is, that most CRD upgrades are non-critical and only evolve an existing CRD
in a backward compatible fashion. Therefore, I am wondering whether it would make sense to
extends the HelmRelease resources with an opt-in flag to automatically upgrade CRD objects
(if needed).

alex-berger · 2021-04-19T12:19:32Z

Unfortunately, I run out of ideas how to make the e2e test working with pull request. E2e tests work on my feature branches which are real branches that can be referenced by the GitRepository, but pull request are not real branches so it fails.

hiddeco

This already looks great, couple of minor comments and a suggestion that I would like to discuss. 🏅

internal/runner/runner.go

docs/spec/v2beta1/helmreleases.md

internal/runner/runner.go

api/v2beta1/helmrelease_types.go

hiddeco

Looks like the tests do now work?

docs/spec/v2beta1/helmreleases.md

internal/runner/runner.go

alex-berger · 2021-04-20T11:59:04Z

Looks like the tests do now work?

As mentioned above the test run on branches and tags, but unfortunately not on pull-requests. See my feature branch (the source branch of the PR at https://github.com/alex-berger/helm-controller/actions for test results).

I have not yet figured out whether there is a way to reference a GitHub pull-request as branch (I wasted 6 hours trying to find a way, but did not find one yet). As my time is limited I have given up and disabled the (new) tests for pull-requests, such that they only run on branches (and maybe tags).

Signed-off-by: Alexander Berger <alex-berger@gmx.ch>

hiddeco

I am OK with the e2e approach for now, we should move many of the tests to Go anyway, as has been done with a lot of kustomize-controller.

Anyway, thanks a lot @alex-berger. 💯 🌻

hiddeco · 2021-04-21T16:32:51Z

internal/runner/runner.go

@@ -58,7 +70,7 @@ type Runner struct {
 // namespace configured to the provided values.
 func NewRunner(getter genericclioptions.RESTClientGetter, storageNamespace string, logger logr.Logger) (*Runner, error) {
 	runner := &Runner{
-		logBuffer: NewLogBuffer(NewDebugLog(logger).Log, 0),
+		logBuffer: NewLogBuffer(NewDebugLog(logger).Log, 100),


I somehow missed this during review, and this change is not related to any of the CRD things. Can you explain why this was changed? As it changes the default of including the last 5 log lines (after de-duplication) in a failure condition and/or event to 100, which makes any failed event we produce now quite verbose.

I increased it for debugging during feature development and eventually forgot to decrease it again. 😳

Thanks, that means my restore PR is justified and I can continue with the release :-)

joejulian · 2021-04-21T16:53:14Z

Looks like I'm moments late to comment on this, but has the justifications for helm, itself, not supporting this been considered? https://github.com/helm/community/blob/main/hips/hip-0011.md

hiddeco · 2021-04-21T16:58:29Z

They have been considered, which is why we default to not enable the behavior as it can be potentially disastrous.

However, given that people use the helm-controller to automatically drive operations, having access to a configuration option that does the right thing if you know how your chart (or the application behind it) behaves on Custom Resource Definition upgrades, it is in my opinion justified.

joejulian · 2021-04-21T20:58:44Z

This feature would need to be able to add the conversion webhooks simultaneously. Links to them are defined in the CRD itself: see the cert-manager CRDs for example. Without them, the API server will reject create/change verbs to those CRDs.

alex-berger force-pushed the feature/crd-upgrade branch from e4d1931 to d54ac8b Compare April 19, 2021 12:02

hiddeco reviewed Apr 19, 2021

View reviewed changes

internal/runner/runner.go Outdated Show resolved Hide resolved

docs/spec/v2beta1/helmreleases.md Outdated Show resolved Hide resolved

internal/runner/runner.go Outdated Show resolved Hide resolved

api/v2beta1/helmrelease_types.go Outdated Show resolved Hide resolved

alex-berger force-pushed the feature/crd-upgrade branch 3 times, most recently from 7dac7b8 to e32e706 Compare April 20, 2021 08:09

stefanprodan added the enhancement New feature or request label Apr 20, 2021

hiddeco reviewed Apr 20, 2021

View reviewed changes

docs/spec/v2beta1/helmreleases.md Outdated Show resolved Hide resolved

internal/runner/runner.go Outdated Show resolved Hide resolved

alex-berger added 4 commits April 20, 2021 14:21

Initial support for HelmRelease for upgrading CRDs

a6cc150

Signed-off-by: Alexander Berger <alex-berger@gmx.ch>

Integrate feedback from hiddeco

defee3d

Signed-off-by: Alexander Berger <alex-berger@gmx.ch>

Add deprecation notice to SkipCRDs attribute.

fe766fb

Signed-off-by: Alexander Berger <alex-berger@gmx.ch>

Fix typos

4b60855

Signed-off-by: Alexander Berger <alex-berger@gmx.ch>

alex-berger force-pushed the feature/crd-upgrade branch from 31b92ca to 4b60855 Compare April 20, 2021 12:21

hiddeco approved these changes Apr 20, 2021

View reviewed changes

hiddeco merged commit 9a049a1 into fluxcd:main Apr 20, 2021

hiddeco reviewed Apr 21, 2021

View reviewed changes

hiddeco changed the title ~~Initial support for HelmRelease for upgrading CRDs~~ Initial support for CRDs (upgrade) policies Apr 21, 2021

hiddeco mentioned this pull request May 10, 2021

Give CRD policy precedence over skipCRDs field #261

Merged

alex-berger deleted the feature/crd-upgrade branch June 25, 2021 13:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Initial support for CRDs (upgrade) policies #250

Initial support for CRDs (upgrade) policies #250

alex-berger commented Apr 19, 2021

alex-berger commented Apr 19, 2021

hiddeco left a comment

hiddeco left a comment

alex-berger commented Apr 20, 2021 •

edited

Loading

hiddeco left a comment

hiddeco Apr 21, 2021

alex-berger Apr 21, 2021

hiddeco Apr 21, 2021

joejulian commented Apr 21, 2021

hiddeco commented Apr 21, 2021 •

edited

Loading

joejulian commented Apr 21, 2021

Initial support for CRDs (upgrade) policies #250

Initial support for CRDs (upgrade) policies #250

Conversation

alex-berger commented Apr 19, 2021

Background & Motivation

alex-berger commented Apr 19, 2021

hiddeco left a comment

Choose a reason for hiding this comment

hiddeco left a comment

Choose a reason for hiding this comment

alex-berger commented Apr 20, 2021 • edited Loading

hiddeco left a comment

Choose a reason for hiding this comment

hiddeco Apr 21, 2021

Choose a reason for hiding this comment

alex-berger Apr 21, 2021

Choose a reason for hiding this comment

hiddeco Apr 21, 2021

Choose a reason for hiding this comment

joejulian commented Apr 21, 2021

hiddeco commented Apr 21, 2021 • edited Loading

joejulian commented Apr 21, 2021

alex-berger commented Apr 20, 2021 •

edited

Loading

hiddeco commented Apr 21, 2021 •

edited

Loading