Add the recreateOption to the object template #253

mprahl · 2024-05-23T21:03:58Z

When a user needs to update an object's immutable fields, the object must be replaced. The user may opt in by setting recreateOption on an object template to "IfRequired" or "Always". When set to "IfRequired", normal updates proceed when possible.

Relates:
https://issues.redhat.com/browse/ACM-11846

yiraeChristineKim · 2024-05-28T13:44:03Z

controllers/configurationpolicy_controller.go

+			} else {
+				removeFieldsForComparison(dryRunUpdatedObj)
+
+				if reflect.DeepEqual(dryRunUpdatedObj.Object, existingObjectCopy.Object) {


Why does this branch log a mismatch message?

What do you mean?

If reflect.DeepEqual(dryRunUpdatedObj.Object, existingObjectCopy.Object) is true, the log message is "A mismatch was detected but a dry run update didn't make any changes. Assuming the object is compliant." Is this intended?

Yes, this is existing code but it was moved in this PR.

This code is detecting the case where the config-policy-controller thought there was a difference but the dry run update request showed that it was not different after all. This can happen when empty values are not shown in the API output but are set in the policy.

mprahl · 2024-05-28T13:44:29Z

/hold for reviews

yiraeChristineKim · 2024-05-28T13:50:59Z

controllers/configurationpolicy_controller.go

+				}
+
+				if time.Since(start) > time.Second*10 {
+					message = fmt.Sprintf(


This for loop doesn't delete the object, so should the message be changed?

This message is correct because it is only emitted when there is an error, and that error is caused by the object still existing.

JustinKuli

Mainly I'm wondering if the loop to retry the Create call is necessary.

JustinKuli · 2024-05-28T14:30:57Z

api/v1/configurationpolicy_types.go

+// RecreateOption describes the condition when to delete and recreate an object when an update is required. IfRequired
+// will recreate the object when updating an immutable field. Always will always recreate the object if a mismatch is
+// detected. RecreateOption has no effect when the remediationAction is inform. IfRequired has no effect on clusters
+// without dry run update support. Default is None.
+// +kubebuilder:validation:Enum=None;IfRequired;Always
+type RecreateOption string


What do you think of moving this docstring to inside the ObjectTemplate struct? I don't think it makes a difference for the CRD, but I think it will appear more often via CodeLens when it's defined on a struct field, as opposed to the type.

Also, are you opposed to //+kubebuilder:default=None?

You're right. The docstring works in the suggested location.

I'll go ahead and add //+kubebuilder:default=None. Note that we did this for pruneObjectBehavior and it led to unintended consequences: tests that check the whole ConfigurationPolicy failed, and the template-sync had to change how it determines whether the ConfigurationPolicy on the cluster matches.

Does template-sync handle it nicely now, or do you think there will be more changes needed? I thought it had been working nicely for some OperatorPolicy fields like this

This is the code that needs updating but it's not a big deal. I have the local changes queued up. Just need to run the tests once this merges:
https://github.com/open-cluster-management-io/governance-policy-framework-addon/blob/f299b7020823b3a09ab872ea4731080b42ff09d8/controllers/templatesync/template_sync.go#L959-L967

Reading that makes me think that template-sync might be thinking it needs to do more updates to OperatorPolicy than it does. I thought I had checked and it wasn't stuck in a loop at least, but I'll need to investigate more
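
As an illustration of the two suggestions in this thread, the sketch below moves the docstring onto a struct field and adds the default marker. The constant names, the JSON tag, and the reduced ObjectTemplate struct are assumptions made for the example, not the merged code. The small main function also shows why a server-side default can surprise a comparison such as the one template-sync performs: the serialized object gains a field that the desired object never set.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// RecreateOption is the string enum from the diff above.
// +kubebuilder:validation:Enum=None;IfRequired;Always
type RecreateOption string

// Constant names are assumed for this sketch.
const (
	None       RecreateOption = "None"
	IfRequired RecreateOption = "IfRequired"
	Always     RecreateOption = "Always"
)

// ObjectTemplate is reduced to the single field under discussion.
type ObjectTemplate struct {
	// RecreateOption describes the condition when to delete and recreate an
	// object when an update is required. IfRequired will recreate the object
	// when updating an immutable field. Always will always recreate the
	// object if a mismatch is detected. Default is None.
	// +kubebuilder:default=None
	RecreateOption RecreateOption `json:"recreateOption,omitempty"`
}

// mustJSON serializes a template and panics on failure (illustration only).
func mustJSON(t ObjectTemplate) string {
	b, err := json.Marshal(t)
	if err != nil {
		panic(err)
	}
	return string(b)
}

func main() {
	desired := ObjectTemplate{}                    // what the policy author wrote
	stored := ObjectTemplate{RecreateOption: None} // after the CRD default is applied

	fmt.Println("desired:", mustJSON(desired))
	fmt.Println("stored: ", mustJSON(stored))
	fmt.Println("byte-for-byte equal:", mustJSON(desired) == mustJSON(stored))
}
```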

JustinKuli · 2024-05-28T14:55:20Z

controllers/configurationpolicy_controller.go

+
+			start := time.Now()
+
+			for {


I'm worried about policy latency (if that's the right term) with this loop.

Say I have x number of these recreate policies, but the objects they work with have finalizers. Then I think each ends up waiting 10 seconds every config-policy-controller evaluation loop. There is some concurrency, c, (by default 2 goroutines I think), but it means that the loop takes a minimum of floor(x/c)*10 seconds. They could degrade the performance of the other policies in the cluster, since they would have to wait that long between evaluations.

What happens if there isn't a loop, can it just try the Create immediately after the Delete call returns, and if it fails just get it on the next evaluation? I think the shouldEvaluatePolicy logic could check for this to ensure it keeps getting evaluated.

@JustinKuli I'll have it try three times instead and then give up. My concern is that the deletion may take only a couple of seconds, but if the config-policy-controller is saturated, it could then be a long time before the object is recreated.

The shouldEvaluatePolicy logic already immediately schedules a policy with the timeout status message.

When a user needs to update an object's immutable fields, the object must be replaced. The user may opt-in to setting recreateOption on an object template to "IfRequired" and "Always". When set to "IfRequired", normal updates proceed when possible. Relates: https://issues.redhat.com/browse/ACM-11846 Signed-off-by: mprahl <mprahl@users.noreply.github.com>

openshift-ci · 2024-05-28T15:37:33Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: JustinKuli, mprahl

Needs approval from an approver in each of these files:

~~OWNERS~~ [JustinKuli,mprahl]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

mprahl · 2024-05-28T15:47:12Z

/unhold

openshift-ci bot added the dco-signoff: yes label May 23, 2024

openshift-ci bot requested review from JustinKuli and yiraeChristineKim May 23, 2024 21:04

openshift-ci bot added the approved label May 23, 2024

mprahl force-pushed the recreate branch from 2c90e80 to 3a3bca2 Compare May 28, 2024 12:29

mprahl commented May 28, 2024

yiraeChristineKim reviewed May 28, 2024

mprahl force-pushed the recreate branch from 3a3bca2 to e0ef2bc Compare May 28, 2024 13:44

openshift-ci bot added the do-not-merge/hold label May 28, 2024

yiraeChristineKim reviewed May 28, 2024

yiraeChristineKim previously approved these changes May 28, 2024

openshift-ci bot assigned yiraeChristineKim May 28, 2024

openshift-ci bot added the lgtm label May 28, 2024

JustinKuli reviewed May 28, 2024

dhaiducek reviewed May 28, 2024

mprahl dismissed yiraeChristineKim’s stale review via 91aa552 May 28, 2024 15:26

mprahl force-pushed the recreate branch from e0ef2bc to 91aa552 Compare May 28, 2024 15:26

openshift-ci bot removed the lgtm label May 28, 2024

mprahl requested review from JustinKuli, dhaiducek and yiraeChristineKim May 28, 2024 15:26

JustinKuli approved these changes May 28, 2024

openshift-ci bot assigned JustinKuli May 28, 2024

openshift-ci bot added the lgtm label May 28, 2024

openshift-ci bot removed the do-not-merge/hold label May 28, 2024

openshift-merge-bot bot merged commit 0bb6329 into open-cluster-management-io:main May 28, 2024
9 checks passed

This was referenced May 28, 2024

🤖 Sync from open-cluster-management-io/config-policy-controller: #255, #253 stolostron/config-policy-controller#878

Merged

😿 Failed to sync the upstream PRs: #255, #253 stolostron/config-policy-controller#879

Closed

dhaiducek mentioned this pull request May 29, 2024

Sync Config/Operator CRDs open-cluster-management-io/governance-policy-addon-controller#156

Merged

dhaiducek mentioned this pull request Jun 6, 2024

Sync OperatorPolicy CRD complianceConfig field open-cluster-management-io/governance-policy-addon-controller#159

Merged
