sig-api-machinery KEP: Defaulting for Custom Resources #1006

sttts · 2019-04-26T10:38:06Z

This topic is unblocked by #1002.

sttts · 2019-04-26T10:39:17Z

/assign @deads2k @lavalamp @liggitt @mbohlool @apelisse

keps/sig-api-machinery/20190426-crd-defaulting.md

apelisse

Thanks for writing this Stefan. I'd just really like native types and CRD to look and feel very similar. You're not mentioning how kubebuilder would integrate with this feature (I think that could be useful).

keps/sig-api-machinery/20190426-crd-defaulting.md

sttts · 2019-04-27T09:22:48Z

@apelisse added a sentence about kubebuilder. It just needs another tag to define the default. Should be super straightforward. /cc @DirectXMan12

DirectXMan12

couple of comments inline, otherwise big 👍 from me

keps/sig-api-machinery/20190426-crd-defaulting.md

sttts · 2019-04-30T08:14:36Z

@DirectXMan12 addressed your comment.

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-01T00:40:53Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+
+![Decoding steps which must apply defaults](20190426-crd-defaulting-pipeline.png)
+
+We rely on the validation steps in the request pipeline to verify that the default value is of the right type.


would it be practicable to validate the default value at CRD create/update time? If possible, I'd like to avoid persisting fundamentally flawed CRDs

With structural schemas, yes. We can validate them in advance.

More precisely, we can verify the types, but not do value validation. So there will always be a chance that the default does not fulfill the latter. But as both validation and defaults are under the control of the same party, we should be fine.
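To illustrate the distinction between type checking and value validation, here is a minimal sketch of a type-only check that could run at CRD create/update time. The `Schema` struct and `typeMatches` function are hypothetical simplifications made up for this example, not the apiextensions types; value constraints (such as a minimum) are deliberately not checked.

```go
package main

import "fmt"

// Schema is a hypothetical, heavily simplified stand-in for a structural schema node.
type Schema struct {
	Type       string             // "object", "array", "string", "integer" or "boolean"
	Properties map[string]*Schema // child schemas for objects
	Items      *Schema            // item schema for arrays
}

// typeMatches reports whether a decoded JSON value has the type the schema declares.
// It checks types only; value validation is out of scope here.
func typeMatches(s *Schema, v interface{}) bool {
	switch s.Type {
	case "object":
		m, ok := v.(map[string]interface{})
		if !ok {
			return false
		}
		for k, child := range m {
			if ps, found := s.Properties[k]; found && !typeMatches(ps, child) {
				return false
			}
		}
		return true
	case "array":
		l, ok := v.([]interface{})
		if !ok {
			return false
		}
		for _, item := range l {
			if s.Items != nil && !typeMatches(s.Items, item) {
				return false
			}
		}
		return true
	case "string":
		_, ok := v.(string)
		return ok
	case "integer":
		f, ok := v.(float64) // decoded JSON numbers are float64 in Go
		return ok && f == float64(int64(f))
	case "boolean":
		_, ok := v.(bool)
		return ok
	}
	return false
}

func main() {
	// Mirrors the KEP example: an integer array with default [1].
	schema := &Schema{Type: "array", Items: &Schema{Type: "integer"}}
	fmt.Println(typeMatches(schema, []interface{}{float64(1)})) // true
	fmt.Println(typeMatches(schema, "one"))                     // false
}
```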

liggitt · 2019-05-01T00:47:27Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+
+We rely on the validation steps in the request pipeline to verify that the default value is of the right type.
+
+The `default` field in the CRD types is considered alpha quality. We will add a `CustomResourceDefaulting` feature gate. Values for `default` will be rejected if the gate is not enabled. 


if we anticipate this field being able to be populated by default in 1.16, we must not fail validation if we encounter this in existing objects. we would need to follow a process similar to https://github.com/kubernetes/community/blob/master/contributors/devel/sig-architecture/api_changes.md#alpha-field-in-existing-api-version

Since CRD validation has already been rejecting this field, the only tweak would be "Before persisting the object to storage, reject the disabled alpha field on create, and on update if the existing object does not already have a value in the field."

The important thing is that we allow/preserve data in the Default field on update if the existing object had data in that field, even if the alpha feature gate is disabled

sounds good

added ratcheting validation
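The ratcheting rule quoted above can be sketched as a small decision function. This is only an illustration: `validateDefaultField` and its boolean parameters are hypothetical names, and the check is collapsed to one flag per object rather than done per schema field.

```go
package main

import (
	"errors"
	"fmt"
)

// validateDefaultField is a hypothetical helper sketching ratcheting validation.
// On create, pass oldHasDefault=false because there is no existing object.
func validateDefaultField(gateEnabled, newHasDefault, oldHasDefault bool) error {
	if gateEnabled || !newHasDefault {
		return nil // gate on, or nothing to check
	}
	if oldHasDefault {
		// Update of an object that already used the field: allow and preserve
		// the data even though the alpha gate is disabled.
		return nil
	}
	return errors.New("default is forbidden while the CustomResourceDefaulting feature gate is disabled")
}

func main() {
	fmt.Println(validateDefaultField(false, true, false) != nil) // create, gate off: rejected
	fmt.Println(validateDefaultField(false, true, true) == nil)  // update, old object had a default: allowed
}
```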

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-01T00:50:02Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+       type: array
+       items:
+         type: integer
+       default: [1]


I went looking for the footnote...
...
it's been a long day

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-01T01:53:40Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+
+We do this in the serializer by passing a real defaulter to [`versioningserializer.NewCodec`](https://github.com/kubernetes/apimachinery/blob/master/pkg/runtime/serializer/versioning/versioning.go#L49) such that defaulting is done natively just after the binary payload has been unmarshalled into an `map[string]interface{}` and pruning of [KEP: Pruning for CustomResources](https://github.com/kubernetes/enhancements/pull/709) was done, compare the yellow boxes in the following figure:
+
+![Decoding steps which must apply defaults](20190426-crd-defaulting-pipeline.png)


I had a hard time telling from the diagram, what about after reading the response from a conversion webhook?

Like for native types: no defaulting after conversion.

what is persisted into etcd in this scenario?

crd has two versions (v1, v2)

v1 defaults field a to 1

v2 defaults field b to 2

v2 is the storage version

user submits v1 object with a and b unset

Clearly, a defaults to 1 as part of deserialize->default->validate of the user's request

What is less clear to me is if v2 defaulting (setting b to 2) is applied after conversion, before storing in etcd.

oops, crossed wires. that answered my question, thanks.

probably worth calling out explicitly (maybe even with that scenario). the v2 defaulting would get applied on the way out of storage, so from an API user's perspective, I think they would see the newly created object returned with both a:1, b:2 set, but only a:1 would be in etcd, right?

made this explicit:

Like for native resources, we do defaulting

* during request payload deserialization
* after mutating webhook admission
* during read from storage.

Note: like for native resources, we do not default after webhook conversions. Hence, webhook conversions must be complete in the sense that they return defaulted objects. Technically we could do defaulting, but to match native resources, we do not.
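The v1/v2 scenario discussed above can be walked through with a toy model. Everything here is an assumption made for illustration: the `object` type, the `defaultV1`/`defaultV2` functions, and a `convert` step that merely copies fields (as if both versions shared one shape). The mutating-admission defaulting step is left out for brevity.

```go
package main

import "fmt"

// object is a toy stand-in for an unstructured custom resource.
type object map[string]interface{}

// setIfAbsent applies a default only when the key is missing.
func setIfAbsent(o object, key string, val interface{}) {
	if _, ok := o[key]; !ok {
		o[key] = val
	}
}

func defaultV1(o object) { setIfAbsent(o, "a", 1) } // v1 defaults a to 1
func defaultV2(o object) { setIfAbsent(o, "b", 2) } // v2 defaults b to 2

// convert copies the object; in this toy model both versions have the same fields.
func convert(o object) object {
	out := object{}
	for k, v := range o {
		out[k] = v
	}
	return out
}

// create models a v1 request against a CRD whose storage version is v2.
func create(request object) (stored, returned object) {
	defaultV1(request)         // defaulting during request deserialization
	stored = convert(request)  // convert to v2; no defaulting after conversion
	returned = convert(stored) // read back from storage
	defaultV2(returned)        // defaulting during read from storage
	return stored, returned
}

func main() {
	stored, returned := create(object{})
	fmt.Println(stored)   // map[a:1]
	fmt.Println(returned) // map[a:1 b:2]
}
```

In this model only `a: 1` is persisted, while the API user sees both `a: 1` and `b: 2` in the response.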

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-01T03:35:17Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+2. recursively follow the given CustomResource instance and the structural schema, applying defaults where an object field is 
+  * undefined (`_, ok := obj[field]; !ok`)
+  * `nil` if the field not nullable
+  * empty in case of lists and maps, and if nullable is not set.


in case of lists

clarify how we determine this.

type: "array" in the schema?

_, ok := default.([]interface{})?

_, ok := value.([]interface{})?

all of the above?

and maps

clarify how we determine this.

type: "object" in the schema?

_, ok := default.(map[string]interface{})?

_, ok := value.(map[string]interface{})?

all of the above?

I mostly want to make sure defaulting doesn't replace an empty array or object with a default value of the correct type prior to validation and mask what should be reported to the user as a schema validation error

Made the cases explicit as you wrote down above.

Note: we could do type validation during pruning, i.e. as part of the deserialization process, to match native types.

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-01T14:20:28Z

keps/sig-api-machinery/20190426-crd-defaulting.md


-[Kubebuilder's crd-gen](https://github.com/kubernetes-sigs/controller-tools/tree/master/cmd/crd) can make use of this feature by adding another tag, e.g. `// +default=<arbitrary-json-value>`. Defaults are arbitrary JSON values, which must also validate and are not subject to pruning (defaulting happens after pruning). This is an implicit assumption that will be checked by the apiserver.
+[Kubebuilder's crd-gen](https://github.com/kubernetes-sigs/controller-tools/tree/master/cmd/crd) can make use of this feature by adding another tag, e.g. `// +default=<arbitrary-json-value>`. Defaults are arbitrary JSON values, which must also validate (types are checked during CRD creation and update, value validation is checked for requests, but not for etcd reads) and are not subject to pruning (defaulting happens after pruning).


is it possible to ensure the default value does not contain a field that would get dropped via pruning? this would help prevent typos. I'm willing to do more expensive things in CRD create/update validation to improve the user experience, as long as it falls out in a relatively straightforward way in the code

for a given JSONSchemaProps object, I was envisioning checks like this:

    if props.Default != null {
        reflect.DeepEqual(
            props.Default,
            prune(
                props.Default,
                makeStructuralSchema(props),
                isPreservingUnknownFields, /* from parent schemas or CRD field */
            ),
        )
        validate(props.Default, ConvertJSONSchemaProps(props))
    }

yeah, we can do that.
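A runnable toy version of the prune-and-compare idea might look like the following. The `Schema` type, `prune`, and `defaultSurvivesPruning` are hypothetical simplifications written for this example; they ignore `isPreservingUnknownFields` and the separate value validation step from the pseudocode.

```go
package main

import (
	"fmt"
	"reflect"
)

// Schema is a hypothetical, minimal structural schema: only the parts pruning needs.
type Schema struct {
	Properties map[string]*Schema // known object fields
	Items      *Schema            // item schema for arrays
}

// prune returns a copy of v with every object field dropped that the schema does not specify.
func prune(v interface{}, s *Schema) interface{} {
	if s == nil {
		return v
	}
	switch t := v.(type) {
	case map[string]interface{}:
		out := make(map[string]interface{}, len(t))
		for k, child := range t {
			if ps, ok := s.Properties[k]; ok {
				out[k] = prune(child, ps)
			}
		}
		return out
	case []interface{}:
		out := make([]interface{}, len(t))
		for i, item := range t {
			out[i] = prune(item, s.Items)
		}
		return out
	default:
		return v // scalars are never pruned
	}
}

// defaultSurvivesPruning reports whether pruning would leave the default untouched.
func defaultSurvivesPruning(def interface{}, s *Schema) bool {
	return reflect.DeepEqual(def, prune(def, s))
}

func main() {
	schema := &Schema{Properties: map[string]*Schema{"replicas": {}}}
	good := map[string]interface{}{"replicas": float64(1)}
	typo := map[string]interface{}{"replicsa": float64(1)}
	fmt.Println(defaultSurvivesPruning(good, schema)) // true
	fmt.Println(defaultSurvivesPruning(typo, schema)) // false
}
```

A default that changes under pruning contains a field the schema does not know, which is the typo case the check is meant to catch.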

liggitt · 2019-05-01T14:50:28Z

keps/sig-api-machinery/20190426-crd-defaulting.md

+   and for `array` type in the schema one of these:
+
+   * `if v, ok := obj[fld]; !ok` => default
+   * `else if !nullable && v == nil` => default


if I'm reading this correctly, we'd want a nullable field to skip all the checks after the first one, right? something like this?

* `if v, ok := obj[fld]; !ok` => default
* `else if nullable` => no default
* `else if v == nil` => default
...

same for object

@liggitt I read this as being very similar to what Stefan wrote, but harder to read?

if the schema specifies the field is nullable, and the current value is [], my example would not default, and stefan's would

I think we want to avoid treating nil and []/{} values differently for defaulting purposes

Oh I see what happened here.

If it's nil and non-nullable, shouldn't it be an error anyway?

Let's think in terms of UX behavior:

    {
      defaulted-list: []    # Do I want this defaulted? Not sure, I'm specifying a value after all
      ...
    }
    ---
    {
      defaulted-list: null  # Do I want this defaulted? Not sure, I'm specifying a value after all
      ...
    }
    ---
    {
      # defaulted-list should certainly be defaulted
    }

One of the principles we used in apply was that if users specified something, then we assume that it's what they want, and setting something to nil is different from not setting it at all.

If it's nil and non-nullable, shouldn't it be an error anyway?

That's a good point. We could default only if the property was completely absent.

Things I like about only defaulting unspecified properties:

* It's simpler to explain
* It's simpler to implement
* It doesn't mess with user data in any way
* It avoids masking schema errors if they explicitly send null for a property that is not nullable, even if it has a default

Opposing considerations:

* The distinction between "null" and "absent" is not easily maintained in all serialization formats (notably protobuf), so if that got lost in a round-trip, we could end up applying defaulting anyway (this isn't a huge concern, given we currently maintain the distinction for custom resources, so I think we have to figure out a way to continue doing so if we ever switch to persist in other formats like proto)
* We would not be able to replicate defaulting rules for built-in resources using this standard in a hypothetical world where built-in types get converted to CRDs (this also isn't a huge concern, given there are tons of defaulting rules for built-in types we couldn't replicate using openapi at all)
* Sending explicit null values in some patch formats today removes fields, so there's some prior art for null == unset in kube. For example, `kubectl patch ... --type=merge -p '{"spec":{"key":null}}'` removes an existing key property, it does not persist a literal null value. I'm not sure what server-side apply does here.
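The "null removes the key" behaviour of merge patches mentioned above follows JSON Merge Patch (RFC 7386) semantics. As a rough illustration, here is a minimal object-only merge written for this example; `mergePatch` is a hypothetical helper, not the code the apiserver uses.

```go
package main

import "fmt"

// mergePatch applies an object-valued JSON merge patch to target in place:
// a null (nil) patch value deletes the key, nested objects merge recursively,
// and any other value replaces the existing one.
func mergePatch(target, patch map[string]interface{}) map[string]interface{} {
	for k, v := range patch {
		if v == nil {
			delete(target, k) // explicit null means "remove", not "store null"
			continue
		}
		if pm, ok := v.(map[string]interface{}); ok {
			tm, isMap := target[k].(map[string]interface{})
			if !isMap {
				tm = map[string]interface{}{}
			}
			target[k] = mergePatch(tm, pm)
			continue
		}
		target[k] = v
	}
	return target
}

func main() {
	obj := map[string]interface{}{
		"spec": map[string]interface{}{"key": "value", "other": "kept"},
	}
	patch := map[string]interface{}{
		"spec": map[string]interface{}{"key": nil},
	}
	fmt.Println(mergePatch(obj, patch)) // map[spec:map[other:kept]]
}
```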

Overall, I think the simpler and more intuitive behavior is probably what we want.
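The simpler rule, defaulting only when the property is completely absent, can be sketched against the three cases from the UX example. `applyDefault` is a hypothetical helper name used only for this illustration.

```go
package main

import "fmt"

// applyDefault sets the default only when the field is absent. An explicit
// null or an empty list counts as a user-specified value and is left alone,
// so later schema validation can still report it.
func applyDefault(obj map[string]interface{}, field string, def interface{}) {
	if _, ok := obj[field]; !ok {
		obj[field] = def
	}
}

func main() {
	def := []interface{}{1}

	empty := map[string]interface{}{"defaulted-list": []interface{}{}}
	null := map[string]interface{}{"defaulted-list": nil}
	absent := map[string]interface{}{}

	for _, obj := range []map[string]interface{}{empty, null, absent} {
		applyDefault(obj, "defaulted-list", def)
	}

	fmt.Println(empty)  // map[defaulted-list:[]]
	fmt.Println(null)   // map[defaulted-list:<nil>]
	fmt.Println(absent) // map[defaulted-list:[1]]
}
```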

liggitt · 2019-05-01T15:07:58Z

/lgtm

would like a follow-up clarifying #1006 (comment)

liggitt · 2019-05-01T15:08:17Z

cc @lavalamp @deads2k
for approval

keps/sig-api-machinery/20190426-crd-defaulting.md

liggitt · 2019-05-02T13:51:26Z

/lgtm

deads2k · 2019-05-02T16:32:47Z

Spoken with the stakeholders, we think this is ready.

/approve

k8s-ci-robot · 2019-05-02T16:32:59Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, liggitt, sttts

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/sig-api-machinery/OWNERS~~ [deads2k]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. label Apr 26, 2019

k8s-ci-robot requested review from deads2k and lavalamp April 26, 2019 10:38

k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. kind/kep Categorizes KEP tracking issues and PRs modifying the KEP directory sig/api-machinery Categorizes an issue or PR as relevant to SIG API Machinery. labels Apr 26, 2019

sttts force-pushed the sttts-defaulting branch from 0b4d990 to 6bd255e Compare April 26, 2019 10:38

k8s-ci-robot assigned apelisse, deads2k, lavalamp, liggitt and mbohlool Apr 26, 2019

sttts force-pushed the sttts-defaulting branch 3 times, most recently from 3cfa64d to 2fd2eef Compare April 26, 2019 10:49

sttts mentioned this pull request Apr 26, 2019

sig-api-machinery: add "Graduate CustomResourceDefinitions to GA" #990

Merged

sttts changed the title ~~Sig-API-Machinery KEP: Defaulting for Custom Resources~~ sig-api-machinery KEP: Defaulting for Custom Resources Apr 26, 2019

sttts commented Apr 26, 2019

apelisse reviewed Apr 26, 2019

sttts force-pushed the sttts-defaulting branch from 7b72256 to 49b0dc5 Compare April 27, 2019 09:41

DirectXMan12 suggested changes Apr 29, 2019

sttts force-pushed the sttts-defaulting branch from f286b71 to bed3ccf Compare April 30, 2019 14:16

liggitt mentioned this pull request Apr 30, 2019

sig-api-machinery KEP: Pruning for CustomResources #709

Merged

sttts force-pushed the sttts-defaulting branch from 08edcff to 846bcf0 Compare April 30, 2019 21:57

liggitt reviewed May 1, 2019

liggitt reviewed May 1, 2019

k8s-ci-robot removed this from the v1.15 milestone May 1, 2019

sttts force-pushed the sttts-defaulting branch from c2caf51 to 2840c9e Compare May 1, 2019 10:57

liggitt reviewed May 1, 2019

liggitt reviewed May 1, 2019

liggitt reviewed May 1, 2019

liggitt reviewed May 1, 2019

liggitt reviewed May 1, 2019

sttts force-pushed the sttts-defaulting branch from d98e790 to a7fde86 Compare May 1, 2019 14:42

liggitt reviewed May 1, 2019

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 1, 2019

apelisse reviewed May 1, 2019

apelisse reviewed May 1, 2019

sttts force-pushed the sttts-defaulting branch from a7fde86 to 99b2486 Compare May 1, 2019 21:45

k8s-ci-robot removed the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 1, 2019

sttts force-pushed the sttts-defaulting branch 2 times, most recently from cd531c7 to 8c96d68 Compare May 1, 2019 21:53

sttts commented May 1, 2019

sttts force-pushed the sttts-defaulting branch 3 times, most recently from 4ba62c6 to 5ce436c Compare May 2, 2019 13:39

sig-api-machinery: add "Defaulting for CustomResources"

fc7d559

sttts force-pushed the sttts-defaulting branch from 5ce436c to fc7d559 Compare May 2, 2019 13:48

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label May 2, 2019

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label May 2, 2019

k8s-ci-robot merged commit 7134f6e into kubernetes:master May 2, 2019


