KEP-2985: Public KRM Functions Registry #2986

mengqiy · 2021-09-22T15:48:00Z

One-line PR description: adding new KEP for public KRM functions registry

Issue link: Public KRM functions registry #2985

Other comments:

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-09-22T19:21:53Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+Publishers are responsible for the security of their KRM functions. Publishers
+are responsible for clearly communicating the expectation (e.g. maturity) to
+their users (e.g. through a file for the publisher. We should standardize it
+eventually). For example, Kustomize wants to provide a small set of carefully


Unless I'm missing something about what you mean here, we already have a means of indicating maturity: each function config's GV. I wouldn't expect a publisher's entire catalog to have a single maturity level.

KnVerey · 2021-09-22T19:26:08Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+_kustomize/experimental_ for the latter.
+
+If some functions don't have any publishers, the users should use it at their
+own risk.


What would be the use case for anonymous publishers as opposed to the "experimental" SIG publisher above? Even if the publisher is an individual Github user, I feel they should still be identified as responsible for the function.

Maintainers are responsible for the functions they contributed, even when those functions are not published with a publisher.

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-09-22T19:50:13Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+It should be sponsored by SIG-CLI.
+
+The repo name can be `KRM-functions`, `KRM-function-catalog`,
+`KRM-function-registry` or something more reasonable.


👍 I like the first and last options, since this will generate more than one catalog.

If the website is going to be krm-fn-registry.sigs.k8s.io, I prefer KRM-function-registry or KRM-fn-registry for consistency.

Agreed that we should align the domain and repo name, which implies using lower kebab case. We can mark the exact name as unresolved for now.

keps/sig-cli/2985-public-krm-functions-registry/README.md

mikebz · 2021-09-24T01:23:47Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+apiVersion: kustomize.config.k8s.io/v1beta1
+kind: Kustomization
+catalogs:
+- https://krm-fn-registry.sigs.k8s.io/catalog.yaml?publisher=kustomize


It's not clear to me why I need to specify the global registry. I would generally err on the side of providing reasonable defaults. Having to provide obvious stuff is tedious and discourages quick creation of kustomization.yaml files. The next thing we get is a request from someone to create a kustomization.yaml with all the boilerplate.

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-09-24T15:02:32Z

Since the way the Registry generates Catalogs will determine which means of consuming Catalogs are available to most users, I think we should more explicitly consider what we want them to do. E.g.:

- For discovery purposes and the imperative kpt workflows, hitting a website endpoint that dynamically generates a catalog feels suitable. In this case, freshness arguably matters most, and I'd guess traffic levels wouldn't be a problem.
- For declarative purposes like specification in Kustomizations (and in confirmatory CLI arguments), we should probably favor static options such as publishing OCI artifacts and/or git-committed snapshots addressable by SHA and semver tags. I think the KEP could use more discussion of what we'll make available/encourage in this regard, and I suspect @jeremyrickard will have some opinions on this as well. There's also a potential scalability discussion to be had here if the endpoints we encourage Kustomize users to reference are hosted on SIG infrastructure, since I'd expect significant traffic from sources like CI pipelines.

mengqiy · 2021-09-24T18:20:11Z

For declarative purposes like specification in Kustomizations (and in confirmatory CLI arguments), we should probably favor static options such as publishing OCI artifacts and/or git-committed snapshots addressable by SHA and semver tags.

I agree a static (versioned) catalog can be useful for declarative purposes and for reproducible builds, which are especially important for production use.
But whether we should require semver for tagging a catalog is questionable, since it's not clear to me that it would provide more benefit than confusion. E.g. what are the expectations for a major, minor or patch release of a catalog? @KnVerey @jeremyrickard Thoughts?
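
As a side illustration of why a static, content-addressed snapshot gives reproducibility regardless of how tags are named: the sketch below pins a catalog by digest, in the `sha256:<hex>` style that OCI registries use. The helper names and the catalog bytes are hypothetical and not part of the KEP.

```python
import hashlib


def digest_of(content: bytes) -> str:
    """Return a content digest in the `sha256:<hex>` form."""
    return "sha256:" + hashlib.sha256(content).hexdigest()


def verify_pinned(content: bytes, pinned_digest: str) -> bool:
    """Check that fetched catalog bytes match the digest the user pinned."""
    return digest_of(content) == pinned_digest


# Stand-in for a published catalog snapshot (hypothetical content).
snapshot = b"apiVersion: example.com/v1alpha1\nkind: Catalog\n"
pinned = digest_of(snapshot)

# The same bytes verify; any later change to the catalog does not.
unchanged_ok = verify_pinned(snapshot, pinned)
changed_ok = verify_pinned(snapshot + b"# new function added\n", pinned)
print(unchanged_ok, changed_ok)
```

A digest answers "is this the catalog I vetted?" without needing any agreement on what a major, minor or patch bump of a catalog means.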

mengqiy · 2021-10-01T23:14:33Z

The KEP has been updated to include KRM function metadata and a workflow for publishing a function.

mengqiy · 2021-10-01T23:16:59Z

@mikebz GitHub doesn't allow me to re-request a review from you, so I'm pinging you here.

keps/sig-cli/2985-public-krm-functions-registry/README.md

mikebz · 2021-10-04T21:50:47Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+    - apiVersion: example.com/v1alpha1
+      kind: LegacySetNamespace
+      deprecated: true
+  usage: <a URL pointing to a README.md>


How does this work with something like docsify? AFAIK, docsify wants your markdown files locally. So will we then have to develop something that fetches the files when the index changes?

This is a good point!

So will we then have to develop something that fetches the files when the index changes?

I'd avoid it since it will introduce some maintenance burden.

One way to keep the index fresh is to build the site both on demand (e.g. when a PR merges) and on a schedule (e.g. nightly).
An alternative we can consider is to require contributors to check these markdown files (usage and examples) into the registry repo, so that the URL in the metadata points to markdown files in the same repo.

We could also do both: allow a URL, in which case the website will have a hyperlink instead of inline docs, or a local markdown file, in which case the docs will be embedded into the equivalent field when Catalog is built.

When I said using the markdown files in the same repo, I meant we can use a URL pointing to the file in the same repo, like https://raw.githubusercontent.com/kubernetes-sigs/krm-functions-registry/master/path/to/markdown. That will make it easier to construct a Catalog resource than using a relative path.
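
The "allow both" option floated in this thread could be sketched roughly as follows. This is only an illustration under assumptions: `resolve_usage`, the `usageLink` output key and the file path are hypothetical names, and file access is passed in as a callable so the sketch does not depend on any particular repo layout.

```python
from typing import Callable, Dict
from urllib.parse import urlparse


def resolve_usage(usage: str, read_file: Callable[[str], str]) -> Dict[str, str]:
    """Decide how a function's `usage` value ends up in the built Catalog.

    A URL is kept as a hyperlink; anything else is treated as a path to a
    markdown file in the registry repo and embedded inline.
    """
    scheme = urlparse(usage).scheme
    if scheme in ("http", "https"):
        return {"usageLink": usage}
    return {"usage": read_file(usage)}


# In-memory stand-in for markdown files checked into the registry repo.
repo_files = {
    "functions/set-namespace/README.md": "# set-namespace\n\nSets the namespace.",
}

linked = resolve_usage("https://example.com/docs/set-namespace.md",
                       repo_files.__getitem__)
embedded = resolve_usage("functions/set-namespace/README.md",
                         repo_files.__getitem__)
print(linked)
print(embedded)
```

Requiring a URL in every case, as suggested in the last reply, would remove the second branch entirely, which is the simplification being argued for.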

mikebz · 2021-10-04T21:53:18Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+
+```shell
+├── publishers
+│   ├── communities


Why would we want to separate those? What advantage does that give us? Hypothetical question: why don't we just have one long list of functions at the root level by name, with the owner metadata already specified in each?

We now introduce a dependency between what's in the file and where it exists. People might forget to move it if, for instance, the publisher changes (e.g. gets acquired).

Different types of publishers will have slightly different requirements. When we see a publisher Foo in a function's metadata, we need to know the type of Foo (community, company or individual) to determine which verification requirements apply.
I'm open to suggestions about the layout.

We now introduce a dependency between what's in the file and where it exists. People might forget to move it if, for instance, the publisher changes (e.g. gets acquired).

Are you worried that the image may disappear (e.g. be renamed or moved) after the function is published?
To mitigate that, we can have a cron-based CI job, e.g. one that checks nightly whether the images still exist and, if not, notifies the maintainers by GH issue and/or email.
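
A minimal sketch of that nightly check, under stated assumptions: the metadata fields (`name`, `image`, `maintainer`) and both helper functions are hypothetical, and the registry lookup is injected as `image_exists` so that no real registry API is implied.

```python
from typing import Callable, Dict, Iterable, List


def find_missing_images(functions: Iterable[Dict[str, str]],
                        image_exists: Callable[[str], bool]) -> List[Dict[str, str]]:
    """Return the metadata entries whose container image can no longer be found."""
    return [fn for fn in functions if not image_exists(fn["image"])]


def format_notification(fn: Dict[str, str]) -> str:
    """Build the text for a GH issue or email to the function's maintainer."""
    return (f"To {fn['maintainer']}: image {fn['image']} for function "
            f"{fn['name']} is no longer available.")


published = [
    {"name": "set-namespace",
     "image": "example.com/fn/set-namespace:v0.1",
     "maintainer": "alice@example.com"},
    {"name": "legacy-set-namespace",
     "image": "example.com/fn/legacy-set-namespace:v0.1",
     "maintainer": "bob@example.com"},
]

# Stand-in for a real registry query: only this image still exists.
still_available = {"example.com/fn/set-namespace:v0.1"}

missing = find_missing_images(published, lambda image: image in still_available)
for fn in missing:
    print(format_notification(fn))
```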

What do you think about defining a kind: Publisher and having that be a second required file in each directory (which could then be flat as Mike suggests)? That kind could have fields for the publisher type, and perhaps some of its fields could also be used to default fields in KRMFunctions during Catalog construction.

Does it mean each publisher needs to duplicate the kind: Publisher file in every function directory?
To avoid duplication, we can have a publisher directory that contains a list of kind: Publisher files. We can then have a list of functions in the root-level directory, and each function's metadata can point to a publisher.

Since the publishers each already have a directory in your proposed structure, I was picturing the publisher.yaml going in there. But we can iterate on details like that as we figure out what makes sense.
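
To make the defaulting idea in this thread concrete, here is a rough sketch of how fields from a kind: Publisher file might fill gaps in a KRMFunction during Catalog construction. The `functionDefaults` field, the `type` field and the example values are all hypothetical; the thread does not settle on a schema.

```python
from typing import Any, Dict


def apply_publisher_defaults(function: Dict[str, Any],
                             publisher: Dict[str, Any]) -> Dict[str, Any]:
    """Fill fields missing from a function's spec with the publisher's defaults.

    Values set on the function always win over publisher defaults.
    """
    defaults = publisher.get("spec", {}).get("functionDefaults", {})
    merged_spec = {**defaults, **function.get("spec", {})}
    return {**function, "spec": merged_spec}


publisher = {
    "kind": "Publisher",
    "metadata": {"name": "kustomize"},
    "spec": {
        "type": "community",  # community, company or individual
        "functionDefaults": {"license": "Apache-2.0", "maintainers": ["sig-cli"]},
    },
}

function = {
    "kind": "KRMFunction",
    "metadata": {"name": "set-namespace"},
    "spec": {"maintainers": ["alice"]},
}

merged = apply_publisher_defaults(function, publisher)
print(merged["spec"])
```

With one publisher file per publisher directory, the function files stay free of repeated fields, and a change of publisher means editing a reference rather than every function.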

KnVerey

I would love to get this in as provisional ahead of my and Jeff's related talk next Wednesday. To that end, can we tag the ongoing discussion points as unresolved and aim for a merge early next week? My main quibbles are either about things that I think are fine to mark unresolved at this stage, or about the schema of KRMFunction, which I think should ultimately move to the Catalog KEP, because they need to match IMO (maybe I'm missing something about why they shouldn't?).

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-10-07T22:26:07Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+It should be sponsored by SIG-CLI.
+
+The repo name can be `KRM-functions`, `KRM-function-catalog`,
+`KRM-function-registry` or something more reasonable.


Agreed that we should align the domain and repo name, which implies using lower kebab case. We can mark the exact name as unresolved for now.

KnVerey · 2021-10-07T22:35:49Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+function. We will only support container-based KRM functions in the public
+registry.
+
+Ideally, the content under field `spec` should be able to be used directly in a


Is this just ideal, or is it required? When should they differ, given that a catalog needs to be programmatically constructed from this data? I'd expect the Registry's catalog compilation to essentially just be collecting a set of these and inserting them into the catalog's functions field. And if it HAS to match, then one KEP or the other (Catalog, IMO) should be the source of truth for it before either is moved to implementable, at the latest.

Currently Catalog has a definition: field that presumably points to a separate artifact with most of the information you have below. I think I agree with inlining it, but I also think we should make the structure closer to a CRD. That's really what a function is after all: a client-side CR, and kind: KRMFunction is shaping up to be the CRD analogue. I think we should embrace this analogy on principle, but there are also some issues below that the CRD structure would solve. Notably, in a CRD, most fields fall under items in a versions list. That should be the case for most of the fields you have directly under spec right now: schema, runtime, examples, idempotent, and license could all change with the version.

Please mark this entire "Function Metadata" section as unresolved.
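
The CRD-like layout suggested above can be pictured as moving the per-version fields of a flat spec under a `versions` list. The sketch below is illustrative only: the function name, the `group` and `names` keys and the sample values are assumptions, and the list of per-version fields is taken from the comment rather than from any agreed schema.

```python
from typing import Any, Dict

# Fields the review comment lists as able to change from version to version.
PER_VERSION_FIELDS = ("schema", "runtime", "examples", "license", "idempotent")


def to_versioned(flat_spec: Dict[str, Any], version_name: str) -> Dict[str, Any]:
    """Move per-version fields of a flat spec under a single `versions` entry."""
    version: Dict[str, Any] = {"name": version_name}
    shared: Dict[str, Any] = {}
    for key, value in flat_spec.items():
        if key in PER_VERSION_FIELDS:
            version[key] = value
        else:
            shared[key] = value
    shared["versions"] = [version]
    return shared


flat = {
    "group": "example.com",
    "names": {"kind": "SetNamespace"},
    "runtime": {"container": {"image": "example.com/fn/set-namespace:v0.1"}},
    "idempotent": True,
    "license": "Apache-2.0",
}

versioned = to_versioned(flat, "v1alpha1")
print(versioned)
```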

Updated the wording.
Marked this section as "unresolved"

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-10-07T23:22:53Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+```
+
+The following is an example for exec-based KRM function. We will not allow
+contributors to publish exec-based KRM functions. But we want to standardize the


I think we should allow them in the registry even if we don't publish any of the first-party ones this way. Otherwise they're effectively not supported. Orchestrators can make their own choices about what to run/prefer. On the other hand, we should require all verification-related fields to be populated for all exec functions in the registry.

I think we should allow them in the registry even if we don't publish any of the first-party ones this way. Otherwise they're effectively not supported.

How strongly do you feel about allowing publishing exec-based KRM functions from day 1?

Orchestrators can make their own choices about what to run/prefer.

Orchestrators should be secure by default.
Ideally kustomize should have a --allow-exec-plugins flag or something similar. Imagine we have the following use case:
A kustomize user is trying to achieve a task. The user finds a kustomization.yaml on the internet (e.g. Stack Overflow) that claims to solve it. That kustomization.yaml uses a catalog that contains exec-based plugins. When the user runs kustomize build, it should ask the user to set the --allow-exec-plugins flag.

How strongly do you feel about allowing publishing exec-based KRM functions from day 1?

As long as the schema we propose supports it in theory, I think it's reasonable to implement container-based only in the initial pass and e.g. require exec support for beta. We can outline this in the rollout plan in the follow-up PR that moves this KEP to implementable.

Ideally kustomize should have a --allow-exec-plugins flag or something similar. Imagine we have the following use case:
A kustomize user is trying to achieve a task. The user finds a kustomization.yaml on the internet (e.g. Stack Overflow) that claims to solve it. That kustomization.yaml uses a catalog that contains exec-based plugins. When the user runs kustomize build, it should ask the user to set the --allow-exec-plugins flag.

This is a discussion for the "plugin graduation" KEP, but as a quick tl;dr it is proposing that Kustomize require the end user to explicitly trust the catalog rather than authorize categories of runtimes. E.g. they want to trust one specific exec plugin that the kustomization from the internet requires, and they've checked it out specifically, so we should not ask them to authorize all exec plugins. By requiring them to type --trusted-catalog=[the catalog they've vetted], they are authorizing a specific set of things that they can audit and that will not silently change. If the Kustomization is remote and gets changed to use additional plugins, Kustomize will fail the build and complain that the catalog the user is trusting does not include the new ones. Another way to think about it is that a catalog reference in a Kustomization says to Kustomize "you need this" and the command-line flag says "you can use this". If the latter isn't a superset of the former, the build fails.
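
The superset rule described above amounts to a set comparison. The following is a minimal sketch, not Kustomize's implementation: `FunctionRef`, `check_trusted` and the `SetNamespace` kind are hypothetical names used for illustration, and the actual behavior is a matter for the plugin graduation KEP.

```python
from typing import Iterable, NamedTuple, Set


class FunctionRef(NamedTuple):
    """Hypothetical identifier for one function entry in a catalog."""
    api_version: str
    kind: str


def check_trusted(required: Iterable[FunctionRef],
                  trusted: Iterable[FunctionRef]) -> Set[FunctionRef]:
    """Return the functions the Kustomization needs but the user has not trusted.

    An empty result means the trusted set is a superset of the required set,
    so the build may proceed.
    """
    return set(required) - set(trusted)


# What the Kustomization's catalog reference asks for ("you need this").
required = [
    FunctionRef("example.com/v1alpha1", "SetNamespace"),
    FunctionRef("example.com/v1alpha1", "LegacySetNamespace"),
]
# What the user vetted and passed on the command line ("you can use this").
trusted = [FunctionRef("example.com/v1alpha1", "SetNamespace")]

untrusted = check_trusted(required, trusted)
if untrusted:
    names = ", ".join(sorted(f"{ref.api_version}/{ref.kind}" for ref in untrusted))
    print(f"build fails: trusted catalog does not include {names}")
```

Because the check is against a fixed, audited set rather than a runtime category, a remote Kustomization that later adds a plugin produces a non-empty difference and the build stops.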

KnVerey · 2021-10-07T23:24:48Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+    - apiVersion: example.com/v1alpha1
+      kind: LegacySetNamespace
+      deprecated: true
+  usage: <a URL pointing to a README.md>


We could also do both: allow a URL, in which case the website will have a hyperlink instead of inline docs, or a local markdown file, in which case the docs will be embedded into the equivalent field when Catalog is built.

keps/sig-cli/2985-public-krm-functions-registry/README.md

KnVerey · 2021-10-07T23:32:05Z

keps/sig-cli/2985-public-krm-functions-registry/README.md

+
+```shell
+├── publishers
+│   ├── communities


What do you think about defining a kind: Publisher and having that be a second required file in each directory (which could then be flat as Mike suggests)? That kind could have fields for the publisher type, and perhaps some of its fields could also be used to default fields in KRMFunctions during Catalog construction.

mengqiy · 2021-10-08T19:18:36Z

@KnVerey I will update the KEP and mark the open discussion points as UNRESOLVED. We can aim to merge it as provisional before KubeCon.

mengqiy · 2021-10-12T21:57:36Z

/cc @KnVerey @monopole

KnVerey

/lgtm
/approve

k8s-ci-robot · 2021-10-12T22:41:33Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: KnVerey, mengqiy

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~keps/sig-cli/OWNERS~~ [KnVerey]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

public KRM functions registry KEP

b607466

k8s-ci-robot requested review from seans3 and soltysh September 22, 2021 15:48

update toc

2121c89

KnVerey reviewed Sep 22, 2021

mikebz reviewed Sep 24, 2021

address comments

75ac46a

k8s-ci-robot added the size/XXL label (denotes a PR that changes 1000+ lines, ignoring generated files) and removed the size/XL label (denotes a PR that changes 500-999 lines, ignoring generated files) Sep 24, 2021

KnVerey reviewed Sep 24, 2021

Address comment

4de30bc

Add workflow to publish a function.

mengqiy requested review from KnVerey, jeremyrickard and natasha41575 October 1, 2021 23:15

mikebz reviewed Oct 4, 2021

mikebz reviewed Oct 4, 2021

KnVerey reviewed Oct 7, 2021

KnVerey mentioned this pull request Oct 8, 2021

KEP-2906: Adding initial KEP artifacts #2908

Merged

kikisdeliveryservice mentioned this pull request Oct 8, 2021

Public KRM functions registry #2985

Closed

4 tasks

address comments and mark some sections as unresolved

b6f2bbc

fix spelling and toc

8d1d7c0

mengqiy requested a review from KnVerey October 11, 2021 17:45

update reviewers and approvers

0faba0b

k8s-ci-robot requested a review from monopole October 12, 2021 21:57

KnVerey approved these changes Oct 12, 2021

k8s-ci-robot assigned KnVerey Oct 12, 2021

k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Oct 12, 2021

k8s-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 12, 2021

k8s-ci-robot merged commit e7f51ff into kubernetes:master Oct 12, 2021

k8s-ci-robot added this to the v1.23 milestone Oct 12, 2021

mengqiy deleted the krmfn branch October 12, 2021 22:50
