WIP Workload partitioning API enhancement #802
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED. This pull-request needs approval from an approver in each of these files; approvers can indicate their approval by writing `/approve` in a comment. The full list of commands accepted by this bot can be found here.

/priority important-soon
### Goals

This feature is about designing a read-only API that will describe the enabled workload partitions (types, classes, etc.). This information is needed for kubelet to start exposing the right resources, as well as for the admission webhook to know when pod manipulation is needed.
This paragraph would be good as the intro to the Motivation section on line 67.
This feature is about designing a read-only API that will describe the enabled workload partitions (types, classes, etc.). This information is needed for kubelet to start exposing the right resources, as well as for the admission webhook to know when pod manipulation is needed.
It is expected that this API object will be created during the installation process, either manually or using the Performance Addon Operator render mode.
We should work this sentence into the Proposal section.
# arbitrary name, all objects of this Kind should be processed and merged
name: management-partition
status:
  # list of strings, defines partition names to be exposed by kubelet
This list is also used by the admission hook as a way to know when pods asking to be partitioned should be mutated.
Kubelet doesn't actually look at the list here; the PAO will use it to configure kubelet, right?
Oh yeah, you are right.
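To make the admission-hook side concrete: a pod would request a partition by name, and the webhook would only mutate it when that name appears in this list. A rough sketch of such a pod, assuming the `target.workload.openshift.io/<partition-name>` annotation style from the original enhancement (the exact annotation value and all names below are illustrative):

```yaml
apiVersion: v1
kind: Pod
metadata:
  name: example-operator-pod
  namespace: openshift-example
  annotations:
    # "management" must be one of the partition names published by this API,
    # otherwise the webhook leaves the pod untouched.
    target.workload.openshift.io/management: '{"effect": "PreferredDuringScheduling"}'
spec:
  containers:
  - name: operator
    image: example.io/operator:latest
    resources:
      requests:
        cpu: 100m
        memory: 50Mi
```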
The proposal is to define a new cluster-wide Custom Resource Definition that would describe the allowed partition names in the status section. That way it hints at being a read-only object where no user/admin input or modifications are expected.
apiVersion: workloadpartitioning.openshift.io/v1
In the original enhancement we use `workload.openshift.io` in some annotations. Should we do that here, too, and make this something like `partitioning.workload.openshift.io`? What is standard/preferred for OpenShift APIs?

Alternatively, do we anticipate other API inputs related to workloads that might live on this CR later, so we should not include "partition" in the name?
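For illustration only, the alternative group name would make the object header look something like this (nothing here is decided; the kind and name are reused from the proposal above):

```yaml
apiVersion: partitioning.workload.openshift.io/v1
kind: WorkloadPartitions
metadata:
  name: management-partition
```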
name: management-partition
status:
  # list of strings, defines partition names to be exposed by kubelet
  globalPartitionNames:
Bike shed: We should be consistent and either call it "global" or "cluster" but not use those two terms interchangeably.
Similar to the `Drawbacks` section, the `Alternatives` section is used to highlight and record other possible approaches to delivering the value proposed by an enhancement.
Elsewhere we've discussed the possible need to add per-workload parameters, like scope. The proposal above implies those would end up as separate lists, which I think is fine. We should include the other form here in the Alternatives section, for completeness. Our notes doc has an example like:

workloadTypes:
- name: user-defined-workload-type
  scope: Pool
- name: management
  scope: ClusterWide
Good point!
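For completeness, a sketch of how that alternative per-workload-parameter form could look as a full object (hypothetical object, reusing the group and kind from the proposal above):

```yaml
apiVersion: workloadpartitioning.openshift.io/v1
kind: WorkloadPartitions
metadata:
  name: management-partition
status:
  # Instead of a flat list of names, each workload type carries its own parameters.
  workloadTypes:
  - name: user-defined-workload-type
    scope: Pool
  - name: management
    scope: ClusterWide
```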
- management
To allow for future extensibility and possible multiple sources of workload partition names (coming from customers, the installer, or other operators, etc.), we propose that there might be multiple `WorkloadPartitions` objects injected into the cluster. The expected behavior is that all components would just merge all the defined names together.
In other places where we have cluster-scoped configuration resources like this we look for one well-defined name, often `cluster`. I'm not sure how we decide between one and many, though. Maybe in this case many makes sense if we assume there may be multiple creators but that the CRD only has status fields. Do we really anticipate multiple creators?
If we allow custom partitions in the future then I would say yes, there might be multiple owners in such a case (the customer creates one and PAO renders another one). I just did not want to close the door prematurely.
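To illustrate that merge behavior (the second object and all names below are hypothetical), two `WorkloadPartitions` objects might coexist and every consumer would simply union their name lists:

```yaml
# Rendered by the Performance Addon Operator at install time
apiVersion: workloadpartitioning.openshift.io/v1
kind: WorkloadPartitions
metadata:
  name: management-partition
status:
  globalPartitionNames:
  - management
---
# Hypothetical object added later by a cluster administrator
apiVersion: workloadpartitioning.openshift.io/v1
kind: WorkloadPartitions
metadata:
  name: customer-partition
status:
  globalPartitionNames:
  - telco-control-plane
```

The effective partition set seen by the admission webhook and PAO would then be `management` plus `telco-control-plane`.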
1) An admin creates a WorkloadPartitions object manually after the cluster has been running for some time on a cluster that already has partitioning enabled
1) An admin creates the WorkloadPartitions object manually after the cluster has been running for some time on a cluster with no workload partitioning enabled
1) An admin deletes the WorkloadPartitions object that was created during the install process
1) A random user manages to create a WorkloadPartitions object due to a bug in the defined RBAC rules
This list is a good start. We should add some detail about what ill effects might result from each of these cases.
## Proposal
The proposal is to define a new cluster-wide Custom Resource Definition that would describe the allowed partition names in the status section. That way it hints at being a read-only object where no user/admin input or modifications are expected.
It is not clear from the proposal who should own this CRD.
That's explained a little better in #753, but we should probably summarize it here. The CRD will be defined in `openshift/api`.
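As a rough illustration only (the schema details below are assumptions, not the eventual openshift/api definition), the CRD skeleton could look like:

```yaml
apiVersion: apiextensions.k8s.io/v1
kind: CustomResourceDefinition
metadata:
  name: workloadpartitions.workloadpartitioning.openshift.io
spec:
  group: workloadpartitioning.openshift.io
  scope: Cluster
  names:
    kind: WorkloadPartitions
    plural: workloadpartitions
  versions:
  - name: v1
    served: true
    storage: true
    schema:
      openAPIV3Schema:
        type: object
        properties:
          status:
            type: object
            properties:
              # names are only published in status, hinting that the object is read-only
              globalPartitionNames:
                type: array
                items:
                  type: string
```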
To allow for future extensibility and possible multiple sources of workload partition names (coming from customers, the installer, or other operators, etc.), we propose that there might be multiple `WorkloadPartitions` objects injected into the cluster. The expected behavior is that all components would just merge all the defined names together.
There is no controller or reconcile loop as part of this proposal. Only the cluster administrator will have the ability to create or manipulate the WorkloadPartitions objects. Anyone will be allowed to read them.
We will still run reconciliation under the PAO once the `WorkloadPartitions` resource is updated.
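A minimal sketch of what the "admin-only writes, everyone reads" RBAC could look like (the role and binding names here are hypothetical; the real rules would ship alongside the CRD):

```yaml
# Allow every authenticated user to read WorkloadPartitions objects.
# Create/update/delete stays with cluster administrators (e.g. via cluster-admin).
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRole
metadata:
  name: workloadpartitions-reader
rules:
- apiGroups: ["workloadpartitioning.openshift.io"]
  resources: ["workloadpartitions"]
  verbs: ["get", "list", "watch"]
---
apiVersion: rbac.authorization.k8s.io/v1
kind: ClusterRoleBinding
metadata:
  name: workloadpartitions-reader
roleRef:
  apiGroup: rbac.authorization.k8s.io
  kind: ClusterRole
  name: workloadpartitions-reader
subjects:
- apiGroup: rbac.authorization.k8s.io
  kind: Group
  name: system:authenticated
```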
### Risks and Mitigations
1) An admin creates a WorkloadPartitions object manually after the cluster has been running for some time on a cluster that already has partitioning enabled
Looks like something happened with the ordering here.
# List of strings, defines partition names that will be recognized by the
# workload partitioning webhook. This list will also inform PAO about partitions
# that should be configured on the kubelet and CRI-O level.
clusterPartitionNames:
Hm, what about the CPUs that should be used for the CPU pinning, or will it use `reservedCPUs` by default?
The API server does not care. Kubelet / CRI-O will get the right CPU IDs from PAO.
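For context, the CPU sets would presumably keep coming from the existing PerformanceProfile API rather than from this new CR; a minimal example with illustrative values (the assumption being that PAO renders the kubelet and CRI-O partition configuration from the reserved set):

```yaml
apiVersion: performance.openshift.io/v2
kind: PerformanceProfile
metadata:
  name: example-profile
spec:
  cpu:
    # CPUs set aside for management/infra workloads; assumed source of the partition CPU IDs
    reserved: "0-1"
    # CPUs left for regular workloads
    isolated: "2-7"
  nodeSelector:
    node-role.kubernetes.io/worker-cnf: ""
```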
Inactive enhancement proposals go stale after 28d of inactivity. See https://github.com/openshift/enhancements#life-cycle for details. Mark the proposal as fresh by commenting `/remove-lifecycle stale`. If this proposal is safe to close now please do so with `/close`.

/lifecycle stale

Stale enhancement proposals rot after 7d of inactivity. See https://github.com/openshift/enhancements#life-cycle for details. Mark the proposal as fresh by commenting `/remove-lifecycle rotten`. If this proposal is safe to close now please do so with `/close`.

/lifecycle rotten

Rotten enhancement proposals close after 7d of inactivity. See https://github.com/openshift/enhancements#life-cycle for details. Reopen the proposal by commenting `/reopen`.

/close
@openshift-bot: Closed this PR.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository.
This is part of the bigger Workload partitioning enhancement and deals with the trigger API specifics.

Still heavily WIP.

Signed-off-by: Martin Sivák <msivak@redhat.com>