
Feature Request: Policy-Server should always run fault tolerant and PDBs should be configured #564

Closed
Martin-Weiss opened this issue Oct 26, 2023 · 4 comments

Comments

@Martin-Weiss

Is your feature request related to a problem?

When we deploy the kubewarden-defaults chart, we end up with a single-replica policy-server.

This is a problem for fault tolerance (e.g. during node drains), and when increasing the number of replicas we would also need anti-affinity rules and pod disruption budgets.

Solution you'd like

Not sure whether this should live in the operator or in the policy server, but we should have settings that allow configuring PDBs and anti-affinity, as well as the number of replicas (as a default for all policy servers and per individual policy server).

Alternatives you've considered

No response

Anything else?

No response

@flavio flavio transferred this issue from kubewarden/policy-server Oct 26, 2023
@flavio
Member

flavio commented Oct 26, 2023

I've moved the issue to the controller repository because this is an epic.

Anti-affinity rules

I propose extending the PolicyServer CRD by adding a new attribute called affinity of type v1/Affinity.

This object, when set, is then copied into the PodSpec of the Policy Server Deployment by our controller.

The default helm chart should then allow this affinity value to be set for the default Policy Server it creates.
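
To make the idea concrete, here is a minimal Go sketch of what this could look like; the package, type, and function names are illustrative assumptions, not the actual kubewarden-controller code:

```go
// Sketch only: names are assumptions for illustration, not the real controller code.
package controller

import (
	appsv1 "k8s.io/api/apps/v1"
	corev1 "k8s.io/api/core/v1"
)

// PolicyServerSpec excerpt showing the proposed optional affinity attribute.
type PolicyServerSpec struct {
	// Affinity, when set, is copied verbatim into the PodSpec of the
	// Deployment the controller creates for this Policy Server.
	// +optional
	Affinity *corev1.Affinity `json:"affinity,omitempty"`
}

// applyAffinity propagates the CRD-level affinity into the generated Deployment.
func applyAffinity(spec PolicyServerSpec, deployment *appsv1.Deployment) {
	if spec.Affinity != nil {
		deployment.Spec.Template.Spec.Affinity = spec.Affinity
	}
}
```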

Pod disruption budget

We should extend the PolicyServer CRD by adding the minAvailable and/or the maxUnavailable fields.

Question for @Martin-Weiss: do you think we should expose both attributes or just one of them?

The controller will then take care of creating the PodDisruptionBudget object that targets the specific Policy Server pods.

Here too, we should change the default helm chart to allow this value to be set for the default Policy Server.
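
A rough Go sketch of how the controller could build such a PodDisruptionBudget, assuming exactly one of the two fields is set; the function name and the pod label are assumptions, not the actual implementation:

```go
// Sketch only: helper name and labels are illustrative assumptions.
package controller

import (
	policyv1 "k8s.io/api/policy/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/util/intstr"
)

// buildPDB returns a PodDisruptionBudget targeting the pods of one Policy Server.
// Exactly one of minAvailable / maxUnavailable is expected to be non-nil.
func buildPDB(policyServerName, namespace string, minAvailable, maxUnavailable *intstr.IntOrString) *policyv1.PodDisruptionBudget {
	return &policyv1.PodDisruptionBudget{
		ObjectMeta: metav1.ObjectMeta{
			Name:      "policy-server-" + policyServerName,
			Namespace: namespace,
		},
		Spec: policyv1.PodDisruptionBudgetSpec{
			MinAvailable:   minAvailable,
			MaxUnavailable: maxUnavailable,
			Selector: &metav1.LabelSelector{
				// Assumed label; the real selector must match the labels the
				// controller puts on the Policy Server pods.
				MatchLabels: map[string]string{
					"kubewarden/policy-server": policyServerName,
				},
			},
		},
	}
}
```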

Replica size

The defaults helm chart already allows the replica number to be set. I would leave the default value at 1, because I think the default values should not be the ones aimed at a production deployment.

@flavio flavio transferred this issue from kubewarden/helm-charts Oct 26, 2023
@mpepping

Policy-servers should match the criticality of the Kubernetes API-server replicas within a cluster, e.g. by running as a DaemonSet on control plane nodes, or something along those lines.

@viccuad
Member

viccuad commented Feb 1, 2024

Configurable/autoscalable resources

Since the newly released Kubewarden 1.10, policy-servers have a different architecture that is both more efficient when scaling horizontally and more performant (see https://www.kubewarden.io/blog/2024/kubewarden-1-10-release/). This ameliorates the autoscaling problems.
Before adding something like a horizontalPodAutoscaler, one must be aware that a policy-server deployment only serves the scheduled policies as active once a rollout has happened. During the rollout, the old policies remain reachable via the old policy-server pod because the webhooks still point to it; once the new policy-server pod is ready, the webhooks are updated to point to it. Hence triggering autoscaling may be counterproductive because of the number of rollouts it causes. This may change in the future, though.

Configurable system-cluster-critical priorityClass

Given the notes listed in https://kubernetes.io/docs/concepts/scheduling-eviction/pod-priority-preemption/#notes-about-podpriority-and-existing-clusters, this feels like a one-way street. Setting it in the CRD should be optional, and once set, maybe it shouldn't be removable.
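
As a hedged illustration (not the actual controller code), propagating an optional priority class into the Policy Server PodSpec could look like this:

```go
// Sketch only: function name and wiring are assumptions for illustration.
package controller

import (
	corev1 "k8s.io/api/core/v1"
)

// applyPriorityClass sets the optional priority class on the Policy Server
// PodSpec. Once pods run with e.g. "system-cluster-critical", removing the
// class later changes scheduling/preemption behaviour, hence the
// "one-way street" concern above.
func applyPriorityClass(podSpec *corev1.PodSpec, priorityClassName string) {
	if priorityClassName != "" {
		podSpec.PriorityClassName = priorityClassName
	}
}
```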

@flavio
Member

flavio commented Apr 8, 2024

Marking as done. This is going to be part of 1.12 once it is tagged.
