Proposal to allow mtu changes #926
Conversation
### Goals

* Allow changing the MTU post-install on OVN Kubernetes.
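For context, a purely hypothetical sketch of what such a post-install change could look like from the administrator's side, assuming the MTU ends up being exposed through the CNO operator configuration; the field path and value shown are illustrative assumptions:

```bash
# Hypothetical administrator workflow, not a committed interface: change the
# cluster network MTU through the CNO operator configuration.
oc patch network.operator.openshift.io cluster --type=merge \
  -p '{"spec":{"defaultNetwork":{"ovnKubernetesConfig":{"mtu":1400}}}}'
```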
I think we had wanted to do this for openshift-sdn too?
I think we do, but my understanding was that it would not be at the same time and not for 4.10 anyway, so I did not cover it in this enhancement, partly because of time constraints and partly because I don't know anything about openshift-sdn.
3. Once all the previous pods are finished, deploy another set of pods on every
   node that will handle the actual change of the MTU. Wait for them to be
   ready and running.
4. Cordon every node; we don't want pods created from this point on.
Cordoning every node simultaneously may have drastically bad failure modes... we'll need discussion with other teams about this to make sure this is OK.
Is it because it is done simultaneously? Could you give an example of a likely failure? Would it be better to document it as a precondition to the procedure instead of as part of it?
Well, for example, CNO does not tolerate the unschedulable
taint, so if something bad happened and it exited/crashed/was killed, it would not be possible to restart it, and then the MTU change process would be stalled.
I changed it to remove the need for cordoning, as it might bring more problems to the table than we need.
But now the procedure is to always restart ovnkube-node before changing the running pods' MTU, regardless of whether we are decreasing or increasing the MTU. There is an increased chance of temporary disruption when decreasing the MTU, but overall it is a more robust procedure.
5. If any of these steps failed, the pod will exit with code 1; if all were
   successful, it will exit with code 0.
does it make any attempt at rollback?
No, should we? Assuming rollback would fail as well at that point.
Actually, no rollback because we will reboot the node on failure anyway?
Assuming rollback would fail as well at that point.
Well, like, maybe it changes the pod MTUs but then fails to change the bridge MTU. It could probably roll back the pod MTU changes in that case...
Actually, no rollback because we will reboot the node on failure anyway?
Does it say that somewhere?
Step 10. I also changed it so that the node interfaces' MTU is changed by the restart of ovnkube-node, so this specific pod will only change the pods' MTU.
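To make that pod-MTU step more tangible, here is a rough, hypothetical sketch of how such a pod could adjust the interfaces of running pods; the namespace path, interface name, and target value are illustrative assumptions, not what the enhancement specifies:

```bash
#!/bin/sh
# Hypothetical sketch only: adjust the MTU of running pods' interfaces from a
# privileged, host-networked pod. Assumes pod network namespaces are exposed
# under /var/run/netns and that the pod-side interface is eth0.
NEW_MTU=1400   # illustrative target value

for ns in /var/run/netns/*; do
    [ -e "$ns" ] || continue
    # Set the pod-side interface MTU inside each pod network namespace.
    nsenter --net="$ns" ip link set dev eth0 mtu "$NEW_MTU" || exit 1
done
exit 0
```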
Since nothing happens atomically, especially across all nodes/pods in a cluster, things will be inconsistent for a time and that must be addressed. Any changes to MTU will ONLY be used by new connections. Existing TCP [1] connections, node-node, node-pod, pod-pod, will continue to use the negotiated MSS [2]. For existing TCP [1] connections, best case, changing MTU will not change a thing. The MSS that was negotiated when the connection was established will continue to be used. If you change MTU to a larger value, no problem. If you change to a smaller value, an ICMP Type 3 Code 4 will result (best case) or traffic will be black holed (worst case). This is the simple case. The term pod can be ambiguous, e.g. a pod on the host network is also a pod. For simplicity:

In general, increasing MTU could work. As the change rolls out, existing connections are not impacted. New connections where one side has the larger MTU and the other does not yet have the larger value will continue to use MIN(MSS from remote, local MTU). This assumes the order described for increase of "physical", br-ex, veth of OVS and veth of pod. (Note: Dan's earlier comment about not changing the node at all, leaving it up to the user to do, still results in what I describe, but the "roll out" is longer and manual. Thus the time to converge is longer.) It also assumes the other node(s)' "physical", br-ex, veth of OVS and veth of pod are already larger; if not, the packet [3] will be dropped and ICMP generated.

Since at least one node has to go first, the cluster will fill up with Route Cache Entries (RCE) due to the ICMP Type 3 Code 4 from all the nodes that haven't got the memo to use the larger value. This continues until the cluster converges on the new settings. It may black hole traffic. I'm still drinking on this.

I suggest that all ICMP Type 3 Code 4 messages be dropped (via iptables rules added as part of the enhancement) until we think we are done with the rollout. Just dropping DF=1 packets is enough. Also, sending ICMP telling every node/pod that the MTU should be what we are trying to get it to be only wastes cycles (and executes code paths not often used).

Decreasing MTU on the underlay is problematic. As before, existing connections will ignore the change. When a datagram from an existing connection meets a now lower MTU on an interface on the path, an ICMP Type 3 Code 4 will result [3]. It's possible that control traffic is black holed "long enough" to make e.g. the apiserver not ready. On the overlay, there will be some churn (as described above) but things should be ok eventually.

[1] The same is true for e.g. SCTP.
[2] The MSS can change after the fact due to PMTU discovery as I describe.
[3] Assuming DF=1, which is typical, or IP fragmentation will be performed.
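As a concrete illustration of the suggested mitigation, a minimal sketch of the kind of iptables rule meant above, dropping fragmentation-needed ICMP for the duration of the rollout; the chain and placement are assumptions for illustration, not part of the proposal:

```bash
# Hypothetical mitigation sketch: drop ICMP Type 3 Code 4 ("fragmentation
# needed") while the MTU rollout is in progress.
iptables -I FORWARD -p icmp --icmp-type fragmentation-needed -j DROP

# Once the cluster has converged on the new MTU, remove the rule again so
# Path MTU discovery works normally.
iptables -D FORWARD -p icmp --icmp-type fragmentation-needed -j DROP
```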
   node that will handle the actual change of the MTU. Wait for them to be
   ready and running.
4. Cordon every node; we don't want pods created from this point on.
5. Ensure via machine-config-operator that, upon reboot, configure-ovs will
@cybertron FYI just in case there is any interaction with keyfiles placed by MCO with #817
I don't think this will be a problem, at least in terms of day 1 or MCO. The day 1 configuration will be baked into the image and not managed by MCO, so after deployment if any changes are made MCO won't overwrite them.
The place where it could be an issue is kubernetes-nmstate. Any keyfiles that nmstate writes are at risk of being overwritten if they're modified outside nmstate. If the MTU of an interface configured by kubernetes-nmstate needs to be modified, it probably needs to be done via nmstate.
I'm not sure that's actually a problem right now as I don't think we support nmstate modifying the configuration on the OVNK interface anyway, but I believe they were looking to add support for that upstream so it might be in the future.
- Progressing: false
- Degraded: false

The steps to change the MTU performed by pods of previous step 3 are:
should this pod detect if there are MTU problems after migration is complete, and post an event or something to indicate if it was successful or not?
What do you mean by detect?
If OpenShift has a verification procedure to health-check deployments, then we can probably suggest in the documentation running it after this procedure.
I mean that in your step 5:
5. Once all the previous pods finish successfully, deploy other set of pods with
`restartPolicy: Never` on every node that will handle the actual change of
the MTU (explained in more detail below). Wait for them to be ready and
running.
So you are going to deploy another set of pods that do the configuration. I'm wondering if these pods will remain for some time after configuration and if they can run a healthcheck until all of the nodes are finished updating MTU. The check could be pinging from this pod to other "configuration pods" on other nodes with max MTU. If it doesn't come up after some time, then maybe an event can be posted or something to indicate to the user that MTU change failed.
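For illustration, such a check run from inside one of the configuration pods could look roughly like the following sketch; the MTU value, header overhead, and the way peer IPs are passed in are all illustrative assumptions:

```bash
# Hypothetical post-change health check: verify that full-MTU packets reach
# the configuration pods on other nodes.
NEW_MTU=1500                 # illustrative target MTU
PAYLOAD=$((NEW_MTU - 28))    # subtract IPv4 (20 B) + ICMP (8 B) headers

for peer in "$@"; do         # peer pod IPs passed in as arguments
    # -M do sets DF=1 so oversized packets are rejected instead of fragmented.
    if ! ping -c 3 -W 2 -M do -s "$PAYLOAD" "$peer" > /dev/null; then
        echo "MTU check to $peer failed" >&2
        exit 1
    fi
done
echo "MTU check passed for all peers"
```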
Usually when you do a new deployment you run some verification to check that the deployment has been done correctly and that the cluster is healthy. If this exists for OpenShift we can suggest in the documentation to run it again. Otherwise, it would probably be better to have a different set of pods with a liveness probe or the like rather than adding this to these specific pods.
we have the network check target pods, but I was thinking something specifically scoped to the MTU change to give the user a signal that the MTU update worked as part of the MTU update process itself. Like the pods that you launch for doing the MTU upgrade exit successfully and log some message like MTU upgrade complete, or if they check network connectivity and something is now broken, they either crash or post an event to their pod saying MTU upgrade problem. If you think it's not necessary then that's fine to ignore.
I would probably then use the network check target pods and enhance that for any specific MTU verification we think we need to do. Do you know where I can check them out?
These MTU change pods only change the MTU of pods, which is an operation for which we should know definitively if it succeeded or not, and is only one step of a 3 step process which also includes changing the host sdn interfaces MTU and the host external interfaces MTU, so I feel that a final verification of the MTU in these pods could be out of place.
I am working on a section to analyze this.
The OVS side of any veth pair should have an MTU of 65000, similar to what we appear to do for geneve (and vxlan):
Then we never have to change them. There is no reason to deal with MTU issues inside a bridge/switch. If the datagram is really too large, the other side of the veth pair (e.g. the pod) will have the L3 knowledge to know what to do. It might make sense to do this first, as part of a z-stream, so that this enhancement doesn't have to worry about the lack of an atomic change of both the pod veth and the "OVS veth".
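For reference, a hedged example of how the OVS-side MTU could be raised with standard OVS tooling; the interface name is a placeholder:

```bash
# Hypothetical illustration: request a large MTU on the OVS side of a veth
# pair, similar to what is already done for the geneve/vxlan interfaces.
ovs-vsctl set Interface <ovs-side-veth> mtu_request=65000
```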
I think this is generally a good idea so we don't have to worry about managing it, but it won't help with traffic itself.
If we are sending
### Non goals

* Change the MTU without service disruption.
Suggested change:
- * Change the MTU without service disruption.
+ * Change the MTU with absolutely no service disruption.
9. Set the new MTU value to the applied-cluster config map AND wait for pods of
   step 3 to complete successfully.
step 5
10. If any of the previous steps (8,9) failed, reboot the node, wait for the
    kubelet to be reporting as Ready again.
If any of the previous steps (8,9) failed on any nodes, drain and reboot the failed nodes, one at a time, and wait for each one to be reporting as Ready again.
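For illustration, the per-node recovery sequence being suggested could look roughly like this; the exact flags and the timeout are illustrative, not part of the proposal:

```bash
# Hypothetical per-node recovery, applied to failed nodes one at a time.
oc adm drain <node-name> --ignore-daemonsets --delete-emptydir-data --force

# Reboot the node (e.g. from its console or via `oc debug node/<node-name>`),
# then wait for it to report Ready again before moving on to the next node.
oc wait --for=condition=Ready "node/<node-name>" --timeout=30m
oc adm uncordon <node-name>
```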
11. Upon completion, set conditions to:
    - Progressing: false
    - Degraded: false
- Upgradeable: true
- Degraded: true
Update the operator configuration status with a description of the problem.
At this point the process is interrupted and we require manual intervention.
8. Force a rollout of the ovnkube-node daemonset. This will ensure
So this means the ovnkube upgrade is totally out of sync with the pod-level upgrades, and every node needs to wait for every other node to finish its pod-level upgrades before any of them can do the ovnkube-level upgrade.
A better approach might be: instead of having CNO force a re-rollout of the DaemonSet, just have the step 5 pod kill the local ovnkube-node process, forcing it to be restarted.
And then it could even choose to do that step before or after the pod-level fixes, depending on which direction the MTU is changing in...
The thing is that if we restart ovn-kube after the step 5 pod changes the MTUs, there is a window in between where new pods may be created with the old MTU.
That's why we restart ovn-kube first: we'll do the roll-out with max unavailability so that it is quick, and then we let the step 5 pod proceed with the MTU changes. Yes, it is out of sync, but hopefully quick enough.
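For illustration, the two mechanisms discussed here could look roughly like this; the namespace and daemonset names are the usual OVN-Kubernetes ones, and the local-kill variant is only a sketch of the reviewer's idea:

```bash
# Option in the current proposal: CNO forces a rollout of the daemonset.
oc -n openshift-ovn-kubernetes rollout restart daemonset/ovnkube-node
oc -n openshift-ovn-kubernetes rollout status daemonset/ovnkube-node

# Reviewer's alternative: the step 5 pod kills only the local ovnkube-node
# process so just that node's pod restarts (assumes a privileged, host-PID
# pod; the process name is an assumption for illustration).
pkill -x ovnkube
```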
An administrator should be able to change the cluster network MTU through
CNO configuration change. This would encompass the following tasks:

##### Implement a pod that changes the actual MTU on running pods
So again, the parts talking about implementation details don't belong in "User Stories". And they're redundant with what you've already said, so you can just remove them.
I will move them to a specific section. The thing is that there is no way to map (user) stories here to (non-user) stories in Jira.
### Risks and Mitigations

* If unexpected problems ocurr this procedure, the mitigation is an automated
Suggested change:
- * If unexpected problems ocurr this procedure, the mitigation is an automated
+ * If unexpected problems occur during this procedure, the mitigation is an automated
There are circumstances that prevent an endpoint from being aware of the actual
MTU to a destination, which depends on Path MTU discovery and specific ICMP
`FRAG_NEEDED` messages:
which in general seem to not work over OVS
## Alternatives

### New ovn-k setting: `routable-mtu`
So this sounds better than the proposed solution... why aren't we doing it this way?
A double rolling reboot seemed unacceptable, but I don't know what the latest stance on it is. Perhaps @vpickard can comment on this.
Yes, I was concerned that 2 reboots would not be acceptable from a customer perspective. @mcurry-rh What are your thoughts on having to perform 2 reboots to change the mtu?
Replicating the feedback we got from @mcurry-rh:
not ideal...acceptable...MTU adjustment is a rare event, so 2 reboots, while painful, is not fatal and achieves the objective
So the second alternative is based on already-available node maintenance knowledge; it is simpler to implement and a safer approach all around, while the main alternative is more efficient at the cost of that safety. We could also prototype it.
@abhat @trozet @knobunc @dcbw we would need to make a call on this. Do you have any opinion?
So, if the cluster is actually "broken" because of the bad MTU, then having to do two reboots isn't that bad since you're probably not running anything useful anyway.
And if it's not broken, then the MTU change probably isn't urgent, and the procedure doesn't actually require that the two reboots happen back-to-back; they could happen 24 hours apart or something. (Right? The cluster is stable/consistent in the inter-reboot phase?) So we could even just make it so that the CNO doesn't initiate any rolling reboots itself, it just does:
- CNO makes the initial change to mtu/routable-mtu
- CNO observes nodes until it sees that every node has rebooted (for whatever reason) and is using the changed configuration.
- CNO makes the second change to mtu/routable-mtu
- CNO observes nodes until it sees that every node has rebooted and is using the changed configuration.
- CNO updates the operator status accordingly
So then the admin could schedule two sets of rolling reboots on consecutive nights, or even just make the config change and then forget about it, and the first change would complete the next time they did a z-stream update and the second change would complete after the next update after that.
(Right? The cluster is stable/consistent in the inter-reboot phase?)
Yes.
"So, if the cluster is actually "broken" because of the bad MTU" -- That is not always a safe assumption. One case we had was where a customer had a large, running cluster and wanted to add new nodes. But the new nodes were on OpenShift and they needed to drop the MTU to allow for the VxLAN header in the OSP networking. I assume most cases will be like that, otherwise they could just reinstall...
"So, if the cluster is actually "broken" because of the bad MTU" -- That is not always a safe assumption.
Hence the "if"
since the actual interfaces MTU did not change they will not drop traffic
coming from other nodes.
* Set in ovn-config a `mtu` equal to `routable-mtu` or replace `mtu` with the
  `routing-mtu` value and remove the latter.
I'm not sure what routing-mtu is here?
That should be routable-mtu
traffic drop is expected.

Increase example:
* Set in ovn-config the actual `mtu` as `routable-mtu` and a new `mtu` setting
This is confusing... I would suggest just using some numbers in your example.
Prototyped it in ovn-kubernetes/ovn-kubernetes#2654, perhaps the description I gave there is easier to understand:
routable-mtu setting is introduced to facilitate a procedure allowing to
change the MTU on a running cluster with minimum service disruption.
Given current and target MTU values:
1. Set mtu to the higher MTU value and routable-mtu to the lower
   MTU value.
2. Do a rolling reboot. As a node restarts, routable-mtu is set on
   all appropriate routes while interfaces have mtu configured.
   The node will effectively use the lower routable-mtu for outgoing
   traffic, but be able to handle incoming traffic up to the higher
   mtu.
3. Change the MTU on all interfaces not handled by ovn-k to the target
   MTU value. Since the MTU effectively used in the cluster is the lower
   one, this has no impact on traffic.
4. Set mtu to the target MTU value and unset routable-mtu.
5. Do a rolling reboot. As a node restarts, the target MTU value is set
   on the interfaces and the routes are reset to default MTU values.
   Since the MTU effectively used on other nodes of the cluster is the
   lower one but they are able to handle the higher one, this has no
   impact on traffic.
routable-mtu is set as the MTU for the following routes:
* pod default route
* non link scoped management port route
* services route
* link scoped node routes
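To make the mechanism concrete, here is a hedged sketch of what the intermediate state of step 2 boils down to on a node; the interface, gateway, and MTU values are illustrative only, and the real implementation applies this to the pod, management-port, and service routes listed above rather than the host default route:

```bash
# Hypothetical illustration of the mtu / routable-mtu split on one node:
# the interface accepts the higher MTU, while the route caps what we send.
ip link set dev br-ex mtu 9000                            # higher "mtu"
ip route replace default via 10.0.0.1 dev br-ex mtu 1400  # lower "routable-mtu"

# After steps 4/5, the route-level cap is removed and only the interface MTU
# remains in effect.
ip route replace default via 10.0.0.1 dev br-ex
```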
Closed in favor of #963 which expands on the alternative of performing the MTU change through rolling reboots.
This is a reboot of #603 restricted to MTU changes.
/cc @mccv1r0
For MTU changes
/cc @trozet
For OVN kubernetes
/cc @danwinship @knobunc @dcbw
For general thoughts