
Add configurable reconciliation loop for pods, namespaces, and network policies #3772

Closed

Conversation

@naemono (Contributor) commented Feb 13, 2020

re #3764

This adds a configurable reconciliation loop option, which defaults to the existing behavior of 0s (no reconciliation loop), for pods, namespaces, and network policies. We have been testing these changes in 7 internal non-production clusters with a reconciliation interval of > 0s for the past week with no noticeable negative impact, and will be rolling them out to our production clusters in the coming week. We also have not had a single recurrence of the issue described in #3764 since deploying this change.
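For context, a minimal sketch of what such a ticker-driven loop can look like; the function and parameter names are illustrative, not the PR's actual code. An interval of 0 preserves the existing behavior (no periodic reconciliation):

```go
package main

import (
	"fmt"
	"time"
)

// runReconcileLoop re-runs reconcile every interval. An interval of 0
// disables the loop entirely, matching the existing default behavior.
func runReconcileLoop(interval time.Duration, reconcile func(), stop <-chan struct{}) {
	if interval <= 0 {
		return // 0s: no reconciliation loop
	}
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			reconcile() // re-process pods, namespaces, and network policies
		case <-stop:
			return
		}
	}
}

func main() {
	stop := make(chan struct{})
	go runReconcileLoop(5*time.Second, func() { fmt.Println("reconciling") }, stop)
	time.Sleep(12 * time.Second)
	close(stop)
}
```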

@bboreham (Contributor)

Thanks for the PR! Some unit tests failed; please take a look at https://circleci.com/gh/weaveworks/weave/13224

@naemono (Contributor, Author) commented Feb 17, 2020

> Thanks for the PR! Some unit tests failed; please take a look at https://circleci.com/gh/weaveworks/weave/13224

This is now resolved. The ResourceVersion of objects wasn't being emulated in the tests.
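For illustration, the kind of fixture change involved might look like this sketch (hypothetical names, not the actual test code); fake objects need their ResourceVersion set, and bumped across updates, or every update looks identical to the handler:

```go
package npc_test

import (
	corev1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// makeTestPod builds a fake pod with an explicit ResourceVersion; leaving
// it empty made consecutive "updates" indistinguishable in the tests.
func makeTestPod(rv string) *corev1.Pod {
	return &corev1.Pod{
		ObjectMeta: metav1.ObjectMeta{
			Name:            "test-pod",
			Namespace:       "default",
			ResourceVersion: rv,
		},
	}
}
```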

@naemono (Contributor, Author) commented Feb 17, 2020

```
#!/bin/bash -eo pipefail
bin/provision_test_vms.sh
Cannot run smoke tests: no secret key

Exited with code exit status 1
```

Is this normal? Do I need to do something to get the smoke tests to run?

@bboreham (Contributor)

The smoke-tests run with credentials on Google Cloud (accessed via the "secret key"); we don't allow PRs from other repos to see those credentials. I've pushed your branch to the main repo so it will run the smoke-tests.

@bboreham (Contributor) commented Feb 26, 2020

I would like to clarify what this PR achieves. Take the "update pod" event: if a message comes in saying the pod data was updated, but the data is the same as the previous version, then weave-npc will do nothing.

So this PR will help in the case that the Kubernetes data in memory is out of sync with the api-server, but does not help in the case that the iptables rules or ipsets are out of sync with the data.

Is that your understanding? Is there any evidence that #3764 is caused by the first kind of mismatch rather than the second?
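To make the distinction concrete, here is a sketch of the short-circuit being described (illustrative names, not weave-npc's actual handler): a resync re-delivers unchanged objects, and the update path drops them, so on-host ipsets and iptables rules are never re-checked:

```go
package npc

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
)

// onPodUpdate receives every update event, including the no-op updates
// that a periodic resync replays from the local cache.
func onPodUpdate(oldPod, newPod *corev1.Pod) {
	if oldPod.ResourceVersion == newPod.ResourceVersion {
		// Unchanged object: nothing is re-applied to ipsets or iptables,
		// so drift on the host itself goes uncorrected.
		return
	}
	fmt.Printf("pod %s/%s changed; updating ipsets\n", newPod.Namespace, newPod.Name)
}
```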

@naemono (Contributor, Author) commented Feb 26, 2020

Yes, that is a good point: this wouldn't have any effect if the ipsets on the host itself were out of sync during a reconciliation, since it isn't doing a comparison. But it would catch one of the two out-of-sync scenarios you reference. Would you prefer that this PR include a data comparison during reconciliation? That seems like it would make a lot of sense.
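A hypothetical sketch of that comparison, using plain string sets rather than weave-npc's real ipset types: diff the entries derived from the Kubernetes objects against what is actually present on the host, and repair drift in both directions.

```go
package npc

// reconcileEntries diffs desired state (derived from Kubernetes objects)
// against actual state (read back from the host) and repairs drift in
// both directions. The set representation and callbacks are illustrative.
func reconcileEntries(desired, actual map[string]struct{}, add, del func(entry string)) {
	for entry := range desired {
		if _, ok := actual[entry]; !ok {
			add(entry) // missing on the host: re-add it
		}
	}
	for entry := range actual {
		if _, ok := desired[entry]; !ok {
			del(entry) // stale on the host: remove it
		}
	}
}
```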

@bboreham (Contributor)

I think these two things are orthogonal, and I always prefer to do unrelated changes in separate PRs.
Adding a periodic reconciliation from the state of the Kubernetes objects to iptables is a good idea, as discussed at #3764.

I would not call this one "reconciliation"; Kubernetes calls it "resync".
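For reference, a minimal sketch of that client-go resync knob (assuming in-cluster config): a shared informer factory created with a non-zero resync period replays every cached object to the registered handlers on that period, even when nothing changed on the api-server.

```go
package main

import (
	"time"

	"k8s.io/client-go/informers"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

func main() {
	config, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	clientset := kubernetes.NewForConfigOrDie(config)

	// A 30s defaultResync makes each started informer re-deliver its
	// cached objects as Update events every 30 seconds.
	factory := informers.NewSharedInformerFactory(clientset, 30*time.Second)
	_ = factory.Core().V1().Pods().Informer() // request the pod informer

	stop := make(chan struct{})
	factory.Start(stop)
	factory.WaitForCacheSync(stop)
	select {} // run until killed
}
```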

@bboreham (Contributor)

@naemono are you likely to return to this?
Note some work at #3802 for recreating iptables rules when they get wiped out.

@naemono (Contributor, Author) commented May 27, 2020

I will get back to this in the next sprint, starting next week. Sorry for the delays.

@naemono (Contributor, Author) commented Jun 3, 2020

Closing this, as #3792 solves the actual underlying problem: the weave-npc application continuing to run when a goroutine panics.

@naemono naemono closed this Jun 3, 2020
@bboreham bboreham added this to the n/a milestone Jul 29, 2020