
Remove refetching from resourceWatcher #14262

Merged
merged 3 commits into master from tross/persistent_resource_watcher on Jul 11, 2022

Conversation

rosstimothy
Contributor

The resourceWatcher is meant to be a long-lived way for a component
to receive events about a particular resource from an upstream cache.
However, there was a refetching mechanism that would cause a healthy
and subscribed watcher to be closed, the resourceWatcher to fetch all
the resource types it is watching from the upstream cache, and a
new watcher to be created every 10 minutes. This caused unneeded load on
the upstream cache and also consumed network bandwidth.

This removes the refetching behavior entirely to ensure watchers
aren't unnecessarily closed. The change should be transparent to
users of the resourceWatcher, but should noticeably reduce both
the number of init events being emitted throughout a cluster
and the number of cache reads.

Fixes #14234

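To make the shape of the change concrete, here is a minimal Go sketch of a long-lived watcher loop that only re-fetches and re-subscribes when its event stream actually fails, rather than on a fixed 10-minute timer. Every type and function name below (Event, EventStream, upstream, watchResources, the 5-second backoff) is a hypothetical stand-in, not the actual lib/services/watcher.go code.

```go
// Package watcher: a minimal sketch of a long-lived resource watcher loop.
package watcher

import (
	"context"
	"log"
	"time"
)

// Event and EventStream are hypothetical stand-ins for the upstream
// cache's watch API; they are not Teleport types.
type Event struct{ Kind, Name string }

type EventStream interface {
	Events() <-chan Event  // resource events from the cache
	Done() <-chan struct{} // closed when the stream terminates
	Error() error          // reason the stream terminated
	Close() error
}

type upstream interface {
	NewWatch(ctx context.Context, kinds []string) (EventStream, error)
	FetchAll(ctx context.Context, kinds []string) ([]Event, error)
}

// watchResources stays subscribed for as long as the stream is healthy.
// It only re-fetches and re-subscribes after the stream fails; there is
// deliberately no periodic refetch ticker.
func watchResources(ctx context.Context, up upstream, kinds []string, out chan<- Event) error {
	const retryAfter = 5 * time.Second // hypothetical backoff interval
	for {
		err := watchOnce(ctx, up, kinds, out)
		if ctx.Err() != nil {
			return ctx.Err()
		}
		log.Printf("watch failed, retrying in %v: %v", retryAfter, err)
		select {
		case <-time.After(retryAfter):
		case <-ctx.Done():
			return ctx.Err()
		}
	}
}

func watchOnce(ctx context.Context, up upstream, kinds []string, out chan<- Event) error {
	stream, err := up.NewWatch(ctx, kinds)
	if err != nil {
		return err
	}
	defer stream.Close()

	// One initial fetch to establish current state for subscribers.
	initial, err := up.FetchAll(ctx, kinds)
	if err != nil {
		return err
	}
	for _, ev := range initial {
		out <- ev
	}

	// Forward events until the stream dies or the context is cancelled.
	for {
		select {
		case ev := <-stream.Events():
			out <- ev
		case <-stream.Done():
			return stream.Error()
		case <-ctx.Done():
			return ctx.Err()
		}
	}
}
```

The point of the sketch is that the only path back to a full fetch is a failed stream; a healthy, subscribed watcher is never torn down on a timer.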
@rosstimothy
Contributor Author

Metrics from running a 10k cluster off this branch. Compare against the snapshot from #14234 to see the reduction in init and cache reads.
(Screenshot: Grafana dashboard for a Teleport tenant, captured 2022-07-08 17:27)

Contributor

@fspmarshall fspmarshall left a comment


red diffs ❤️

@rosstimothy rosstimothy marked this pull request as ready for review July 8, 2022 21:33
@github-actions github-actions bot requested review from tcsc and zmb3 July 8, 2022 21:34
@rosstimothy rosstimothy enabled auto-merge (squash) July 11, 2022 12:20
@rosstimothy rosstimothy merged commit dea633f into master Jul 11, 2022
@github-actions

@rosstimothy See the table below for backport results.

| Branch | Result |
| --- | --- |
| branch/v10 | Create PR |
| branch/v7 | Failed |
| branch/v8 | Create PR |
| branch/v9 | Create PR |

rosstimothy added a commit that referenced this pull request Jul 11, 2022
The resourceWatcher is meant to be a long-lived way for a component
to receive events about a particular resource from an upstream cache.
However, there was a refetching mechanism that would cause a healthy
and subscribed watcher to be closed, the resourceWatcher to fetch all
the resource types it is watching from the upstream cache and to create a
new watcher **every 10 minutes**. This causes unneeded load on
the upstream cache and also eats up network bandwidth.

This removes the refetching behavior entirely to ensure watchers
aren't unnecessarily closed. The change should be transparent to
users of the resourceWatcher, but should noticeably reduce both
the number of init events being emitted throughout a cluster
and the number of cache reads.

Fixes #14234

(cherry picked from commit dea633f)

# Conflicts:
#	lib/services/watcher.go
@rosstimothy
Contributor Author

💚 All backports created successfully

| Status | Branch | Result |
| --- | --- | --- |
| | branch/v7 | |

Questions?

Please refer to the Backport tool documentation

rosstimothy added a commit that referenced this pull request Jul 12, 2022
Remove refetching from resourceWatcher (#14262)

The resourceWatcher is meant to be a long-lived way for a component
to receive events about a particular resource from an upstream cache.
However, there was a refetching mechanism that would cause a healthy
and subscribed watcher to be closed, the resourceWatcher to fetch all
the resource types it is watching from the upstream cache and to create a
new watcher **every 10 minutes**. This causes unneeded load on
the upstream cache and also eats up network bandwidth.

This removes the refetching behavior entirely to ensure watchers
aren't unnecessarily closed. The change should be transparent to
users of the resourceWatcher, but should noticeably reduce both
the number of init events being emitted throughout a cluster
and the number of cache reads.

Fixes #14234

(cherry picked from commit dea633f)

# Conflicts:
#	lib/services/watcher.go
@zmb3 zmb3 deleted the tross/persistent_resource_watcher branch September 9, 2022 18:55
rosstimothy added a commit that referenced this pull request Nov 2, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
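For context, a hedged Go sketch of what such a resync ticker could look like. The names here (proxyWatcher, agentConn, SendDiscoveryRequest, trackerExpiry, and the choice to fire at half the expiry interval) are hypothetical stand-ins, not the actual localsite implementation.

```go
// Package localsite: a sketch of periodically re-advertising proxies to an agent.
package localsite

import (
	"context"
	"time"
)

// Proxy and the interfaces below are hypothetical stand-ins, not the real
// Teleport types.
type Proxy struct{ ID, Addr string }

type proxyWatcher interface {
	// CurrentProxies returns the latest proxy set known to the watcher.
	CurrentProxies() []Proxy
}

type agentConn interface {
	// SendDiscoveryRequest pushes the proxy list to the connected agent.
	SendDiscoveryRequest(ctx context.Context, proxies []Proxy) error
}

// periodicProxySync re-sends the current proxy set on an interval shorter
// than the agent-side tracker expiry (halving it here is an assumption),
// so a proxy that still exists in the cluster is never allowed to expire
// out of the agent's tracker.
func periodicProxySync(ctx context.Context, w proxyWatcher, conn agentConn, trackerExpiry time.Duration) {
	ticker := time.NewTicker(trackerExpiry / 2)
	defer ticker.Stop()
	for {
		select {
		case <-ticker.C:
			if proxies := w.CurrentProxies(); len(proxies) > 0 {
				// In this sketch errors are dropped; real code would log
				// them and rely on the next tick to retry.
				_ = conn.SendDiscoveryRequest(ctx, proxies)
			}
		case <-ctx.Done():
			return
		}
	}
}
```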
rosstimothy added a commit that referenced this pull request Nov 3, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 4, 2022
* Periodically resync proxies to agents

Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
github-actions bot pushed a commit that referenced this pull request Nov 4, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 4, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 4, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 4, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 7, 2022
* Periodically resync proxies to agents (#18050)

Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 7, 2022
* Periodically resync proxies to agents

Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 7, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.
rosstimothy added a commit that referenced this pull request Nov 7, 2022
Prior to #14262, resource watchers would periodically close their watcher,
create a new one and refetch the current set of resources. It turns out
that the reverse tunnel subsystem relied on this behavior to periodically
broadcast the list of proxies to agents during steady state. Now that
watchers are persistent and no longer perform a refetch, agents that are
unable to connect to a proxy expire them after a period of time, and
since they never receive the periodic refresh, they never attempt to
connect to said proxy again.

To remedy this, a new ticker is added to the `localsite` that grabs
the current set of proxies from its proxy watcher and sends a discovery
request to the agent. The ticker is set to fire before the tracker
would expire the proxy, so that if a proxy exists in the cluster the
agent will continually try to connect to it.

Successfully merging this pull request may close these issues.

Periodic cache init events in healthy cluster