Avoid repeated push of rejected configuration #46

kyessenov · 2018-02-28T20:28:50Z

If envoy rejects a config, the snapshot cache will attempt to push the same config immediately after envoy requests it again. If envoy does not limit its requests, the server will drive envoy into a loop of request-receive-reject, potentially causing unnecessary CPU load.

kyessenov · 2018-03-01T19:23:23Z

Tracking issue in envoy: envoyproxy/envoy#2169

gsagula · 2018-03-02T20:37:51Z

@kyessenov It saw that you are tracking this 2169 ^^. I'll be pushing the PR over the weekend. Quick question: Do you know by any chance what would be a good baseline to detect continuous bursts (requests/sec). We are not making it configurable for now. Thanks!

kyessenov · 2018-03-02T20:42:06Z

I don't think I know the number that was validated in the real world. I think something <100qps is a realistic upper-bound for config updates per proxy. It might be worth trying to push invalid configs for each xDS at certain qps, and see CPU impact. Currently, envoy just consumes all available CPU while trying to invalidate the config.

gsagula · 2018-03-03T18:57:01Z

I will try that and if I can get a more accurate number, but something like <100qps seems realistic for me too. Thanks!

mihaitodor · 2020-08-05T17:18:13Z

I think this has been fixed here: envoyproxy/envoy#4787 It exposes rate_limit_settings in the ads_config of the Envoy config.

kyessenov · 2020-08-05T17:34:07Z

Correct. It would be nice to wire the feedback from the server. Some messages are useful (invalid config, missing reference, etc).

mihaitodor · 2020-08-05T18:08:20Z

Hmmm... I'm using the Callbacks to implement the OnStreamResponse method and I get this message when adding a duplicate listener: Error adding/updating listener(s) whoami-http-one:10003: cannot bind '192.168.168.168:10003': Address already in use. It also provides error code 13. The issue was that it was flooding my service by calling OnStreamResponse like crazy if this misconfiguration happened, until I configured the rate_limit_settings in Envoy to make it repeat the request less often.

Not sure if this is what you're referring to, though.

kyessenov · 2020-08-05T20:18:07Z

Yeah, it is just there is no way to plug in server side rate limiter ATM. Some of these errors are ephemeral so not sure what action to take anyways until it settles (e.g. the older listener stops listening in your case)

mihaitodor · 2020-08-06T14:03:22Z

Ah, I see what you mean now. Thanks for the clarification! I'm not sure either how the server should behave, but the default rate_limit_settings in Envoy seem excessive to me. Retrying so fast can easily DDoS the go-control-plane server and I'm curious why anyone would need these retries to happen faster than once / second or so...

github-actions · 2021-04-06T05:05:16Z

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or "no stalebot" or other activity occurs. Thank you for your contributions.

github-actions · 2021-04-13T08:08:12Z

This issue has been automatically closed because it has not had activity in the last 37 days. If this issue is still valid, please ping a maintainer and ask them to label it as "help wanted" or "no stalebot". Thank you for your contributions.

akshaysngupta · 2023-10-11T01:12:26Z

Another option could be for control plane to allow implementors to choose if they want to retry on a specific error.

In following code, we can introduce a callback for OnStreamResponseNacked and OnStreamDeltaResponseNacked which can evaluate the error and decide to retry.

kyessenov mentioned this issue Feb 28, 2018

Add Fetch and non-ADS mode to the snapshot cache #45

Merged

kyessenov added the enhancement label Jul 3, 2018

jpeach mentioned this issue Aug 10, 2020

internal/grpc: properly handle Envoy ACK/NACK protocol projectcontour/contour#1176

Open

youngnick mentioned this issue Oct 19, 2020

Support custom JSON fields for Envoy access logs projectcontour/contour#3032

Closed

github-actions bot added the stale label Apr 6, 2021

github-actions bot closed this as completed Apr 13, 2021

easwars mentioned this issue Apr 21, 2021

snapshot cache does not respond with resources #426

Closed

relistan mentioned this issue Sep 13, 2021

Check for port collisions before telling Envoy NinesStack/sidecar#64

Merged

akshaysngupta mentioned this issue Oct 11, 2023

Provide callback to decide if nack should be retried #794

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Avoid repeated push of rejected configuration #46

Avoid repeated push of rejected configuration #46

kyessenov commented Feb 28, 2018

kyessenov commented Mar 1, 2018

gsagula commented Mar 2, 2018

kyessenov commented Mar 2, 2018

gsagula commented Mar 3, 2018

mihaitodor commented Aug 5, 2020

kyessenov commented Aug 5, 2020

mihaitodor commented Aug 5, 2020 •

edited

Loading

kyessenov commented Aug 5, 2020

mihaitodor commented Aug 6, 2020

github-actions bot commented Apr 6, 2021

github-actions bot commented Apr 13, 2021

akshaysngupta commented Oct 11, 2023 •

edited

Loading

Avoid repeated push of rejected configuration #46

Avoid repeated push of rejected configuration #46

Comments

kyessenov commented Feb 28, 2018

kyessenov commented Mar 1, 2018

gsagula commented Mar 2, 2018

kyessenov commented Mar 2, 2018

gsagula commented Mar 3, 2018

mihaitodor commented Aug 5, 2020

kyessenov commented Aug 5, 2020

mihaitodor commented Aug 5, 2020 • edited Loading

kyessenov commented Aug 5, 2020

mihaitodor commented Aug 6, 2020

github-actions bot commented Apr 6, 2021

github-actions bot commented Apr 13, 2021

akshaysngupta commented Oct 11, 2023 • edited Loading

mihaitodor commented Aug 5, 2020 •

edited

Loading

akshaysngupta commented Oct 11, 2023 •

edited

Loading