Propagate downstream timeout to upstream through multiple Envoys #7358

snowp · 2019-06-21T20:36:06Z

When running Envoy on both egress and ingress, the client will provide a timeout header to the egress Envoy, which will propagate the expected upstream timeout in x-envoy-expected-rq-timeout-ms. The upstream Envoy will not read this header, so it will resolve a new timeout value that is set as the expected timeout for the upstream service. This means that the deadline expected by the egress Envoy is ignored in favor of the ingress Envoy, resulting in the upstream service having an incorrect view of the actual deadline.

It seems like either

inserting x-envoy-rq-timeout-ms with the expected timeout on egress
parsing x-envoy-expected-rq-timeout-ms as the deadline on the ingress side

would solve the issue (likely guarded by a config flag).

Thoughts?

The text was updated successfully, but these errors were encountered:

mattklein123 · 2019-06-21T21:33:49Z

Yeah agreed I think either would solve. I don't have a super strong opinion on which one.

This does bring up the general topic of deadline propagation which we haven't really tackled yet in a holistic way. I've thought at some point we may also want to stick the deadline in trace context baggage since this would also propagate through app calls, though that is a larger problem than you are trying to solve here.

ramaraochavali · 2019-06-25T10:23:27Z

+1. It would be very useful. Inserting x-envoy-rq-timeout-ms on the ingress side may be good idea. That might require users to set use_remote_address to ensure that it is not sanitized?

stale · 2019-07-25T10:25:14Z

This issue has been automatically marked as stale because it has not had activity in the last 30 days. It will be closed in the next 7 days unless it is tagged "help wanted" or other activity occurs. Thank you for your contributions.

nezdolik · 2019-08-01T15:18:17Z

@snowp would like to help out with this.

snowp · 2019-08-01T17:29:07Z

@nezdolik Great, I'll assign this one to you. Feel free to ping me with any questions

snowp changed the title ~~Propagate downstream timeout to upstream~~ Propagate downstream timeout to upstream through multiple Envoys Jun 21, 2019

mattklein123 added the enhancement Feature requests. Not bugs or questions. label Jun 21, 2019

stale bot added the stale stalebot believes this issue/PR has not been touched recently label Jul 25, 2019

snowp added the help wanted Needs help! label Jul 25, 2019

stale bot removed the stale stalebot believes this issue/PR has not been touched recently label Jul 25, 2019

snowp assigned nezdolik Aug 1, 2019

nezdolik mentioned this issue Aug 27, 2019

router: set correct timeout for egress->ingress envoys #8051

Merged

mattklein123 closed this as completed in #8051 Oct 8, 2019

blake mentioned this issue Feb 18, 2020

Consul connect provide a way to configure envoy route timeout hashicorp/consul#6382

Open

freddygv mentioned this issue Jun 16, 2021

respect_expected_rq_timeout is not leading to expected behavior #17016

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Propagate downstream timeout to upstream through multiple Envoys #7358

Propagate downstream timeout to upstream through multiple Envoys #7358

snowp commented Jun 21, 2019

mattklein123 commented Jun 21, 2019

ramaraochavali commented Jun 25, 2019

stale bot commented Jul 25, 2019

nezdolik commented Aug 1, 2019

snowp commented Aug 1, 2019

Propagate downstream timeout to upstream through multiple Envoys #7358

Propagate downstream timeout to upstream through multiple Envoys #7358

Comments

snowp commented Jun 21, 2019

mattklein123 commented Jun 21, 2019

ramaraochavali commented Jun 25, 2019

stale bot commented Jul 25, 2019

nezdolik commented Aug 1, 2019

snowp commented Aug 1, 2019