
Fix high memory usage in retry middleware #2740

Merged: 3 commits into traefik:master on Jan 26, 2018

Conversation

m3co-code (Contributor)

This PR reduces the amount of RAM Traefik needs to transfer the response body to the client when the retry middleware is enabled. This is accomplished by rebuilding the response recorder of the retry middleware. The principle is now quite simple: the new response recorder no longer holds a temporary buffer, but delegates calls directly to the original response writer whenever the response should not be retried. This means the only "overhead" left is the 32KB buffer the standard library uses in its Write calls on the HTTP connection.
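
To make the delegation idea concrete, here is a minimal sketch in Go of such a pass-through writer, under illustrative field names (this is not the verbatim code from the PR):

package middlewares

import "net/http"

// retryResponseWriter streams directly to the client instead of recording the
// body. shouldRetry reports whether the current attempt may still be retried.
type retryResponseWriter struct {
	responseWriter http.ResponseWriter
	shouldRetry    func() bool
}

func (rr *retryResponseWriter) Header() http.Header { return rr.responseWriter.Header() }

func (rr *retryResponseWriter) Write(buf []byte) (int, error) {
	if rr.shouldRetry() {
		// A retryable failure: drop this body instead of buffering it.
		return len(buf), nil
	}
	// No temporary buffer; the only remaining copy is the ~32KB chunk the
	// standard library uses internally when writing to the connection.
	return rr.responseWriter.Write(buf)
}

func (rr *retryResponseWriter) WriteHeader(code int) {
	if rr.shouldRetry() {
		return
	}
	rr.responseWriter.WriteHeader(code)
}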

As the former response recorder was also used by the error handler, I moved it there. I think the error handler is not that critical when it comes to response sizes, e.g. no files should be transferred through it, but there is still room for improvement in the future.

To get something started, I also created a very basic integration test for retries. It could be extended in the future with other cases like WebSockets, but I didn't want to make this PR too huge and time is sparse :-)

Below are two snapshots from the pprof heap utility. In both cases my setup is the same: I am proxying through Traefik to a service that delivers a 1.5 GB file. At the moment when about 1.0 GB had been downloaded, I called the /heap endpoint to see the difference. Please note the total values: the old version holds 235.86MB, the new one only 2.06MB!
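
For reference, the heap profiles below come from Go's standard net/http/pprof endpoint; a minimal sketch of how such an endpoint is typically exposed (the generic pattern, not necessarily Traefik's exact wiring; the port 9090 is assumed to match the URLs below):

package main

import (
	"log"
	"net/http"
	_ "net/http/pprof" // registers the /debug/pprof/* handlers on http.DefaultServeMux
)

func main() {
	// The heap profile can then be inspected with:
	//   go tool pprof http://localhost:9090/debug/pprof/heap
	log.Fatal(http.ListenAndServe("localhost:9090", nil))
}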

old version

Fetching profile over HTTP from http://localhost:9090/debug/pprof/heap
Saved profile in /home/marco/pprof/pprof.traefik.alloc_objects.alloc_space.inuse_objects.inuse_space.012.pb.gz
File: traefik
Type: inuse_space
Time: Jan 23, 2018 at 11:51am (CET)
Entering interactive mode (type "help" for commands, "o" for options)
(pprof) top 10
Showing nodes accounting for 234.36MB, 99.36% of 235.86MB total
Dropped 28 nodes (cum <= 1.18MB)
Showing top 10 nodes out of 27
      flat  flat%   sum%        cum   cum%
  234.36MB 99.36% 99.36%   234.36MB 99.36%  bytes.makeSlice /usr/local/go/src/bytes/buffer.go
         0     0% 99.36%   234.36MB 99.36%  bytes.(*Buffer).Write /usr/local/go/src/bytes/buffer.go
         0     0% 99.36%   234.36MB 99.36%  bytes.(*Buffer).grow /usr/local/go/src/bytes/buffer.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*EmptyBackendHandler).ServeHTTP /home/marco/go/src/github.com/containous/traefik/middlewares/empty_backend_handler.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*HandlerSwitcher).ServeHTTP /home/marco/go/src/github.com/containous/traefik/middlewares/handlerSwitcher.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*Retry).ServeHTTP /home/marco/go/src/github.com/containous/traefik/middlewares/retry.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*StripPrefix).ServeHTTP /home/marco/go/src/github.com/containous/traefik/middlewares/stripPrefix.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*StripPrefix).serveRequest /home/marco/go/src/github.com/containous/traefik/middlewares/stripPrefix.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares.(*retryResponseRecorderWithoutCloseNotify).Write /home/marco/go/src/github.com/containous/traefik/middlewares/retry.go
         0     0% 99.36%   234.36MB 99.36%  github.com/containous/traefik/middlewares/accesslog.(*SaveBackend).ServeHTTP /home/marco/go/src/github.com/containous/traefik/middlewares/accesslog/save_backend.go

new version

Fetching profile over HTTP from http://localhost:9090/debug/pprof/heap
Saved profile in /home/marco/pprof/pprof.traefik.alloc_objects.alloc_space.inuse_objects.inuse_space.011.pb.gz
File: traefik
Type: inuse_space
Time: Jan 23, 2018 at 11:49am (CET)
Entering interactive mode (type "help" for commands, "o" for options)
(pprof) top
Showing nodes accounting for 2065.01kB, 100% of 2065.01kB total
Showing top 10 nodes out of 29
      flat  flat%   sum%        cum   cum%
 1536.84kB 74.42% 74.42%  1536.84kB 74.42%  reflect.mapassign /usr/local/go/src/runtime/hashmap.go
  528.17kB 25.58%   100%   528.17kB 25.58%  regexp.(*bitState).reset /usr/local/go/src/regexp/backtrack.go
         0     0%   100%  1536.84kB 74.42%  encoding/json.(*decodeState).object /usr/local/go/src/encoding/json/decode.go
         0     0%   100%  1536.84kB 74.42%  encoding/json.(*decodeState).unmarshal /usr/local/go/src/encoding/json/decode.go
         0     0%   100%  1536.84kB 74.42%  encoding/json.(*decodeState).value /usr/local/go/src/encoding/json/decode.go
         0     0%   100%  1536.84kB 74.42%  encoding/json.Unmarshal /usr/local/go/src/encoding/json/decode.go
         0     0%   100%  1536.84kB 74.42%  github.com/containous/traefik/configuration.init <autogenerated>
         0     0%   100%  1536.84kB 74.42%  github.com/containous/traefik/provider/kubernetes.init <autogenerated>
         0     0%   100%   528.17kB 25.58%  github.com/containous/traefik/vendor/github.com/containous/mux.(*Route).Match /home/marco/go/src/github.com/containous/traefik/vendor/github.com/containous/mux/route.go
         0     0%   100%   528.17kB 25.58%  github.com/containous/traefik/vendor/github.com/containous/mux.(*Router).Match /home/marco/go/src/github.com/containous/traefik/vendor/github.com/containous/mux/mux.go

@timoreimann (Contributor) left a comment

We have reviewed this internally already, so I'm able to give my LGTM right away.

@errm self-requested a review on January 23, 2018 22:06
@ldez (Contributor) commented Jan 24, 2018

@marco-jantke could you rebase? 🎢

@errm (Contributor) left a comment

LGTM, this is a very neat solution!

@m3co-code force-pushed the fix-high-memory-usage-in-retry branch from 1be1406 to 999eaff on January 25, 2018 08:01
@m3co-code (Contributor, Author)

@ldez done.

@ldez added the kind/bug/fix label on Jan 25, 2018
@@ -114,107 +100,69 @@ func (l RetryListeners) Retried(req *http.Request, attempt int) {
 	}
 }

-type retryResponseRecorder interface {
+type retryResponseWriter interface {
Contributor

Why rename retryResponseRecorder to retryResponseWriter and create an errorPagesResponseRecorder?

m3co-code (Contributor, Author)

I renamed it because the current implementation does not record anything anymore. It writes directly through to the original response writer when the request should not be retried, so it is no longer a recorder.

Regarding the errorPagesResponseRecorder:

As the former response recorder was also used by the error handler, I moved it there. I think the error handler is not that critical when it comes to response sizes, e.g. no files should be transferred through it, but there is still room for improvement in the future.

It's basically that the retryResponseRecorder was re-used in the error pages middleware and I didn't want to make changes there. As the error pages middleware is the last place where the original retryResponseRecorder is used, I renamed it accordingly.
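
For contrast, a short sketch of the buffering approach that stays behind for the error pages middleware (illustrative, not the verbatim source): the entire body is held in memory before anything reaches the client, which is exactly the behaviour that caused the high memory usage while it sat in the retry path.

package middlewares

import (
	"bytes"
	"net/http"
)

// errorPagesResponseRecorder keeps the whole response in memory so the error
// pages middleware can later decide whether to forward it or replace it.
type errorPagesResponseRecorder struct {
	responseWriter http.ResponseWriter
	body           bytes.Buffer
	code           int
}

func (r *errorPagesResponseRecorder) Header() http.Header { return r.responseWriter.Header() }

func (r *errorPagesResponseRecorder) Write(p []byte) (int, error) {
	// Every byte is buffered; for large bodies this is where the memory goes.
	return r.body.Write(p)
}

func (r *errorPagesResponseRecorder) WriteHeader(code int) { r.code = code }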

@iahmedov

@marco-jantke This looks great.

Just one question: have you tested the case where you are downloading, let's say, a 2 GB file from an endpoint and, when the download reaches 1 GB, you restart the endpoint? Is it going to continue correctly (without curl/wget retry options)?

The current version of the retry logic with the cache handled this correctly, because it caches all the data and only transmits it once Traefik has received everything from the endpoint. But here it looks like the data is streamed (maybe I am missing something, I haven't checked it out and tested it), and when the endpoint restarts once you reach 1 GB, the next retry transmits the 2 GB again. Since rr.responseWriter is not closed, in total you may receive 3 GB of data when it was only 2 GB.

m3co-code (Contributor, Author) commented Jan 26, 2018

@iahmedov very good point. I have to say I am utterly surprised, but the case you describe works exactly as expected!

What did I do? I have a simple file configuration with one frontend and two backends, which in turn are nginx servers that just serve files. I start both backends and Traefik and then request a file of about 1.5 GB through Traefik in the browser. The request goes to server A, and when the download is at 1 GB I kill server A hard. You can see a short pause in the download in the browser, and then it resumes at the proper location. I also verified that the result is not corrupted and has the correct size.

I am not sure what makes this work so well, but it should be some primitive of HTTP (chunked transfer, maybe?) or TCP(?).

@m3co-code (Contributor, Author)

After digging deeper and taking the debug logs apart, I figured out that this actually works thanks to chunked transfer encoding: the client (in my case Chromium) and the backend server (nginx) initiate the chunked transfer properly and behave correctly. To gather more knowledge, can you verify whether this approach also works in your scenario?

But knowing that there is a way to resume downloads with plain HTTP, without extra logic in the reverse proxy, I'd argue we should keep the proxy as simple as possible.

@iahmedov

I guess Traefik should define some kind of rules that it follows when retrying:

  • a retry should never happen once data has started streaming to the downstream client.
  • retry if Traefik could not connect to the endpoint.
  • retry if a 5xx error happened (this should be configurable; maybe someone wrote an API where even 5xx errors update database state, in which case a retry could update the data even more, think of a simple counter).
  • retry if requests are idempotent (plain GET requests that don't change server state; this should also be configurable to switch on or off).

Reason:

  • Data could be updated by a request; if you try again, you may update it multiple times.
  • As soon as a single byte has been received by the downstream client that requested data through Traefik, that client must handle backend termination by itself. Traefik should not handle this case, because it doesn't know how the backend works or what its properties are.

I think that issuing a retry after a single byte has already been streamed to the downstream client is wrong in itself: you never know how the backend behaves for a retried request that it had handled correctly but failed to deliver to you because of a network error.

@m3co-code (Contributor, Author)

a retry should never happen once data has started streaming to the downstream client.

This is the case. Retries only happen when a network error occurs during the initial connection establishment.

retry if Traefik could not connect to the endpoint.

As above.

retry if a 5xx error happened (this should be configurable; maybe someone wrote an API where even 5xx errors update database state, in which case a retry could update the data even more, think of a simple counter).

We had a discussion about this in the past and concluded that it is not safe to retry a request for which the backend has already delivered a 5xx header; the state in the backend server could already have been partially changed. If you think this is a valuable feature, though, please open a separate feature request where we can discuss it.

retry if requests are idempotent (plain GET requests that don't change server state; this should also be configurable to switch on or off).

Basically the same as above. You don't know in which environments Traefik will be used and whether everyone implements HTTP as intended; basically, you can't guarantee that applications change no state on a GET.
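
A hedged, self-contained sketch of the retry rule described above: an attempt is retried only when the proxy flagged a connection-level network error, which by definition happens before any response data was streamed to the client. Names such as attemptWriter and serveWithRetries are illustrative, not Traefik's API.

package middlewares

import (
	"context"
	"net/http"
)

type netErrKey struct{}

// attemptWriter drops the error body of a retryable failed attempt and streams
// everything else directly to the client.
type attemptWriter struct {
	http.ResponseWriter
	netErr   *bool
	mayRetry bool
}

func (a *attemptWriter) Write(p []byte) (int, error) {
	if *a.netErr && a.mayRetry {
		return len(p), nil // drop the error body; the next attempt writes the real response
	}
	return a.ResponseWriter.Write(p)
}

func (a *attemptWriter) WriteHeader(code int) {
	if *a.netErr && a.mayRetry {
		return
	}
	a.ResponseWriter.WriteHeader(code)
}

func serveWithRetries(rw http.ResponseWriter, req *http.Request, next http.Handler, maxAttempts int) {
	for attempt := 1; ; attempt++ {
		netErrorOccurred := false
		// The proxy's error handler is expected to flip this flag (reachable via
		// the request context) when the backend could not be reached at all.
		ctx := context.WithValue(req.Context(), netErrKey{}, &netErrorOccurred)

		next.ServeHTTP(&attemptWriter{
			ResponseWriter: rw,
			netErr:         &netErrorOccurred,
			mayRetry:       attempt < maxAttempts,
		}, req.WithContext(ctx))

		if !netErrorOccurred || attempt >= maxAttempts {
			// The backend answered, or attempts are exhausted: never retry once
			// data may have reached the client.
			return
		}
	}
}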

@ldez (Contributor) left a comment

LGTM 👏

The retry middleware will be refactored in the next commit so that it no longer needs the original retryResponseRecorder. The retryResponseRecorder, however, was also used for the error pages, so this commit moves it to the error pages file and renames it accordingly.
@traefiker force-pushed the fix-high-memory-usage-in-retry branch from 999eaff to c8cbacd on January 26, 2018 17:04
@traefiker merged commit ef4aa20 into traefik:master on Jan 26, 2018
@m3co-code deleted the fix-high-memory-usage-in-retry branch on January 26, 2018 22:13