Fix memory leak when double invoking RestChannel.sendResponse #89873

original-brownbear · 2022-09-07T13:27:14Z

When using the resource handling channel we must make sure that if we (by what is IMO a bug) try to double invoke it after having already sent a response (or tried to do so) we at least release the memory in the channel's outbound buffer. Otherwise we will leak any memory from it that was used to create the now failing to send RestResponse.

When using the resource handling channel we must make sure that if we (by what is IMO a bug) try to double invoke it after having already sent a response (or tried to do so) we at least release the memory in the channel's outbound buffer. Otherwise we will leak any memory from it that was used to create the now failing to send `RestResponse`.

elasticsearchmachine · 2022-09-07T13:27:38Z

Pinging @elastic/es-distributed (Team:Distributed)

elasticsearchmachine · 2022-09-07T13:27:38Z

Hi @original-brownbear, I've created a changelog YAML for you.

Tim-Brooks

LGTM - was this happening somewhere?

original-brownbear · 2022-09-07T15:27:46Z

Thanks Tim!

was this happening somewhere?

Jup there's some logging for this in Cloud logs ever since we moved to the Netty allocator for Rest responses and get leak detection (mostly from EQL where there's an obvious bug behind it that I'll open a fix for in a bit).

elasticsearchmachine · 2022-09-07T15:29:31Z

💔 Backport failed

Status	Branch	Result
❌	7.17	Commit could not be cherrypicked due to conflicts
✅	8.4

You can use sqren/backport to manually backport by running backport --upstream elastic/elasticsearch --pr 89873

…c#89873) When using the resource handling channel we must make sure that if we (by what is IMO a bug) try to double invoke it after having already sent a response (or tried to do so) we at least release the memory in the channel's outbound buffer. Otherwise we will leak any memory from it that was used to create the now failing to send `RestResponse`.

#89881) When using the resource handling channel we must make sure that if we (by what is IMO a bug) try to double invoke it after having already sent a response (or tried to do so) we at least release the memory in the channel's outbound buffer. Otherwise we will leak any memory from it that was used to create the now failing to send `RestResponse`.

#89885) When using the resource handling channel we must make sure that if we (by what is IMO a bug) try to double invoke it after having already sent a response (or tried to do so) we at least release the memory in the channel's outbound buffer. Otherwise we will leak any memory from it that was used to create the now failing to send `RestResponse`.

* main: (175 commits) Fix integration test on Windows (elastic#89894) Avoiding the use of dynamic map keys in the cluster_formation results of the stable master health indicator (elastic#89842) Mute org.elasticsearch.tracing.apm.ApmIT.testCapturesTracesForHttpTraffic (elastic#89891) Fix typos in audit event types (elastic#89886) Synthetic _source: support histogram field (elastic#89833) [TSDB] Rename rollup public API to downsample (elastic#89809) Format script values access (elastic#89780) [DOCS] Simplifies composite aggregation recommendation (elastic#89878) [DOCS] Update CCS compatibility matrix for 8.3 (elastic#88906) Fix memory leak when double invoking RestChannel.sendResponse (elastic#89873) [ML] Add processor autoscaling decider (elastic#89645) Update disk-usage.asciidoc (elastic#89709) (elastic#89874) Add allocation deciders in createComponents (elastic#89836) Mute flaky H3LatLonGeometryTest.testIndexPoints (elastic#89870) Fix typo in get-snapshot-status-api doc (elastic#89865) Picking master eligible node at random in the master stability health indicator (elastic#89841) Do not reuse the client after a disruption elastic#89815 (elastic#89866) [ML] Distribute trained model allocations across availability zones (elastic#89822) Increment clientCalledCount before onResponse (elastic#89858) AwaitsFix for elastic#89867 ...

* main: (175 commits) Fix integration test on Windows (elastic#89894) Avoiding the use of dynamic map keys in the cluster_formation results of the stable master health indicator (elastic#89842) Mute org.elasticsearch.tracing.apm.ApmIT.testCapturesTracesForHttpTraffic (elastic#89891) Fix typos in audit event types (elastic#89886) Synthetic _source: support histogram field (elastic#89833) [TSDB] Rename rollup public API to downsample (elastic#89809) Format script values access (elastic#89780) [DOCS] Simplifies composite aggregation recommendation (elastic#89878) [DOCS] Update CCS compatibility matrix for 8.3 (elastic#88906) Fix memory leak when double invoking RestChannel.sendResponse (elastic#89873) [ML] Add processor autoscaling decider (elastic#89645) Update disk-usage.asciidoc (elastic#89709) (elastic#89874) Add allocation deciders in createComponents (elastic#89836) Mute flaky H3LatLonGeometryTest.testIndexPoints (elastic#89870) Fix typo in get-snapshot-status-api doc (elastic#89865) Picking master eligible node at random in the master stability health indicator (elastic#89841) Do not reuse the client after a disruption elastic#89815 (elastic#89866) [ML] Distribute trained model allocations across availability zones (elastic#89822) Increment clientCalledCount before onResponse (elastic#89858) AwaitsFix for elastic#89867 ... # Conflicts: # x-pack/plugin/rollup/src/main/java/org/elasticsearch/xpack/downsample/RollupShardIndexer.java

* main: (283 commits) Fix integration test on Windows (elastic#89894) Avoiding the use of dynamic map keys in the cluster_formation results of the stable master health indicator (elastic#89842) Mute org.elasticsearch.tracing.apm.ApmIT.testCapturesTracesForHttpTraffic (elastic#89891) Fix typos in audit event types (elastic#89886) Synthetic _source: support histogram field (elastic#89833) [TSDB] Rename rollup public API to downsample (elastic#89809) Format script values access (elastic#89780) [DOCS] Simplifies composite aggregation recommendation (elastic#89878) [DOCS] Update CCS compatibility matrix for 8.3 (elastic#88906) Fix memory leak when double invoking RestChannel.sendResponse (elastic#89873) [ML] Add processor autoscaling decider (elastic#89645) Update disk-usage.asciidoc (elastic#89709) (elastic#89874) Add allocation deciders in createComponents (elastic#89836) Mute flaky H3LatLonGeometryTest.testIndexPoints (elastic#89870) Fix typo in get-snapshot-status-api doc (elastic#89865) Picking master eligible node at random in the master stability health indicator (elastic#89841) Do not reuse the client after a disruption elastic#89815 (elastic#89866) [ML] Distribute trained model allocations across availability zones (elastic#89822) Increment clientCalledCount before onResponse (elastic#89858) AwaitsFix for elastic#89867 ...

DaveCTurner · 2022-09-08T07:30:58Z

... IMO a bug) try to double invoke it

LGTM2, and yes this sounds like a bug to me too. My only question is whether we could assert that this doesn't happen (ofc fixing any cases where it does first).

original-brownbear · 2022-09-08T07:53:05Z

My only question is whether we could assert that this doesn't happen

It's not entirely trivial unfortunately because of the current tests we have, but yes we should try to move towards that assertion.

DaveCTurner · 2022-09-08T08:09:27Z

Ok, I opened #89902 to track that.

…ticationAction This fixes an obvious bug where the listener was resolved twice if any of the first two failure conditions in the changed method were met. Prior to elastic#89873 this would lead to a memory leak.

…ticationAction (#89930) This fixes an obvious bug where the listener was resolved twice if any of the first two failure conditions in the changed method were met. Prior to #89873 this would lead to a memory leak.

…ticationAction (elastic#89930) This fixes an obvious bug where the listener was resolved twice if any of the first two failure conditions in the changed method were met. Prior to elastic#89873 this would lead to a memory leak.

…ticationAction (#89930) (#89954) This fixes an obvious bug where the listener was resolved twice if any of the first two failure conditions in the changed method were met. Prior to #89873 this would lead to a memory leak.

…eAuthenticationAction (#89930) (#89953) * Fix double sending of response in TransportOpenIdConnectPrepareAuthenticationAction (#89930) This fixes an obvious bug where the listener was resolved twice if any of the first two failure conditions in the changed method were met. Prior to #89873 this would lead to a memory leak. * fix compile

original-brownbear added >bug :Distributed Coordination/Network Http and internode communication implementations v8.5.0 v7.17.7 v8.4.2 labels Sep 7, 2022

elasticsearchmachine added the Team:Distributed (Obsolete) Meta label for distributed team (obsolete). Replaced by Distributed Indexing/Coordination. label Sep 7, 2022

Update docs/changelog/89873.yaml

88f1c17

original-brownbear requested review from DaveCTurner and Tim-Brooks September 7, 2022 15:13

Tim-Brooks approved these changes Sep 7, 2022

View reviewed changes

original-brownbear added the auto-backport-and-merge label Sep 7, 2022

original-brownbear merged commit 7c67116 into elastic:main Sep 7, 2022

original-brownbear deleted the fix-rest-double-invoke-leak branch September 7, 2022 15:28

original-brownbear mentioned this pull request Sep 7, 2022

[8.4] Fix memory leak when double invoking RestChannel.sendResponse (#89873) #89881

Merged

original-brownbear mentioned this pull request Sep 7, 2022

Fix memory leak when double invoking RestChannel.sendResponse (#89873) #89885

Merged

DaveCTurner mentioned this pull request Sep 8, 2022

Prevent double-calling RestChannel#sendResponse #89902

Open

This was referenced Sep 8, 2022

Fix double sending of response in TransportOpenIdConnectPrepareAuthenticationAction #89930

Merged

RestEqlSearchAction trips assertion and tries to send a response to a REST channel twice #89932

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memory leak when double invoking RestChannel.sendResponse #89873

Fix memory leak when double invoking RestChannel.sendResponse #89873

original-brownbear commented Sep 7, 2022

elasticsearchmachine commented Sep 7, 2022

elasticsearchmachine commented Sep 7, 2022

Tim-Brooks left a comment

original-brownbear commented Sep 7, 2022 •

edited

Loading

elasticsearchmachine commented Sep 7, 2022

DaveCTurner commented Sep 8, 2022

original-brownbear commented Sep 8, 2022

DaveCTurner commented Sep 8, 2022

Fix memory leak when double invoking RestChannel.sendResponse #89873

Fix memory leak when double invoking RestChannel.sendResponse #89873

Conversation

original-brownbear commented Sep 7, 2022

elasticsearchmachine commented Sep 7, 2022

elasticsearchmachine commented Sep 7, 2022

Tim-Brooks left a comment

Choose a reason for hiding this comment

original-brownbear commented Sep 7, 2022 • edited Loading

elasticsearchmachine commented Sep 7, 2022

💔 Backport failed

DaveCTurner commented Sep 8, 2022

original-brownbear commented Sep 8, 2022

DaveCTurner commented Sep 8, 2022

original-brownbear commented Sep 7, 2022 •

edited

Loading