Improve logic to cleanup server-session exactly once #6408

shinrich · 2020-02-07T21:56:39Z

Breaking apart the PR in #6401. This one includes the logic to better ensure that the server_session is closed and deleted in a timely manner. The crash we were seeing was due to a server_session staying around after the HttpSM had been deleted. The inactivity cop would execute the handleEvent on the deleted HttpSM continuation.

masaori335 · 2020-02-10T01:34:38Z

proxy/http/HttpSM.cc

    tunnel.deallocate_buffers();
    tunnel.reset();
+    if (server_entry) {
+      server_entry->in_tunnel = false;


Is this necessary? It looks like this is done in line 5421.

masaori335 · 2020-02-10T01:54:07Z

proxy/http/HttpSM.cc

      SMDebug("http_tunnel", "send 408 response to client to vc %p, tunnel vc %p", ua_txn->get_netvc(), p->vc);

      tunnel.chain_abort_all(p);
-      server_session = nullptr;


IIUC, the crash will be fixed by setting server_entry->in_tunnel = false here.
I'm a bit afraid that this PR is too generic for fixing a crash.

masaori335 · 2020-02-10T02:26:17Z

Unfortunately, I still get the crash with this patch. Have you tried this with --trailer? Am I missing something?

nghttp -v --no-dep -s "https://127.0.0.1:4443/post/" -d /usr/local/var/www/128kb --trailer 'foo: bar'

CONFIG proxy.config.http.transaction_no_activity_timeout_in INT 3
CONFIG proxy.config.http.transaction_active_timeout_out INT 5

shinrich · 2020-02-10T14:15:31Z

We also needed PR #6407 to fix the crash. Perhaps that is all we needed. Will do some experiments today with combinations of this PR #6407 and PR #6404

shinrich · 2020-02-10T15:50:52Z

So far, our 9.0.x plus PR #6407 and PR #6404 is not crashing. Our crashes were intermittent, so I'll keep an eye on things today.

Unfortunately, PR #6407 and #6404 do not play together well. With the fix for #6407, the read_complete has already been sent for the post data so the HttpSM handler is state_watch_for_client_abort when the second trailer READ_COMPLETE is sent. This causes the transaction to fail and the nghttp request is responded with a 502 error.

In our case, the faulty clients are not sending trailing headers, so just fixing it in the trailing header logic is not sufficient.

masaori335 · 2020-02-10T23:28:41Z

So far, our 9.0.x plus PR #6407 and PR #6404 is not crashing.

Do you mean #6407 and #6408 ?
#6407 will hide the issue which uncovered by the commit.

The reason why I asked trying nghttp with trailers is that this patch should fix the crash without #6407 nor #6404 changes.

For trailers, we should not signal READ_COMPLETE on final DATA frame. I can do that on #6404, after #6407 is merged.

shinrich · 2020-02-10T23:38:37Z

I talked with @bryancall today, and since the according to the spec we should really be returning a a HTTP/2 error to the client if they are not sending a data frame EOS. I didn't get time today, but I would like to spend some more time tomorrow correctly identifying and cleaning up this case. While PR #6408 avoids the crash, it does not return the error to the client.

shinrich · 2020-02-21T23:19:57Z

I think we can take this one out of the mix

zwoop · 2020-02-21T23:45:39Z

Remember to clear Milestone / Project when closing a PR without merging :).

Improve logic to cleanup server-session exactly once

4e1531d

shinrich added HTTP Crash labels Feb 7, 2020

shinrich added this to the 10.0.0 milestone Feb 7, 2020

shinrich self-assigned this Feb 7, 2020

shinrich mentioned this pull request Feb 7, 2020

Correctly track H2 post vio bytes #6407

Closed

shinrich requested review from SolidWallOfCode, masaori335 and zwoop February 7, 2020 22:06

masaori335 reviewed Feb 10, 2020

View reviewed changes

shinrich closed this Feb 21, 2020

zwoop removed this from the 10.0.0 milestone Feb 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Improve logic to cleanup server-session exactly once #6408

Improve logic to cleanup server-session exactly once #6408

Uh oh!

shinrich commented Feb 7, 2020

Uh oh!

masaori335 Feb 10, 2020

Uh oh!

masaori335 Feb 10, 2020

Uh oh!

masaori335 commented Feb 10, 2020 •

edited

Loading

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

masaori335 commented Feb 10, 2020 •

edited

Loading

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

shinrich commented Feb 21, 2020

Uh oh!

zwoop commented Feb 21, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Improve logic to cleanup server-session exactly once #6408

Improve logic to cleanup server-session exactly once #6408

Uh oh!

Conversation

shinrich commented Feb 7, 2020

Uh oh!

masaori335 Feb 10, 2020

Choose a reason for hiding this comment

Uh oh!

masaori335 Feb 10, 2020

Choose a reason for hiding this comment

Uh oh!

masaori335 commented Feb 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

masaori335 commented Feb 10, 2020 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

shinrich commented Feb 10, 2020

Uh oh!

shinrich commented Feb 21, 2020

Uh oh!

zwoop commented Feb 21, 2020

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

masaori335 commented Feb 10, 2020 •

edited

Loading

masaori335 commented Feb 10, 2020 •

edited

Loading