Better handle Thread#raise and Thread#kill #963

casperisfine · 2023-05-16T07:23:00Z

While it's heavily discouraged, Timeout.timeout ends up being used relatively frequently in production, so ideally it's better to try to handle it gracefully.

This patch is inspired by redis-rb/redis-client@5f82254: before sending a request we increment a counter, and once we have fully read the response(s), we decrement it.

If the counter is not 0 when we start a request, we know the connection may have unread responses from a previously aborted request, and we automatically discard it.
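The mechanism described above can be sketched roughly as follows. This is a minimal illustration and not Dalli's actual code: `SketchConnection`, `Interrupted`, `request`, and `reconnect!` are hypothetical names, and the "reconnect" is only counted rather than performed on a real socket.

```ruby
# Hypothetical stand-in for an exception delivered by Thread#raise or Timeout.timeout.
class Interrupted < StandardError; end

# Hypothetical connection wrapper; not Dalli's real connection manager.
class SketchConnection
  attr_reader :reconnects

  def initialize
    @pending = 0     # requests written but not yet fully read
    @reconnects = 0
  end

  # Call before writing a request.
  def start_request!
    # A non-zero counter means an earlier request was aborted mid-flight, so
    # its response may still be unread on the socket: discard the connection.
    reconnect! unless @pending.zero?
    @pending += 1
  end

  # Call once the response(s) have been fully read.
  def finish_request!
    @pending -= 1
  end

  def request
    start_request!
    result = yield
    # Deliberately not in an `ensure`: an interrupted request must leave
    # the counter non-zero so the next request notices.
    finish_request!
    result
  end

  private

  def reconnect!
    @reconnects += 1
    @pending = 0
  end
end

conn = SketchConnection.new
conn.request { :stored }              # completes normally
begin
  conn.request { raise Interrupted }  # simulates an interrupt mid-read
rescue Interrupted
  # the caller swallows the error and keeps using the client
end
conn.request { :value }               # notices the stale state and reconnects first
puts conn.reconnects
```

The key design point in this sketch is that the counter is only decremented on the success path, so any asynchronous exception leaves evidence behind for the next request.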

petergoldstein · 2023-05-16T12:54:55Z

I need to give this a closer look, but conceptually the changes look good. Thanks @casperisfine .

Would you mind addressing the various lints? Six of them will autocorrect and I think most or all of the rest are suppressed exceptions, which you can override locally as part of the PR.

casperisfine · 2023-05-16T13:06:52Z

Thanks for the review. I addressed the linter failures and updated the CHANGELOG.

cornu-ammonis

Thanks @casperisfine - I think this is really close. It doesn't quite solve my original test case for multiget - if the interrupt occurs at this critical point, we will have already sent the getkq ops to memcached but @request_in_progress will be false, so the socket can still be subsequently re-used with an incomplete read.

cornu-ammonis · 2023-05-16T19:52:44Z

lib/dalli/protocol/base.rb

@@ -66,9 +70,9 @@ def unlock!; end
      # Returns nothing.
      def pipeline_response_setup
        verify_state(:getkq)
+        @connection_manager.start_request!


I think ideally we'd have already called start_request! by the time we get here, since we've already sent the :getkq commands at this point.

Maybe, per the comment in def pipelined_get, we could move the noop along with start_request! and verify_state to that point? I tried that in my original PR, and now that you've changed the write method to pull out start_request!, it is even simpler to do and makes the rest of what I had in that PR unnecessary.
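To illustrate the ordering concern raised in this review comment, here is a small sketch. It does not use Dalli's real classes: `PipelineSketch`, `write_getkq`, `send_then_mark`, `mark_then_send`, and `SimulatedInterrupt` are hypothetical names, and the block passed to each method stands in for an interrupt arriving right after the getkq ops were written.

```ruby
# Hypothetical stand-in for an asynchronous interrupt.
class SimulatedInterrupt < StandardError; end

# Hypothetical model of a connection that tracks an in-progress flag.
class PipelineSketch
  attr_reader :request_in_progress

  def initialize
    @request_in_progress = false
    @written = []
  end

  def start_request!
    @request_in_progress = true
  end

  # Stand-in for writing the quiet get ops to the socket.
  def write_getkq(keys)
    keys.each { |key| @written << [:getkq, key] }
  end

  # Ordering flagged in the review: write first, mark the request later.
  def send_then_mark(keys)
    write_getkq(keys)
    yield if block_given?   # an interrupt here leaves the flag false
    start_request!
  end

  # Suggested ordering: mark the request before anything is written.
  def mark_then_send(keys)
    start_request!
    write_getkq(keys)
    yield if block_given?   # an interrupt here leaves the flag true
  end
end

def flag_after_interrupt(ordering)
  conn = PipelineSketch.new
  begin
    conn.public_send(ordering, %w[a b]) { raise SimulatedInterrupt }
  rescue SimulatedInterrupt
    # swallowed, as a caller rescuing a timeout error might do
  end
  conn.request_in_progress
end

puts flag_after_interrupt(:send_then_mark)
puts flag_after_interrupt(:mark_then_send)
```

With the first ordering the flag is still false after the interrupt, so nothing would stop the socket from being reused with unread responses; with the second the stale state is detectable.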

casperisfine · 2023-05-17T07:00:20Z

@cornu-ammonis yeah good catch 🤦 .

I refactored the code a bit to handle this, I'll add a couple comments on the diff.

casperisfine · 2023-05-17T07:04:34Z

lib/dalli/protocol/base.rb

+          response = send(opkey, *args)
+
+          # pipelined_get emit query but doesn't read the response(s)
+          unless opkey == :pipelined_get


I don't like having to single out that one opkey, but request is clearly defined as a choke point, so I assume going through another method isn't desirable as it may break various monitoring patches.

Also since it takes *args, I can't really have the caller pass an argument to tell the method that the request is incomplete.

But if you are ok with having a specialized send_request method or something like that, I'm happy to refactor this.
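The trade-off described in this comment can be sketched as below. This is an assumption-laden illustration rather than the merged code: `ChokePointSketch`, `finish_pipeline!`, and the stub `get` / `pipelined_get` methods are hypothetical, and only the `send(opkey, *args)` call and the `unless opkey == :pipelined_get` guard come from the diff above.

```ruby
# Hypothetical class modelling a single `request` choke point.
class ChokePointSketch
  attr_reader :in_progress

  def initialize
    @in_progress = false
  end

  # Every operation goes through here, so monitoring patches that wrap
  # `request` keep working.
  def request(opkey, *args)
    @in_progress = true
    response = send(opkey, *args)

    # pipelined_get only emits the queries; the responses are read later,
    # so the request must stay marked as in progress.
    @in_progress = false unless opkey == :pipelined_get
    response
  end

  # Hypothetical hook called once the pipelined responses are drained.
  def finish_pipeline!
    @in_progress = false
  end

  private

  # Stub operations standing in for real protocol calls.
  def get(key)
    "value-for-#{key}"
  end

  def pipelined_get(keys)
    keys.size
  end
end

conn = ChokePointSketch.new
conn.request(:get, "a")
puts conn.in_progress            # request fully read
conn.request(:pipelined_get, %w[a b])
puts conn.in_progress            # responses still outstanding
```

Because `request` takes `*args`, there is no clean way for the caller to pass an "incomplete" marker, which is why the sketch, like the diff, checks the opkey instead.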

Fix: petergoldstein#956

While it's heavily discouraged, `Timeout.timeout` ends up being used relatively frequently in production, so ideally it's better to try to handle it gracefully.

This patch is inspired by redis-rb/redis-client@5f82254: before sending a request we flip a flag, and once we have fully read the response(s), we flip it back.

If the flag is not `false` when we start a request, we know the connection may have unread responses from a previously aborted request, and we automatically discard it.

cornu-ammonis · 2023-05-17T21:47:22Z

> @cornu-ammonis yeah good catch 🤦 .
>
> I refactored the code a bit to handle this, I'll add a couple comments on the diff.

Great 👍 , I confirmed that this handles my multiget test case. It reconnects instead of returning an incorrect response. Thanks!

petergoldstein · 2023-05-30T03:57:08Z

Not sure why initially we had so many spec failures. Rerunning things got everything green.

Minor nit that verify_pipelined_state doesn't need an argument, since it's a new method, internal only, and the argument is unused. But that can be fixed post-merge.

And the other unrelated changes (consolidating on byroot in the CHANGELOG) are fine.

Thanks @byroot

casperisfine mentioned this pull request May 16, 2023

Dalli sometimes returns incorrect values #956

Closed

casperisfine force-pushed the handle-async-interrupt branch 3 times, most recently from fa5cf15 to 290159d Compare May 16, 2023 13:05

cornu-ammonis reviewed May 16, 2023

casperisfine force-pushed the handle-async-interrupt branch from 290159d to 738bfd6 Compare May 17, 2023 06:59

casperisfine force-pushed the handle-async-interrupt branch 2 times, most recently from 3555457 to 9e4f51d Compare May 17, 2023 07:34

casperisfine force-pushed the handle-async-interrupt branch from 9e4f51d to 0f2b374 Compare May 17, 2023 07:37

petergoldstein merged commit c01d410 into petergoldstein:main May 30, 2023

eugeneius mentioned this pull request Jan 25, 2024

Don't reconnect to send pipelined request no-op #983

Merged

marvinthepa mentioned this pull request Nov 2, 2024

interleave read and write on pipelined_get (#776, #941) #942

Draft

casperisfine deleted the handle-async-interrupt branch November 4, 2024 09:16
