Improve wait callback and timeout handling #110

jhawthorn · 2023-08-01T17:33:38Z

This makes a couple improvements to how we wait for the socket to become ready for I/O.

First this refactors TRILOGY_RB_TIMEOUT to a "real" error status in the C library (though currently nothing in the C library raises it), which allows it to be negative like all the other errors (and avoids being confused with a successful read/write of size 1).

Next this makes _cb_ruby_wait return this new status code instead of SYSERR when there is a timeout, allowing it to propogate through trilogy_sock_upgrade_ssl and similar correctly. This allows differentiating between actually syserrors which set errno and timeouts. Also this can show the difference between rb_wait_for_single_fd returning -1 (a syscall error, which I think in practice will be extremely rare) and 0 (a timeout).

Finally the last two commits ensure that the socket is shut down on either a socket timeout we see, or from an external exception (like Timeout.timeout). For the latter we must wrap the waiting in an rb_protect, which previously we were doing correctly for queries, but not for other operations (ex. ping, change_db). We have to shut down the socket because we've interrupted normal control flow and are likely either in the middle of a write (we've partly written our packet) or a read (there will be data sent from the server that we need to handle) so any further operations are invalid.

All other errors are negative so that they can be returned from functions like read/write which return a positive number for success.

This allows handling and raising TimeoutError in the same way as our other errors. This also allows for us to report syscall errors which occur in the socket wait callback (which I believe are very unlikely).

When we call rb_wait_for_single_fd we release the GVL and allow our Ruby thread to receive interrupts. The most obvious case of this is using Timeout.timeout. If we see an exception in this way we can't safely use our socket anymore as it was likely in the middle of another operation, so it should be shut down.

composerinteralia · 2023-08-01T18:03:21Z

contrib/ruby/ext/trilogy-ruby/cext.c

        return TRILOGY_SYSERR;
+    if (args.rc == 0)


I got confused for a second because in most other places rc == 0 means TRILOGY_OK. I understand now that this is the return value of rb_wait_for_single_fd, where 0 means a timeout: https://github.com/ruby/ruby/blob/1642e0c39220e95ddb16b4cbbbe78f24507dfd48/include/ruby/io.h#L902. Possibly worth an inline comment in addition to what you have in the commit message?

jhawthorn added 4 commits July 28, 2023 13:52

Make TRILOGY_TIMEOUT a real error

fa0a783

All other errors are negative so that they can be returned from functions like read/write which return a positive number for success.

Handle timeout errors as a return code

d755f2e

This allows handling and raising TimeoutError in the same way as our other errors. This also allows for us to report syscall errors which occur in the socket wait callback (which I believe are very unlikely).

Shutdown socket after timeout

284780b

composerinteralia approved these changes Aug 1, 2023

View reviewed changes

Add comment describing rb_wait_for_single_fd return value

fe9a7ae

jhawthorn merged commit a1f46bb into trilogy-libraries:main Aug 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve wait callback and timeout handling #110

Improve wait callback and timeout handling #110

jhawthorn commented Aug 1, 2023

composerinteralia Aug 1, 2023

Improve wait callback and timeout handling #110

Improve wait callback and timeout handling #110

Conversation

jhawthorn commented Aug 1, 2023

composerinteralia Aug 1, 2023

Choose a reason for hiding this comment