
TLS streams are not guaranteed to be "split" (full-duplex) safe #40

Open
Matthias247 opened this issue Nov 27, 2020 · 10 comments

Comments

@Matthias247

Currently every TLS stream can be split into a reading half and a writing half using tokio::io::split. However, this operation is not always guaranteed to be correct and safe to perform.

The reason is that with TLS, reading and writing on a stream are not guaranteed to be decoupled, since TLS streams also transfer control data besides application data. Due to this, performing a read operation on the TLS stream might trigger a write operation on the socket (e.g. for sending a key update or alert), and vice versa.

This property can break assumptions that tokio/tokio-tls and the libraries make at the moment. As one example (sketched in code after the list):

  • The user performs a read, which triggers an alert to be sent to the peer
  • The TLS library performs a write on the underlying socket, which reports a blocked (would-block) status. It forwards that blocked status to the application.
  • The task yields and waits to be woken up based on read readiness. That might never happen, since the write-readiness notification goes to whichever task last registered for it, so the reading half would be stuck.
  • The last point happens if the read task doesn't install its Waker for write readiness. If it instead detects that it is write-blocked and overwrites the write Waker, the situation is not necessarily better: a concurrent write operation might now be starved, since its Waker is gone.
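To make the failure mode concrete, here is a minimal sketch shaped like tokio's AsyncRead, assuming a hypothetical sans-IO `Session` trait (a placeholder, not any real TLS crate's API and not actual tokio-tls code). The problem is the `Poll::Pending` returned after the inner `poll_write`: it parks the read task on the socket's write readiness.

```rust
use std::io;
use std::pin::Pin;
use std::task::{Context, Poll};

use tokio::io::{AsyncRead, AsyncWrite, ReadBuf};

/// Hypothetical sans-IO TLS session (placeholder, not a real crate API).
trait Session {
    /// Records the session wants to send (alerts, key updates, ...).
    fn outgoing(&self) -> &[u8];
    fn consume_outgoing(&mut self, n: usize);
    /// Copy already-decrypted plaintext into `buf`.
    fn read_plaintext(&mut self, buf: &mut ReadBuf<'_>) -> io::Result<()>;
}

struct TlsStream<IO, S> {
    io: IO,
    session: S,
}

impl<IO, S> AsyncRead for TlsStream<IO, S>
where
    IO: AsyncRead + AsyncWrite + Unpin,
    S: Session + Unpin,
{
    fn poll_read(
        mut self: Pin<&mut Self>,
        cx: &mut Context<'_>,
        buf: &mut ReadBuf<'_>,
    ) -> Poll<io::Result<()>> {
        let this = &mut *self;

        // A read can force the session to emit control data. Flushing it
        // here means the *read* task's waker gets registered for *write*
        // readiness on the socket.
        while !this.session.outgoing().is_empty() {
            let n = match Pin::new(&mut this.io).poll_write(cx, this.session.outgoing()) {
                Poll::Ready(Ok(n)) => n,
                Poll::Ready(Err(e)) => return Poll::Ready(Err(e)),
                // Problem: the read task is now parked on write readiness.
                // A split-off write half that registered its own Waker can
                // steal or lose this notification, starving one of the halves.
                Poll::Pending => return Poll::Pending,
            };
            this.session.consume_outgoing(n);
        }

        // (Feeding ciphertext from `this.io` into the session is omitted.)
        Poll::Ready(this.session.read_plaintext(buf))
    }
}
```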

What exactly will or won't happen is largely a property of the TLS library, and might therefore be handled completely differently by rustls, openssl, schannel and security-framework. That makes it rather hard to describe what exactly "could go wrong" if someone tries to split a TLS stream.

With rustls, reading and writing on the actual socket is currently handled purely by tokio-tls, which makes it the easiest case to reason about. There, the read and write operations in the wrapper simply don't seem to care whether rustls wants to perform IO in the opposite direction. That might not lead to Waker stealing and starved tasks, but I guess it could delay sending certain updates. @ctz might know more about whether this is problematic.

With native-tls + openssl it is very likely that IO will be performed in the opposite direction and readiness notifications will be handled wrongly. https://dzone.com/articles/using-openssl-with-libuv provides some information on what needs to be done to handle those cases, but I don't think native-tls does it, since it purely forwards all TLS calls to openssl, which uses the fd to perform IO. From there on, things rely on mio to report readiness, which only cares about the socket's read/write state and not the TLS stream's read/write state.

How can this be fixed?

Unfortunately it's not easy. To avoid people running into starved streams, one solution is to get rid of tokio::io::split and highlight in the docs that streams support only one common Waker for both read and write operations. But that doesn't make streams full-duplex.

I think to really enable full-duplex the following things could be done:

  • Make sure the TLS libraries don't perform IO directly: they just get fed incoming data and buffer outgoing data (or write it to a buffered writer). If that buffer is full, they exercise backpressure on the caller. That is roughly what rustls already does.
  • Besides the application's read and write tasks, there is a TLS IO task which makes sure that any buffered outgoing data (whether produced via a read or a write operation) gets written to the socket. Only that task deals with OS readiness notifications. Once sufficient data has been flushed, it wakes up the potentially blocked application write task. Once new data from the socket has been received and decoded via the TLS library, it wakes up the read task. (A sketch of this arrangement follows the list.)
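A minimal sketch of that model, under heavy assumptions: `Session` is a hypothetical sans-IO stand-in (not a real rustls/native-tls API; here it just copies bytes), the application halves talk to the IO task over channels, and only this task ever touches the socket, so OS readiness notifications can never be observed by two competing tasks.

```rust
use tokio::io::{AsyncReadExt, AsyncWriteExt};
use tokio::net::TcpStream;
use tokio::sync::mpsc;

/// Hypothetical sans-IO TLS session: it never touches the socket, it only
/// transforms byte buffers (placeholder, not a real crate API).
struct Session;
impl Session {
    fn encrypt(&mut self, plaintext: &[u8]) -> Vec<u8> { plaintext.to_vec() }
    fn decrypt(&mut self, ciphertext: &[u8]) -> Vec<u8> { ciphertext.to_vec() }
}

/// The dedicated TLS IO task: the only task that ever observes the socket's
/// read/write readiness, so no Waker can be "stolen" by the other direction.
async fn tls_io_task(
    mut socket: TcpStream,
    mut session: Session,
    mut from_write_half: mpsc::Receiver<Vec<u8>>, // plaintext from the app's write half
    to_read_half: mpsc::Sender<Vec<u8>>,          // plaintext for the app's read half
) -> std::io::Result<()> {
    let mut outgoing: Vec<u8> = Vec::new(); // buffered TLS records not yet on the wire
    let mut buf = vec![0u8; 16 * 1024];
    loop {
        // Flush buffered records first. Until this drains we don't accept
        // more plaintext, which exercises backpressure on the write half.
        while !outgoing.is_empty() {
            let n = socket.write(&outgoing).await?;
            outgoing.drain(..n);
        }
        tokio::select! {
            // The write half handed us plaintext: encrypt and buffer the record.
            maybe = from_write_half.recv() => {
                match maybe {
                    Some(plaintext) => outgoing.extend_from_slice(&session.encrypt(&plaintext)),
                    None => break, // write half dropped, shut down
                }
            }
            // Ciphertext arrived on the socket: decrypt and forward to the read half.
            n = socket.read(&mut buf) => {
                let n = n?;
                if n == 0 {
                    break;
                }
                let plaintext = session.decrypt(&buf[..n]);
                if to_read_half.send(plaintext).await.is_err() {
                    break;
                }
            }
        }
    }
    Ok(())
}
```

The write half would push plaintext into the sender side of `from_write_half` (waiting when the channel is full, which is where the backpressure surfaces), and the read half would pull decrypted chunks from the receiver side of `to_read_half`.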

This is also roughly the model that other multiplexing solutions, like HTTP/2, use in their implementations. The obvious downsides are the need to spawn a task and the potential overhead of task switching, which can degrade performance, especially on a multithreaded runtime, where synchronization between the application tasks and the TLS IO task is required.
It also makes the solution less runtime-agnostic, and harder to force-cancel ongoing IO.

@Matthias247
Author

I had another thought on how to work around this in a generic fashion:

  • The split method could generate a new Waker type, similar to how FuturesUnordered works. I will call it VirtualWaker.
  • On every call to TlsStream::read/write, the application task's Waker would get stored inside the generated VirtualWaker, and the VirtualWaker would get forwarded to the call into the TLS library.
  • VirtualWaker::wake is implemented by waking up the application's read and write tasks, which can be done by waking all Waker handles stored inside the generated VirtualWaker. (A sketch follows below.)
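Roughly, such a VirtualWaker could be built on std::task::Wake. The sketch below is only an assumption about the shape (the names and methods are made up), not existing tokio-tls code:

```rust
use std::sync::{Arc, Mutex};
use std::task::{Wake, Waker};

/// Fan-out waker shared by both halves of a split TLS stream. Whichever
/// half last drove the underlying socket may receive the readiness
/// notification; waking both halves ensures neither gets starved.
#[derive(Default)]
struct VirtualWaker {
    read_waker: Mutex<Option<Waker>>,
    write_waker: Mutex<Option<Waker>>,
}

impl VirtualWaker {
    /// Called from the read half's poll fn: remember the read task's Waker
    /// and hand back a Waker that fans out to both tasks.
    fn register_read(self: &Arc<Self>, waker: &Waker) -> Waker {
        *self.read_waker.lock().unwrap() = Some(waker.clone());
        Waker::from(self.clone())
    }

    /// Same, for the write half.
    fn register_write(self: &Arc<Self>, waker: &Waker) -> Waker {
        *self.write_waker.lock().unwrap() = Some(waker.clone());
        Waker::from(self.clone())
    }
}

impl Wake for VirtualWaker {
    fn wake(self: Arc<Self>) {
        // Wake both application tasks; one wakeup is spurious, which costs
        // an extra (failed) syscall but never starves a task.
        if let Some(w) = self.read_waker.lock().unwrap().take() {
            w.wake();
        }
        if let Some(w) = self.write_waker.lock().unwrap().take() {
            w.wake();
        }
    }
}
```

In the split halves' poll functions, the stream would store cx.waker() via register_read/register_write, wrap the returned Waker in a fresh Context (Context::from_waker) and pass that down to the inner socket, so the readiness notification reaches both halves regardless of which direction it was registered for.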

That would lead to spurious wakeups for the opposite direction and up to 50% wasted syscalls, but the same already happens if someone performs concurrent reads/writes on a TlsStream using select!/join!/etc.

@quininer
Member

I believe that tokio-rustls will hardly have such problems. The only place it performs IO in the opposite direction is when it tries to send an alert when an error occurs.

But old versions of tokio-rustls and some forks based on it do have this problem, like async-tls.

@Matthias247
Author

I agree this should be safe, since this is a terminal condition anyway.

The downside of this model is that if rustls itself required sending e.g. renegotiations or other control data on reads, tokio-tls would currently not perform this. But as far as I understand, those things are not supported by rustls.

It might be trickier to find out which of the native-tls backends would be safe to split, since they could all behave differently.

I also looked at async-tls, which you linked. That one indeed seems generally not safe to split, since it always tries to perform opposite-direction IO after any interaction with it. But I'm also not sure whether the async-std ecosystem has a generic split method equivalent to tokio::io::split which would trigger the issue.

@carllerche
Member

It sounds to me like the TLS streams have buggy implementations of the traits.

Another way to fix this would be for the TlsStream to hold its own read/write wakers, and when calling the inner stream, pass a waker that fans out notifications to both the read and write halves. This will result in spurious wakeups but will fix the implementations.

@carllerche
Member

cc @sfackler

@rapiz1

rapiz1 commented Dec 20, 2021

This sounds dangerous and can render split unusable... Any progress on this? I happen to want to use split.

@gftea

gftea commented Dec 13, 2022

Hi, would it be a good idea for the library to provide callback functions for TLS control data?
When the TLS read half receives TLS control data, it would dispatch it to the callback instead of to the application.

@stanal

stanal commented Jun 9, 2023

Does this problem still exist now?

@ry

ry commented Jul 12, 2024

@dgrr

dgrr commented Nov 1, 2024

I think the issue will always be there, because it depends on the implementations, of which there is, in the end, an unbounded number of crates.
Adding documentation about the possible bugs that splitting a stream could cause would be ideal. Pointing to this issue would be best, since the explanation here describes very clearly how the issue arises.
