
Refactor message serialization as a tokio codec. #22

Merged: 8 commits into main on Sep 25, 2019
Conversation

hdevalence (Contributor) commented:

Closes #20.

This lets us transform a TCP stream into an async stream of Messages rather than having to call send/recv directly.

This allows us to organize all of the Bitcoin-Zcash specific parts of
the protocol into a subtree.
This provides a significantly cleaner API to consumers, because it
allows using adaptors that convert a TCP stream to a stream of messages,
and potentially allows more efficient message handling.
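The adaptor idea can be shown with a std-only sketch. The real PR uses tokio_util::codec::Framed over an async TcpStream; here a plain iterator-style adaptor stands in for it, and the newline-delimited `Codec` and `Message` types are toy stand-ins, not the zebra-network ones:

```rust
// Hypothetical sketch: a codec plus an adaptor that turns raw bytes into
// messages, so consumers never call send/recv on the socket directly.
#[derive(Debug, PartialEq)]
struct Message(String);

struct Codec;

impl Codec {
    // Decode one newline-delimited message from the front of `buf`,
    // removing the bytes it consumed (as a tokio Decoder would).
    fn decode(&mut self, buf: &mut Vec<u8>) -> Option<Message> {
        let pos = buf.iter().position(|&b| b == b'\n')?;
        let line: Vec<u8> = buf.drain(..=pos).collect();
        Some(Message(String::from_utf8_lossy(&line[..pos]).into_owned()))
    }
}

// Adaptor: turn a byte source into a sequence of Messages. With Framed,
// this sequence would be an async Stream rather than a Vec.
fn framed(mut codec: Codec, bytes: &[u8]) -> Vec<Message> {
    let mut buf = bytes.to_vec();
    let mut out = Vec::new();
    while let Some(msg) = codec.decode(&mut buf) {
        out.push(msg);
    }
    out
}

fn main() {
    let msgs = framed(Codec, b"ping\npong\n");
    assert_eq!(msgs, vec![Message("ping".into()), Message("pong".into())]);
}
```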
hdevalence (author) commented:
(This is still a draft because there was one bit of cleanup I wanted to do on the try_read_ functions).

@hdevalence hdevalence marked this pull request as ready for review September 24, 2019 23:15
@hdevalence hdevalence mentioned this pull request Sep 24, 2019
// XXX(HACK): this is inefficient and does an extra allocation.
// instead, we should have a size estimator for the message, reserve
// that much space, write the header (with zeroed checksum), then the body,
// then write the computed checksum in-place. for now, just do an extra alloc.
hdevalence (author) commented:

We could record this refactor in an issue but I think saving an extra alloc is much less important than other stuff right now. However when we do get around to doing it properly, the tokio codec setup will let us perform a single allocation per message.
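The single-allocation encode described there could look roughly like the following std-only sketch. Assumptions are flagged in the comments: a Bitcoin-style 24-byte header with the checksum at bytes 20..24, and a toy XOR `checksum` standing in for the real first-four-bytes-of-SHA256d:

```rust
const HEADER_LEN: usize = 24;

// Placeholder checksum: the real codec uses the first 4 bytes of
// SHA256d(body); XOR keeps this sketch dependency-free.
fn checksum(body: &[u8]) -> [u8; 4] {
    let mut sum = [0u8; 4];
    for (i, b) in body.iter().enumerate() {
        sum[i % 4] ^= b;
    }
    sum
}

fn encode(magic: [u8; 4], command: &[u8; 12], body: &[u8]) -> Vec<u8> {
    // Reserve exactly once: header + body, no extra allocation.
    let mut buf = Vec::with_capacity(HEADER_LEN + body.len());
    buf.extend_from_slice(&magic);
    buf.extend_from_slice(command);
    buf.extend_from_slice(&(body.len() as u32).to_le_bytes());
    buf.extend_from_slice(&[0u8; 4]); // zeroed checksum placeholder
    buf.extend_from_slice(body);
    // Compute the checksum over the body and write it in place.
    let sum = checksum(&buf[HEADER_LEN..]);
    buf[20..24].copy_from_slice(&sum);
    buf
}

fn main() {
    let msg = encode(*b"MAGC", b"ping\0\0\0\0\0\0\0\0", b"nonce!!!");
    assert_eq!(msg.len(), HEADER_LEN + 8);
    assert_ne!(&msg[20..24], &[0u8; 4]); // checksum was written in place
}
```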

// ======== Decoding =========

#[derive(Debug)]
enum DecodeState {
hdevalence (author) commented:

This tracks the decoder state; since decode can be called multiple times (see below) we need to track what phase (header/body) we're in and absorb the header contents (the decoder is responsible for removing parsed data from the buffer).
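As a sketch, the state machine might look like this. The `Body` field names are illustrative; the real codec stores whichever header fields the body parser needs:

```rust
// Decoder state: either waiting for a complete header, or holding the
// parsed header fields while waiting for `body_len` bytes of body.
#[derive(Debug, PartialEq)]
enum DecodeState {
    Head,
    Body {
        body_len: usize,
        command: [u8; 12],
        checksum: [u8; 4],
    },
}

fn main() {
    let mut state = DecodeState::Head;
    // After parsing a header, record its fields and switch to Body...
    state = DecodeState::Body {
        body_len: 8,
        command: *b"ping\0\0\0\0\0\0\0\0",
        checksum: [0; 4],
    };
    if let DecodeState::Body { body_len, .. } = state {
        assert_eq!(body_len, 8);
    }
    // ...and after consuming the body, reset for the next message.
    state = DecodeState::Head;
    assert_eq!(state, DecodeState::Head);
}
```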

type Error = Error;

#[instrument(skip(src))]
fn decode(&mut self, src: &mut BytesMut) -> Result<Option<Self::Item>, Self::Error> {
hdevalence (author) commented:

decode returns:

  • Err to signal an error;
  • Ok(None) to signal insufficient data;
  • Ok(Some(msg)) when an entire item is ready.
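That contract can be illustrated with a toy length-prefixed frame format (std-only, not the zebra-network wire format):

```rust
#[derive(Debug)]
struct DecodeError;

fn decode(src: &mut Vec<u8>) -> Result<Option<Vec<u8>>, DecodeError> {
    if src.is_empty() {
        return Ok(None); // not even a length byte yet
    }
    let len = src[0] as usize;
    if len == 0 {
        return Err(DecodeError); // malformed frame: signal an error
    }
    if src.len() < 1 + len {
        return Ok(None); // insufficient data: wait for more bytes
    }
    // A whole frame is buffered: remove it from `src` and return it.
    let frame: Vec<u8> = src.drain(1..1 + len).collect();
    src.drain(..1); // drop the length byte too
    Ok(Some(frame))
}

fn main() {
    let mut buf = vec![4, b'p', b'i']; // partial frame
    assert!(matches!(decode(&mut buf), Ok(None)));
    buf.extend_from_slice(b"ng"); // now complete
    assert_eq!(
        decode(&mut buf).unwrap().unwrap(),
        vec![b'p', b'i', b'n', b'g']
    );
    assert!(buf.is_empty());
}
```

The key invariant is the same as for a tokio Decoder: returning `Ok(None)` leaves the buffer untouched so the caller can read more bytes and call decode again.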


// Reserve buffer space for the expected body and the following header.
src.reserve(body_len + HEADER_LEN);
hdevalence (author) commented:

The tokio docs recommend reserving body_len + HEADER_LEN, so that no more allocations are required until after reading the next header.
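A minimal sketch of that reservation, with a plain Vec standing in for BytesMut:

```rust
const HEADER_LEN: usize = 24; // Bitcoin-style header size, as an assumption

fn main() {
    let mut src: Vec<u8> = Vec::new();
    let body_len = 1000;
    // Reserve room for this body plus the next header, so the next
    // decode call can read a full header without reallocating.
    src.reserve(body_len + HEADER_LEN);
    assert!(src.capacity() >= body_len + HEADER_LEN);
}
```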

// Now that we know we have the full body, split off the body,
// and reset the decoder state for the next message.
let body = src.split_to(body_len);
self.state = DecodeState::Head;
hdevalence (author) commented:

After we remove the body, we have to reset the decoder state to DecodeState::Head, or we'll try to parse the next message header as a body with the same type and checksum as the current message, causing a checksum error (I forgot this step at first 😓)
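The split-and-reset step can be sketched std-only, with Vec::drain standing in for BytesMut::split_to and a toy state enum:

```rust
#[derive(Debug, PartialEq)]
enum DecodeState {
    Head,
    Body { body_len: usize },
}

// Split `body_len` bytes off the front of `src` and reset the state
// so the next call starts by parsing a fresh header.
fn take_body(src: &mut Vec<u8>, state: &mut DecodeState) -> Vec<u8> {
    let body_len = match *state {
        DecodeState::Body { body_len } => body_len,
        DecodeState::Head => panic!("no header parsed yet"),
    };
    let body: Vec<u8> = src.drain(..body_len).collect();
    // Without this reset, the next call would try to parse the next
    // header's bytes as a body with this message's command and checksum.
    *state = DecodeState::Head;
    body
}

fn main() {
    // Buffer holds a 4-byte body followed by the start of the next header.
    let mut src = b"bodyNEXTHDR".to_vec();
    let mut state = DecodeState::Body { body_len: 4 };
    let body = take_body(&mut src, &mut state);
    assert_eq!(body, b"body".to_vec());
    assert_eq!(state, DecodeState::Head);
    assert_eq!(src, b"NEXTHDR".to_vec());
}
```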

A contributor commented:
This would be good to have as a comment inline, just in case.

dconnolly (Contributor) left a review:

codec.rs is quite large, if there's a way to break it up somehow that would be nice.

Resolved review threads:

  • zebra-network/src/network.rs
  • zebra-network/src/protocol/codec.rs (three threads, one outdated)
hdevalence (author) commented:
Fixed the constant placement and moved the header encoding to come before the body encoding. I agree that the file is larger than desirable, but I'm not sure there's a great way to split up its contents.

This is no longer required because the body reader methods have access
to the version via the codec state.
dconnolly (Contributor) left a review:

🚚

@hdevalence hdevalence merged commit fe95ad3 into main Sep 25, 2019
@hdevalence hdevalence deleted the tokio-codec branch September 25, 2019 21:59
skyl added a commit to skyl/zebra that referenced this pull request Sep 25, 2024
Successfully merging this pull request may close these issues: Change message encoding / decoding to be a Tokio codec.