Network specification update #1404

Merged (10 commits) on Sep 17, 2019

Conversation

@AgeManning (Contributor) commented Sep 8, 2019

Provides updates to the networking specification.

Specifically:

  • Fixes the REQ_RESP_MAX_SIZE
  • Renames the confusingly similar BeaconBlocks and RecentBeaconBlocks to BeaconBlocksByRange and BeaconBlocksByRoot, respectively
  • Removes the 1:1 mapping in BeaconBlocksByRoot; a responder may now return fewer blocks than requested
  • The responder to a BeaconBlocksByRange request should also limit its response by REQ_RESP_MAX_SIZE or SSZ_MAX_LIST_SIZE
  • Adds the notion of a response_chunk. Responses that consist of a single SSZ-list (BeaconBlocksByRange, BeaconBlocksByRoot) are now sent back over the stream as individual response_chunks (a sketch of this framing follows below)
  • Adds clarification around the SSZ-encoding of the request/response types
  • Adds clarification to the RPC requests

This extends the improvements in #1390.
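
To make the response_chunk notion concrete, here is a minimal responder-side sketch. It assumes a single-byte success result code and a protobuf-style varint length prefix as the encoding-dependent header; the function names and the plain-bytes payloads are illustrative only, not part of the spec:

```python
import io

SUCCESS = 0  # single-byte result code for a successful chunk

def write_varint(stream, value: int) -> None:
    # Protobuf-style unsigned varint: 7 bits per byte, high bit = continuation.
    while True:
        byte = value & 0x7F
        value >>= 7
        if value:
            stream.write(bytes([byte | 0x80]))
        else:
            stream.write(bytes([byte]))
            return

def send_response_chunks(stream, encoded_payloads) -> None:
    # Each SSZ-encoded list item becomes its own response_chunk:
    # <result byte> <varint length prefix> <encoded payload>.
    for payload in encoded_payloads:
        stream.write(bytes([SUCCESS]))      # result
        write_varint(stream, len(payload))  # encoding-dependent header
        stream.write(payload)               # encoded-payload
    # Closing the write side of the stream then delimits the full response.

# Example with three already-encoded payloads standing in for SSZ-encoded blocks.
buf = io.BytesIO()
send_response_chunks(buf, [b"block-1", b"block-2", b"block-3"])
```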

@djrtwo (Contributor) left a comment

Cleaned up the language a bit and clarified some things.
One minor question.

@zah commented Sep 9, 2019

If the hello message is now valid at any time, shall we rename it to status? A lot of people will associate the term "hello" with something that is appropriate only at the start of the session.

@prestonvanloon (Contributor)

Instead of

(
  blocks: []BeaconBlock
)

we could define it as

(
  []BeaconBlock
)

What do you think? Having a field name might imply that this is a container. We missed the note originally when implementing the networking spec and sent block response containers.

@AgeManning (Contributor, Author) commented Sep 10, 2019

I have updated this PR to accommodate suggestions.
Specifically, REQ_RESP_MAX_SIZE now applies to the encoded payload of requests/responses, and RESP_TIMEOUT now applies per response chunk.

@raulk (Contributor) left a comment

Nice work on the chunking strategy! How about leveraging this to mandate/recommend early/streaming validation, with the ability to Reset a stream, or decrement peer reputation on failure? This would create a more secure network.

```
result ::= "0" | "1" | "2" | ["128" ... "255"]
```

The encoding-dependent header may carry metadata or assertions such as the encoded payload length, for integrity and attack proofing purposes. Because req/resp streams are single-use and stream closures implicitly delimit the boundaries, it is not strictly necessary to length-prefix payloads; however, certain encodings like SSZ do, for added security.

`encoded-payload` has a maximum byte size of `REQ_RESP_MAX_SIZE`.
A `response` is formed by one or more `response_chunk`s. The exact request determines whether a response consists of a single `response_chunk` or possibly many. Responses that consist of a single SSZ-list (such as `BlocksByRange` and `BlocksByRoot`) send each list item as a `response_chunk`. All other response types (non-Lists) send a single `response_chunk`. The encoded-payload of a `response_chunk` has a maximum uncompressed byte size of `REQ_RESP_MAX_SIZE`.
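
For reference, one way a client might model the result byte above. The 0/1/2 meanings (Success, InvalidRequest, ServerError) reflect my reading of the spec at this point in time; the enum itself is only a sketch:

```python
from enum import IntEnum

class ResponseCode(IntEnum):
    SUCCESS = 0
    INVALID_REQUEST = 1
    SERVER_ERROR = 2
    # 128..255 are left for application-specific or future error codes.

def is_error_chunk(result_byte: int) -> bool:
    return result_byte != ResponseCode.SUCCESS
```
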
Contributor

This can incur excessive fragmentation/overhead/underutilisation if list items are small, as you're effectively defining a 1:1 mapping between chunk and list element.

I'd recommend making chunk boundaries fall on full list elements, allowing a chunk to contain multiple complete list elements and disallowing partial elements or bleeding over between chunks.

It may be worth formally introducing the term chunkable, so you can label message fields that satisfy this property in the schemas below.

@zah commented Sep 11, 2019

Please note that adding chunks does not increase the size of the overall payload, because we are replacing the leading 4-byte offsets that previously appeared in the SSZ lists with response codes and varint lengths which will often amount to just 2 bytes per chunk (in the small items case).

AgeManning (Contributor, Author)

> I'd recommend making chunk boundaries fall on full list elements, allowing a chunk to contain multiple complete list elements and disallowing partial elements or bleeding over between chunks.

This is the case. "Responses that consist of a single SSZ-list (such as BlocksByRange and BlocksByRoot) send each list item as a response_chunk"
Perhaps this is not clear enough?

A chunk represents a single encoded item

Contributor

> A chunk represents a single encoded item

Yeah, in practice this can result in excessive chunking. If you have 3 list items available, and they all fit within one chunk, you can coalesce them into the same chunk. This is especially more efficient if you can fit multiple chunks within a single MTU. I'm not so worried about the byte overhead (although... death by a thousand cuts is a thing), but rather about the fragmentation, and about making a protocol with a chunking strategy that's future proof.

AgeManning (Contributor, Author)

Not entirely sure I follow.
Different requests have different discrete items. E.g. a status message has a single SSZ-encoded response (which is a container). Its chunk would be the size of that SSZ container (which likely won't be a multiple of an MTU).

For responses that have lists, we now split the list into single items and send each one as a chunk wrapped in an "error response". If we grouped multiple items under a single error response, we would require the encoding to be an SSZ-list (like we had earlier) and the receiver would need to know which bytes are lists and which are single elements.

An SSZ list is encoded like | offset | offset | offset ... | item | item | item |. Previously we sent this whole thing as a single error response across one stream; it is now more like:
| error_response | item | error_response | item | ...
So we have replaced the offsets with error_responses and allowed the receiver to take individual elements rather than wait for the entire stream to end.

Let me know if I've misunderstood your point
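
For illustration, a rough sketch of reading one such framed item back off the stream. The `error_response` prefix in the layout above corresponds to the result byte plus length prefix here; `read_exact` is an assumed callback that returns exactly n bytes, and none of the names come from the spec:

```python
def read_varint(read_exact) -> int:
    # Decode a protobuf-style unsigned varint via a byte-reading callback.
    shift, value = 0, 0
    while True:
        byte = read_exact(1)[0]
        value |= (byte & 0x7F) << shift
        if not byte & 0x80:
            return value
        shift += 7

def read_response_chunk(read_exact, max_payload_size: int):
    # One chunk is <result><length><payload>; returns (result, payload).
    result = read_exact(1)[0]
    length = read_varint(read_exact)
    if length > max_payload_size:  # per-chunk REQ_RESP_MAX_SIZE check
        raise ValueError("chunk payload exceeds maximum size")
    return result, read_exact(length)
```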


The requester MUST wait a maximum of `TTFB_TIMEOUT` for the first response byte to arrive (time to first byte—or TTFB—timeout). On that happening, the requester will allow further `RESP_TIMEOUT` to receive the full response.
The requester MUST wait a maximum of `TTFB_TIMEOUT` for the first response byte to arrive (time to first byte—or TTFB—timeout). On that happening, the requester allows a further `RESP_TIMEOUT` for each subsequent `response_chunk` received. For responses consisting of potentially many `response_chunk`s (an SSZ-list) the requester SHOULD read from the stream until either: a) An error result is received in one of the chunks, b) The responder closes the stream, c) More than `REQ_RESP_MAX_SIZE` bytes have been read for a single `response_chunk` payload, or d) More than the maximum number of requested chunks are read. For requests consisting of a single `response_chunk` and a length-prefix, the requester should read the exact number of bytes defined by the length-prefix before closing the stream.
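
As a sketch of the requester-side policy this paragraph describes (the four termination conditions and the per-chunk timeout), assuming a hypothetical `read_chunk(stream, timeout)` helper that enforces the per-chunk size cap, raises on timeout, and returns `None` when the responder closes the stream; the timeout values match my reading of the spec at the time but are otherwise illustrative:

```python
SUCCESS = 0
TTFB_TIMEOUT = 5    # seconds: time to first byte
RESP_TIMEOUT = 10   # seconds: per subsequent response_chunk

def read_chunked_response(stream, max_chunks, read_chunk):
    chunks = []
    timeout = TTFB_TIMEOUT                   # wait for the first chunk to start arriving
    while len(chunks) < max_chunks:          # d) no more than the requested number of chunks
        chunk = read_chunk(stream, timeout)  # c) size cap enforced inside read_chunk
        if chunk is None:                    # b) responder closed the stream
            break
        result, payload = chunk
        if result != SUCCESS:                # a) an error result ends the response
            break
        chunks.append(payload)
        timeout = RESP_TIMEOUT               # fresh RESP_TIMEOUT for each further chunk
    return chunks
```
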
Contributor

Beware of a starvation DoS attack where attackers could deliberately trickle their chunks slowly so as to keep that socket/file descriptor busy as long as possible.

AgeManning (Contributor, Author)

Each chunk is bounded by RESP_TIMEOUT, so the slowest an attacker can be is 1 chunk per RESP_TIMEOUT. If the chunk is malicious or useless, the application will likely drop the peer.

Contributor

Yes, I'm worried about the case where an attacker has good data but they explicitly starve you off it by trickling it slowly. RESP_TIMEOUT is 10s; imagine you have 1000 chunks, of 1kb each (1mb total). This policy makes it possible to craft an attack where the peer sends you one valid chunk every 9 seconds, overall taking 150 minutes (2.5 hours) to dispatch 1mb of data.

This can be counteracted by a global RPC timeout.

AgeManning (Contributor, Author)

Good point. Will discuss :)

Contributor

I'm not sure this is something the spec necessarily needs to discuss - it seems it could be handled with a peer quality algorithm instead - if the trickling peer is the only one you have, it's also the one you want.

@prestonvanloon (Contributor) left a comment

LGTM. I would like a recommendation for the scenario when a dialing peer does not send a Status immediately.

@arnetheduck (Contributor)

> LGTM. I would like a recommendation for the scenario when a dialing peer does not send a Status immediately.

Timeout/disconnect - no point spending resources if they're not following the protocol.

@djrtwo (Contributor) left a comment

I'd like to get this merged soon and released in v0.8.4.


The requester MUST wait a maximum of `TTFB_TIMEOUT` for the first response byte to arrive (time to first byte—or TTFB—timeout). On that happening, the requester will allow further `RESP_TIMEOUT` to receive the full response.
The requester MUST wait a maximum of `TTFB_TIMEOUT` for the first response byte to arrive (time to first byte—or TTFB—timeout). On that happening, the requester allows a further `RESP_TIMEOUT` for each subsequent `response_chunk` received. For responses consisting of potentially many `response_chunk`s (an SSZ-list) the requester SHOULD read from the stream until either: a) An error result is received in one of the chunks, b) The responder closes the stream, c) More than `MAX_CHUNK_SIZE` bytes have been read for a single `response_chunk` payload, or d) More than the maximum number of requested chunks are read. For requests consisting of a single `response_chunk` and a length-prefix, the requester should read the exact number of bytes defined by the length-prefix before closing the stream.
Contributor

> d) More than the maximum number of requested chunks are read.

@AgeManning Should this number be specified?

AgeManning (Contributor, Author)

The two types that can request chunks are BeaconBlocksByRange and BeaconBlocksByRoot. Each specifies in the request the maximum number of blocks/chunks that should be returned.

For BeaconBlocksByRange this is the count parameter, for BeaconBlocksByRoot this is the number of hashes requested.

Let me know if this doesn't answer your question :)
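
In code terms, the cap a requester should enforce falls straight out of its own request; a rough sketch, where the request classes are illustrative stand-ins for the spec's schemas rather than the real definitions:

```python
from dataclasses import dataclass, field
from typing import List

@dataclass
class BeaconBlocksByRangeRequest:  # illustrative subset of the real schema
    start_slot: int
    count: int
    step: int

@dataclass
class BeaconBlocksByRootRequest:   # illustrative subset of the real schema
    block_roots: List[bytes] = field(default_factory=list)

def max_expected_chunks(request) -> int:
    # Upper bound on response_chunks implied by the request itself.
    if isinstance(request, BeaconBlocksByRangeRequest):
        return request.count               # the count parameter
    if isinstance(request, BeaconBlocksByRootRequest):
        return len(request.block_roots)    # at most one block per requested root
    return 1                               # non-list responses are a single chunk
```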

Contributor

That makes perfect sense. Thank you for the clarification ❤️.

9 participants