
Compact Block Propagation #7932

Closed

cmwaters opened this issue Sep 6, 2021 · 3 comments
Labels: S:proposal (Status: Proposal), stale (for use by stalebot)

Comments

@cmwaters (Contributor) commented Sep 6, 2021

Protocol Change Proposal

Summary

This is a proposal to optimize block propagation by gossiping only the hashes of each transaction.

Context

Block propagation is currently achieved by requesting transactions from the node's mempool, constructing the block from the set of transactions and prior state, breaking the block into parts, and gossiping these parts through the network alongside a proposal message. Receiving nodes reconstruct the block, verify it, and, if they are validators, vote on it. Simultaneously, the mempool operates by broadcasting transactions throughout the network. This duplicates transaction data across the network: each transaction travels at least once via the mempool and at least once via consensus.

Gossiping "compact" blocks is a technique employed by other networks such as Bitcoin (https://arxiv.org/pdf/2101.00378.pdf).

Proposal

At a high level, the consensus engine requests a set of transaction hashes from the mempool and combines these with the Header, Commit, and Evidence (evidence could also be hashes, as the evidence pool works in a similar manner) to produce a "compact block". This compact block would then be gossiped to peers alongside the proposal. It should be small enough not to require chunking (though this could be re-evaluated later). Nodes receiving the compact block send the transaction hashes to the mempool. If the mempool already has the transactions, it returns them to consensus so that consensus can reconstruct the block. If not, the mempool uses a request/response model with connected peers to fetch the missing transactions. Once complete, consensus can continue with verification and voting. If all the transactions can't be fetched within the consensus timeout, the node will prevote/precommit nil and move on to the next round.

The new request/response mechanism in the mempool could be done over a separate, higher priority channel than the gossiping of transactions.

Moving Forward

Before writing any design documentation, a rather rudimentary experiment should be run to analyse the effectiveness of the change. This would be to write a custom node that, for every block it receives from its peers, counts how many of the block's transactions it already had in its mempool. This should be done on both high- and low-throughput networks. A high percentage indicates that there are significant gains to be made from this optimization.

This proposal has some overlap with parts of #7922. I would think that such an optimization should be evaluated/implemented post Tendermint 1.0. As the current block chunking assumes nothing of the structure of blocks, it would still be possible to use this in the case of extremely large blocks.


For Admin Use

  • Not duplicate issue
  • Appropriate labels applied
  • Appropriate contributors tagged
  • Contributor assigned/self-assigned
@cmwaters cmwaters added the S:proposal Status: Proposal label Sep 6, 2021
@ValarDragon (Contributor) commented

Agreed that we definitely need to analyze whether this is a speedup for Tendermint, at different block times.

I view erasure encoding as the huge win in p2p gossip scaling, and "compact blocks" optimizations as something that should move to being intra-chunk. (Erasure encoding ensures we avoid coupon-collector problems and can more effectively get peers gossiping to one another.) So compact blocks, imo, should be analyzed within each erasure-encoding option (of which there are two, imo):

  1. With raptor codes, it's mainly a trade-off over whether it makes sense to list all the tx hashes optimistically in the proposal and then have peers tell each other which indices they are missing. (Is the data overhead to the proposer commensurate with the gains saved by not having peers receive those chunks?)
  2. With Reed-Solomon-encoded block gossip this is less clear. Compact chunks would let you skip some of this data gossiping per chunk (for the systematically encoded parts of a chunk that you already know), but in exchange for more gossip latency, which may increase net time.

(Solana uses reed solomon encoding, with no compact gossiping, and a fairly stringent gossiping topology overlayed on top: https://medium.com/solana-labs/turbine-solanas-block-propagation-protocol-solves-the-scalability-trilemma-2ddba46a51db, https://medium.com/solana-labs/gulf-stream-solanas-mempool-less-transaction-forwarding-protocol-d342e72186ad)

Assuming no erasure encoding, I totally agree that it needs to be benchmarked.
The big difference between us and Bitcoin is block time. At 5-second block times, how much would we be paying in the pessimistic case of an extra round trip of gossip between each pair of nodes across the entire p2p layer? Or are there optimizations we can do where a node says "if I had to request this tx by hash, I should directly gossip its data, not gossip it by hash, since my peers failed to deliver it to me", so this doesn't have cascading delays across the network?

@olahouze commented

Hello

Have you been able to test both solutions to choose the most optimized one?

Sincerely

@faddat (Contributor) commented Jan 28, 2024

Re @ValarDragon's comment:

Solana consumes a consistent 1gbps according to validators I've talked to
