
Lookahead: Remove filling of unincluded segment #6075

Closed
wants to merge 3 commits

Conversation

@skunert (Contributor) commented Oct 15, 2024

The lookahead collator was filling up the unincluded segment by producing one extra block whenever there was space in the unincluded segment.

This increases the time to finality for a given transaction, and the benefit is questionable. In addition, if we use the full execution time of 2s for async-backing parachains and there is a relay chain fork, we will be producing blocks for 8s, which means that we are producing past our slot.
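The 8s figure above can be checked with a back-of-envelope sketch (the function and parameter names here are illustrative, not from the collator code):

```rust
/// Worst-case authoring time: with a relay-chain fork there are two
/// relay parents to build on, the lookahead loop authors up to two
/// blocks per parent, and each block takes the full 2s of execution.
fn worst_case_authoring_secs(relay_forks: u64, blocks_per_parent: u64, authoring_secs: u64) -> u64 {
    relay_forks * blocks_per_parent * authoring_secs
}

fn main() {
    // 2 forks * 2 blocks per parent * 2s per block = 8s,
    // i.e. longer than the 6s relay-chain slot.
    assert_eq!(worst_case_authoring_secs(2, 2, 2), 8);
}
```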

Alternatively, we could introduce a configuration option to let users choose this value; however, that would make this PR non-backportable.

cc @bkchr @eskimor

@skunert skunert added T0-node This PR/Issue is related to the topic “node”. T9-cumulus This PR/Issue is related to cumulus. labels Oct 15, 2024
@skunert (Contributor, Author) commented Oct 15, 2024

/cmd prdoc --audience node_dev

@alexggh (Contributor) commented Oct 15, 2024

> This increases the time to finality for a given transaction, and the benefit is questionable.

Thinking about this: in the current implementation, because we produce from 0..2, we end up on the first try with 2 parachain blocks ahead.

With this PR, we can still easily end up with 2 blocks ahead if we miss 2 opportunities to back the parachain blocks on the relay chain. Because there is still space in the unincluded segment, the collators will produce the blocks ahead, and after that the steady state will be the same as before this PR.

Unfortunately, missing two backing opportunities is something that will probably happen over the course of a few hours/days, so I think the parachain will quickly converge back to the functioning mode from before this PR.

Comment on lines -360 to -362
// This needs to change to support elastic scaling, but for continuously
// scheduled chains this ensures that the backlog will grow steadily.
for n_built in 0..2 {
A Member commented on these lines:

What we actually want is to only ever produce two blocks in total on top of the last backed block. What can currently happen is that, if we failed to back a block, we put another block on top (assuming an unincluded segment length of 3). So, if everything works as expected, we need the unincluded segment length of 3, but if a block failed to be backed we should go back to two blocks from the included block.
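The rule described above could be sketched as follows (a hedged illustration; the function and argument names are hypothetical, not the actual collator API):

```rust
/// How many blocks to author on top of the current best block so that
/// at most two blocks ever sit on top of the last *backed* block.
/// `depth_from_backed` is the number of already-authored blocks above
/// the last backed block.
fn blocks_to_author(depth_from_backed: u32) -> u32 {
    2u32.saturating_sub(depth_from_backed)
}

fn main() {
    assert_eq!(blocks_to_author(0), 2); // nothing pending: build two ahead
    assert_eq!(blocks_to_author(1), 1); // one unbacked block: top up to two
    assert_eq!(blocks_to_author(3), 0); // backing stalled: stop authoring
}
```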

@skunert (Contributor, Author) commented Oct 16, 2024

After reading the comments and thinking about it some more, I have come to the conclusion that my initial assumption was wrong. While it's not super useful to build these extra blocks, we also don't really gain anything from the proposed change. Will close here.

@skunert skunert closed this Oct 16, 2024
@alexggh (Contributor) commented Oct 17, 2024

> After reading the comments and thinking about it some more, I have come to the conclusion that my initial assumption was wrong. While it's not super useful to build these extra blocks, we also don't really gain anything from the proposed change. Will close here.

So right now, if PB(x) is backed at RB(n), then PB(x) is created when RB(n-2) is imported, a full 2 blocks (12s) before it gets backed. We can't reduce the unincluded segment to 2, because then PB(x) would be created at the beginning of RB(n-1), and if authoring takes 2s you don't have time to also run the backing protocol, but ....

What if in this loop we do something like sleep(2 * seconds * current_unincluded_blocks) before we create parachain blocks? In that case we move the creation of PB(x) to RB(n-2) + 4s, which is really close to the ideal time to start creating the block PB(x) if you want it to be backed at RB(n).
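As a hedged sketch of that proposal (the names are hypothetical, and the real collator loop is async rather than thread-blocking):

```rust
use std::time::Duration;

/// Delay authoring by 2s per block already sitting in the unincluded
/// segment, so that PB(x) is created closer to the moment it can
/// actually be backed on the relay chain.
fn authoring_delay(current_unincluded_blocks: u64) -> Duration {
    Duration::from_secs(2 * current_unincluded_blocks)
}

fn main() {
    // Two blocks already pending: start authoring at RB(n-2) + 4s.
    assert_eq!(authoring_delay(2), Duration::from_secs(4));
}
```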

@skunert (Contributor, Author) commented Oct 17, 2024

> What if in this loop we do something like sleep(2 * seconds * current_unincluded_blocks) before we create parachain blocks? In that case we move the creation of PB(x) to RB(n-2) + 4s, which is really close to the ideal time to start creating the block PB(x) if you want it to be backed at RB(n).

We could try to improve the timings here a bit, but it is tricky. The building happens in the "main loop", and we might need to build parachain blocks for multiple relay chain forks, each taking 2s. That needs to be factored in and would become quite finicky.
The next step for the slot-based collator is to make it more flexible and extract a signaling task; this will make it much easier to test different timing models, which makes more sense to me: #6066

@bkchr bkchr deleted the skunert/lookahead-remove-segment-filling branch October 17, 2024 08:58
4 participants