Stop inclusion on too many disputes #790

eskimor · 2022-08-18T10:11:17Z

One important attack vector on disputes is to try to overwhelm the system with disputes, with the goal to prevent one particular dispute from concluding (the one which would actually resolve "invalid"). E.g. imagine an attacker hacked a few honest nodes and makes them trigger disputes at a high rate (e.g. dispute each and every candidate) to bring the system to its limit and once the attacker is sure the system can no longer keep up, try to back an invalid candidate.

There are of course a couple of counter measures in place, but we should add another safe guard. An important observation is that disputes are supposed to be exceptional, so a high rate of disputes is very suspicious and the only reasons that come to mind for such an event are:

Load testing on a test network
An attack such as described above
A bug, which triggers honest validators to do validation mistakes

We don't need to worry about 1 obviously, we should worry about 2 and keep in mind 3.

Another observation is that we only care about disputes on unfinalized blocks. An attacker can dispute already finalized blocks, but honest validators will not participate in such disputes, hence the attack described above should not be possible with such disputes.

This means for triggering disputes at a high rate an attacker has to rely on unfinalized blocks, which are limited in number - hence the attacker can not hold back disputes for too long and try to fire them all at once, but will be required to produce them continuously - as candidates get included.

The final ingredient: Candidates can only get disputed (with priority at least) if they have been included on some chain, but this is something we can control - regardless of malicious nodes:

Proposal

When importing bitfields in a block, we take into account the number of active disputes we are aware of. If they surpass some conservative number MAX_ACTIVE_DISPUTES_INCLUSION_RATE_LIMIT honest block produces will stop including bitfields in a block and block importers will reject any block which does anyway. This way we limit the supply of new candidates that could get disputed, hence with a carefully picked number for above constant we ensure the dispute system cannot get overwhelmed. Considering that disputes result in serious slashes this number can be made quite low.

Considerations

Why active disputes?

I would suggest to base the rate limiting (not accepting new bitfields) on currently active disputes on the chain. With active meaning, disputes which either have not been concluded yet or have concluded just recently. One could also suggest to base the decision on the number of dispute votes/disputes that get imported in the very block. The issue here is that malicious block producers can decide to just import less disputes to not hit the limit, which would result in a less effective rate limiting.

Why inclusion?

We could also stop backing candidates, but preventing inclusion is actually more effective.

Only included candidates can get disputes with priority.
Backing is implicitly rate limited by stopping inclusion, as new candidates can only get backed once the core gets freed via timeout.
By keeping the core occupied we back-pressure on the backing pipeline, which results in validators to stop backing new candidates (after a short while for asynchronous backing), which makes sense as we don't intend to process them on chain, meaning less wasted work. This in turn frees up resources for processing those disputes.

If we stopped putting backed candidates on chain, the full backing pipeline would keep working and would continue to produce candidates that then just go to waste.

What about the third cause (bugs)?

In case lots of disputes are happening because of bugs, rate limiting makes sense also. While we are still processing disputes at a fast rate, it certainly is good to rate limit disputes a bit that are actually unjustified. Although bugs might be local to only particular parachains. While we could adjust the threshold in a way, that a single parachain or even two cannot realistically reach it, only filtering out candidates for some particular parachains in case disputes are isolated is worth considering.

DoS?

Doesn't this open up a DoS vector, if we stop processing parachain blocks? Yes! But it is ridiculously cost inefficient. Let's say we set MAX_ACTIVE_DISPUTES_INCLUSION_RATE_LIMIT to 10 and max age for active disputes to 1 block. This means you get 10 slashes for making parachains skip two blocks!

Forks

Honest nodes only consider a limited number of forks, therefore an adversary crafting many forks, will not succeed in getting candidates included on those forks, hence they cannot get disputed.

Attacking Finality

An attacker could try via some means to slow down finality in order to concentrate an attack: E.g. wait 100 blocks (preventing finality of those) and then raise disputes for all those candidates (10_000 for 100 candidates per block) all at once.

One good news here is that if finality is halted, block production will slow down. On the other hand, once those disputes are released - candidate inclusion will get stopped immediately, meaning we will catch up eventually. With 6 sessions time for resolving disputes, this eventually will be very likely in time on production networks. If not, we will still roll back the chain when in doubt.

Conclusion

While it is still a good idea to resolve disputes fast, we should not play cat-and-mouse with attackers if we don't have to.

Conceptually it makes sense as well: Should we really keep building chains as if nothing happened, although we know that we are under some intense attack/have serious bugs in the system?

The text was updated successfully, but these errors were encountered:

eskimor · 2022-08-18T10:17:29Z

@burdges Any reasons this would be a bad idea? Considerations from your end on MAX_ACTIVE_DISPUTES_INCLUSION_RATE_LIMIT value and the maximum age we consider a dispute "active"? Goal would be to minimize effects malicious block producers can have on the inclusion blocking.

Even a max age of 1 is already a sensible protection. While an attacker could quite likely make all open disputes conclude in a single block, another malicious block producer right afterwards would be required to could not import any new disputes, then another malicious block producer would be needed not including any disputes, but only bitfields - assuming we also consider disputes imported in the same block. So even a value of 1, requires 3 dishonest block producers in a row to weaken the inclusion rate limit.

eskimor · 2022-08-18T10:38:34Z

@rphmeier I would be interested in your thoughts as well.

eskimor · 2022-08-18T12:20:14Z

DoS argument: If validators are hacked as in the introduction, they don't care about the slash. So this could indeed be used to cause DoS - triggering thousands of disputes would likely result in some level of DoS as well though.

burdges · 2022-08-19T08:40:14Z

@AlistairStewart won't like impacting liveness but yeah I'm fine with blocking inclusions on disputes. We'll move to Sergi's deterministic WASM metering, which makes 3 less likely.

We'd block only parachains with disputes on them? We could block only parachains with two or more disputes..

We do not currently put disputes on chain, right? I've forgotten how this works now..

If disputes are off-chain, then relay chain blocks could simply delay inclusion for one block with an "excessive disputed flag". If enough say this then everything runs fine, except for that parachain. If only some say this, then anyone building upon their block can move the chain forward, but anyone believing there are too many disputes won't build upon or finalize that extension.

eskimor · 2022-08-19T14:05:20Z

About Liveness: To improve matters here, we can also deploy additional mechanisms for disabling spamming validators and such. Anyhow limiting inclusion when the rate of disputes reaches some threshold is a good last resort mechanism in my opinion. If Polkadot is really making use of it's scalability, then the system by definition cannot keep up with disputes for each and every candidate. We could hope that the system is slowing down under such conditions anyway, but guarantees on dispute resolultion would likely get weakend.

I would block all parachains: We are talking about a spam condition here: Disputes can only be triggered by validators and they don't care about what parachain a candidate belongs to - they can dispute anything. For the bug case, it could be that only candidates of a particular parachain causes the bug. On the other hand disputes on only a single parachain, will not overwhelm the system so would not even trigger the blocking threshold. If more parachains are affected, it becomes more and more unrealistic that it is an isolated issue and filtering for particular parachains seems not to be worth the effort.

We do put dispute votes on chain - if only to have proof for slashing. So we have evidence on chain about the amount of disputes happening and can have a consensus based decision on counter measures.

burdges · 2022-08-20T00:34:15Z

I agree disputes spam against all parachains sounds like one attack technique. It's not 100% clear how you leverage this attack yet, and clearly the bug problem exists, hence my question about whether its worth singling out parachains.

We do not believe this complicates consensus then I guess, since disputes already go on-chain. We just need rules like:

if a parachain has two disputes on-chain then we do not back or include on this parachain until all its disputes are resolved or timeout.
If three parachains have disputes then we block all inclusion & backing until we resolve them all and have no active disputes, and invalid dispute slashing increases from 1% (or whatever) to 20% (or 50% or whatever).

We already have a scheme for correlated slashing, but the correlation effect in 1 could stay small, while the correlation effect in 2 matters for liveness. We should ideally decide whether the existing scheme suffices or whether it needs further work. We should ideally eventually decide if we need cross slashing type correlations too, like if grandpa and disputes interact.

pepyakin · 2022-09-13T16:43:59Z

I think actually we need to consult with the parachains team to see how much they could be affected by the loss of liveness.

One thing that comes to mind is CDP-like approaches, where if the para is stopped, it could lead to under collateralization. If we take a general purpose programmable chain (anything that supports permissionless deployed smart-contracts), then there opened ended list that might allow extracting profit from affecting the liveness.

pepyakin · 2022-09-13T16:45:21Z

Sorry if a stupid question, but why do we care about disputes in finalized blocks at all? If there is a legit dispute in a finalized block, there is nothing that protocol can do about it, AFAIK.

eskimor · 2022-09-13T17:12:54Z

Sorry if a stupid question, but why do we care about disputes in finalized blocks at all? If there is a legit dispute in a finalized block, there is nothing that protocol can do about it, AFAIK.

Exactly it happens that I just clarified that in the guide today.

Where do we say here that we do? (Can't find it)

pepyakin · 2022-09-13T17:17:34Z

My question was prompted by this sentence:

An attacker can dispute already finalized blocks, but those disputes will be treated with lower priority than disputes for unfinalized blocks, hence the attack described above should not be possible with such disputes.

eskimor · 2022-09-14T08:58:38Z

Thanks! Yeah that is obsolete already, as the definition of best-effort got changed. Fixed.

eskimor · 2023-01-11T12:42:32Z

With recent updates nodes handled huge load of disputes way better than before. In particular they naturally started back pressuring on backing and inclusion themselves. All the system did was slowing down, which is exactly what this ticket is about. Hence, we can dramatically reduce priority of this ticket and might punt on it completely with further testing and documenting/enforcing the back pressuring behavior.

* Delete obsolete code * Reword some comments

eskimor · 2024-04-05T14:10:37Z

Importing dispute already takes precedence.

* Adding message relayer scripts, reformating send message scripts * Addressing PR feedback * Update README.md Valid . Co-authored-by: Hernando Castano <HCastano@users.noreply.github.com> * Fixing send-message-from-rialto-millau * Fixing send message script from millau to rialot Co-authored-by: Hernando Castano <HCastano@users.noreply.github.com>

eskimor added I2-security labels Aug 18, 2022

eskimor assigned tdimitrov Aug 18, 2022

This was referenced Oct 4, 2022

Dispute slashing/hardening paritytech/polkadot#6099

Closed

Batch vote import in dispute-distribution paritytech/polkadot#5894

Merged

ordian mentioned this issue Aug 31, 2022

disputes: punishment on repeated dispute initiations (stale) #785

Closed

tdimitrov mentioned this issue Sep 26, 2022

Limit dispute votes in the provisioner paritytech/polkadot#4329

Closed

eskimor mentioned this issue Mar 22, 2023

inherent disputes: remove per block initializer and disputes timeout event paritytech/polkadot#6937

Merged

2 tasks

Sophia-Gold transferred this issue from paritytech/polkadot Aug 24, 2023

the-right-joyce added I1-security The node fails to follow expected, security-sensitive, behaviour. T9-parachains_protocol and removed I2-security labels Aug 25, 2023

the-right-joyce added this to parachains team board Oct 18, 2023

the-right-joyce moved this to Backlog in parachains team board Oct 18, 2023

the-right-joyce removed the T9-parachains_protocol label Oct 23, 2023

claravanstaden pushed a commit to Snowfork/polkadot-sdk that referenced this issue Dec 8, 2023

Remove snowbridge parachain (paritytech#790)

b2acc82

* Delete obsolete code * Reword some comments

eskimor closed this as completed Apr 5, 2024

github-project-automation bot moved this from Backlog to Completed in parachains team board Apr 5, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stop inclusion on too many disputes #790

Stop inclusion on too many disputes #790

eskimor commented Aug 18, 2022 •

edited

Loading

eskimor commented Aug 18, 2022 •

edited

Loading

eskimor commented Aug 18, 2022

eskimor commented Aug 18, 2022

burdges commented Aug 19, 2022 •

edited

Loading

eskimor commented Aug 19, 2022 •

edited

Loading

burdges commented Aug 20, 2022 •

edited

Loading

pepyakin commented Sep 13, 2022

pepyakin commented Sep 13, 2022

eskimor commented Sep 13, 2022

pepyakin commented Sep 13, 2022

eskimor commented Sep 14, 2022 •

edited

Loading

eskimor commented Jan 11, 2023

eskimor commented Apr 5, 2024

Stop inclusion on too many disputes #790

Stop inclusion on too many disputes #790

Comments

eskimor commented Aug 18, 2022 • edited Loading

Proposal

Considerations

Why active disputes?

Why inclusion?

What about the third cause (bugs)?

DoS?

Forks

Attacking Finality

Conclusion

eskimor commented Aug 18, 2022 • edited Loading

eskimor commented Aug 18, 2022

eskimor commented Aug 18, 2022

burdges commented Aug 19, 2022 • edited Loading

eskimor commented Aug 19, 2022 • edited Loading

burdges commented Aug 20, 2022 • edited Loading

pepyakin commented Sep 13, 2022

pepyakin commented Sep 13, 2022

eskimor commented Sep 13, 2022

pepyakin commented Sep 13, 2022

eskimor commented Sep 14, 2022 • edited Loading

eskimor commented Jan 11, 2023

eskimor commented Apr 5, 2024

eskimor commented Aug 18, 2022 •

edited

Loading

eskimor commented Aug 18, 2022 •

edited

Loading

burdges commented Aug 19, 2022 •

edited

Loading

eskimor commented Aug 19, 2022 •

edited

Loading

burdges commented Aug 20, 2022 •

edited

Loading

eskimor commented Sep 14, 2022 •

edited

Loading