blockchain: Convert to direct single-step reorgs. #1500

davecgh · 2018-10-16T19:43:32Z

This requires PR #1471.

This modifies the chain reorganization logic to perform the reorg directly, one block at a time, with rollback in the case of failure, as opposed to the existing memory-based two-step approach. The new approach is more optimized for the typical case, better handles large reorgs, makes it possible to implement better caching strategies, and helps provide a path to decouple the chain processing and connection code from the download logic. It also removes the cached stxos from the view since the aforementioned changes make them no longer necessary.

A side effect of these changes is that it is no longer possible to know if a reorg will succeed before actually performing it, so the NTReorganization notification is now sent after a successful reorg. The notification really should have been sent after the reorg all along.

Prior to these changes, chain reorganization used a two-step approach: the first step checked all of the blocks along the reorg path in memory, and the second step actually performed the reorg if those checks succeeded. While that approach does have some benefits in terms of avoiding any intermediate mutation to the current best chain for failed reorgs, and thus not requiring a rollback in that case, it also has some disadvantages such as not scaling well with large reorgs, being more difficult to make use of different caching strategies, and hindering the ability to decouple the connection code from the download logic.

In a certain sense, the approach this replaces assumed that a reorg would fail and took measures to detect that condition prior to performing the reorg, while the new approach assumes the reorg will succeed and rolls back the changes in the very rare case it doesn't. This is an acceptable and safe assumption because the proof-of-work requirements make it exceedingly expensive to create blocks that are valid enough to trigger a reorg yet ultimately end up failing to connect. Miners are therefore heavily disincentivized from creating such invalid blocks, and attackers are unable to easily create them either. Even in the case of attack, the only result would be nodes performing slightly more database updates than with the previous approach.
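
For readers unfamiliar with the pattern, the following is a minimal, self-contained sketch of the optimistic single-step idea described above. It is not the code in this PR: the `block`, `chain`, `connect`, `disconnect`, and `reorganize` names are hypothetical, validation is reduced to a boolean flag, and the chain is a plain slice rather than a database. It only illustrates applying a reorg one block at a time, rolling back on failure, and notifying only after success.

```go
package main

import (
	"errors"
	"fmt"
)

// block is a hypothetical stand-in for a chain block. The valid flag
// simulates whether the block passes full validation when connected.
type block struct {
	name  string
	valid bool
}

// chain is a toy best chain with the tip at the end of the slice.
type chain struct {
	blocks []block
}

// connect validates b and, if it passes, makes it the new tip.
func (c *chain) connect(b block) error {
	if !b.valid {
		return errors.New("block " + b.name + " failed to connect")
	}
	c.blocks = append(c.blocks, b)
	return nil
}

// disconnect removes and returns the current tip.
func (c *chain) disconnect() block {
	tip := c.blocks[len(c.blocks)-1]
	c.blocks = c.blocks[:len(c.blocks)-1]
	return tip
}

// reorganize detaches numDetach blocks from the tip and then attaches the
// given blocks one at a time. If any block fails to connect, every step
// taken so far is undone so the chain ends up exactly as it started.
// notify is only called once the whole reorg has succeeded.
func (c *chain) reorganize(numDetach int, attach []block, notify func()) error {
	if numDetach < 0 || numDetach > len(c.blocks) {
		return errors.New("cannot detach more blocks than the chain holds")
	}

	// Detach the old branch, remembering the blocks (tip first).
	detached := make([]block, 0, numDetach)
	for i := 0; i < numDetach; i++ {
		detached = append(detached, c.disconnect())
	}

	for i, b := range attach {
		err := c.connect(b)
		if err == nil {
			continue
		}
		// Roll back: drop the i blocks of the new branch connected so far.
		for j := 0; j < i; j++ {
			c.disconnect()
		}
		// Restore the old branch in its original order. These blocks were
		// already part of the best chain, so this sketch skips revalidation.
		for j := len(detached) - 1; j >= 0; j-- {
			c.blocks = append(c.blocks, detached[j])
		}
		return err
	}

	notify()
	return nil
}

func main() {
	c := &chain{blocks: []block{{"g", true}, {"a1", true}, {"a2", true}}}
	notify := func() { fmt.Println("reorganization notification sent") }

	err := c.reorganize(2, []block{{"b1", true}, {"b2", false}}, notify)
	fmt.Println("failed reorg:", err, "chain length:", len(c.blocks))

	err = c.reorganize(2, []block{{"b1", true}, {"b2", true}, {"b3", true}}, notify)
	fmt.Println("successful reorg:", err, "chain length:", len(c.blocks))
}
```

Note that only the detached blocks are held while the reorg is in flight, and no separate in-memory validation pass over the whole reorg path is needed. The cost of the rollback path is paid only in the rare failure case.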

The following results show the difference between performing the large reorg full block tests before and after these changes:

before: 4.3GB memory usage, 2m18.629s to complete
after:  2.8GB memory usage, 2m04.056s to complete

As can be seen, the new approach takes much less memory and is also a bit faster.
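
To put the figures above in relative terms, here is a small sketch that computes the percentage reductions from the reported numbers. The helper names `pctReduction` and `durationReduction` are made up for this example; the inputs are taken directly from the results listed above.

```go
package main

import (
	"fmt"
	"time"
)

// pctReduction returns how much smaller after is than before, expressed
// as a percentage of before.
func pctReduction(before, after float64) float64 {
	return (before - after) / before * 100
}

// durationReduction parses two durations in Go's duration syntax
// (for example "2m18.629s") and returns the percentage reduction in
// elapsed time.
func durationReduction(before, after string) (float64, error) {
	b, err := time.ParseDuration(before)
	if err != nil {
		return 0, err
	}
	a, err := time.ParseDuration(after)
	if err != nil {
		return 0, err
	}
	return pctReduction(b.Seconds(), a.Seconds()), nil
}

func main() {
	// Memory usage in GB, as reported above.
	mem := pctReduction(4.3, 2.8)

	// Completion times, as reported above.
	elapsed, err := durationReduction("2m18.629s", "2m04.056s")
	if err != nil {
		fmt.Println("bad duration:", err)
		return
	}

	fmt.Printf("memory: %.1f%% lower, time: %.1f%% lower\n", mem, elapsed)
}
```

This works out to roughly 35% less memory and about 10.5% less time for the large reorg full block tests.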

This is work towards #1145.

dajohi

testnet miner tOK

davecgh added this to the 1.4.0 milestone Oct 16, 2018

davecgh mentioned this pull request Oct 16, 2018

Multi-peer Checklist #1145

Open

33 tasks

davecgh force-pushed the blockchain_single_step_reorg branch from 3cd19f3 to 4371c09 October 16, 2018 19:46

dajohi approved these changes Oct 16, 2018

davecgh force-pushed the blockchain_single_step_reorg branch from 4371c09 to 49c997e October 16, 2018 20:56

dnldd reviewed Oct 16, 2018

blockchain/chain.go (outdated, resolved)

davecgh force-pushed the blockchain_single_step_reorg branch from 49c997e to aeb7351 October 16, 2018 21:24

dnldd approved these changes Oct 16, 2018

davecgh force-pushed the blockchain_single_step_reorg branch 4 times, most recently from de7528d to 0609124 October 16, 2018 22:38

alexlyp approved these changes Nov 6, 2018

davecgh force-pushed the blockchain_single_step_reorg branch from 0609124 to 568e9a5 November 9, 2018 23:30

davecgh mentioned this pull request Nov 9, 2018

multi: Migration for utxo set semantics reversal. #1520

Merged

davecgh force-pushed the blockchain_single_step_reorg branch from 568e9a5 to 9c4569a November 12, 2018 22:00

davecgh merged commit 9c4569a into decred:master Nov 12, 2018

davecgh deleted the blockchain_single_step_reorg branch November 12, 2018 22:04

This was referenced Nov 16, 2018

reorg notification now comes after block connect ntfns decred/dcrdata#823

Closed

rpcserver: bump version to 5.0.0 #1531

Merged
