core/state/snapshot: less disk reads #6

holiman · 2021-03-30T06:29:22Z

This PR attempts to minimize the number of disk reads. When we have a slice of snapshot values, which do not match the trie data, we currently iterates the disk to retrieve the canonical data.
What this PR does, is commit the (incorrect) snapshot data to a trie, which will be 99% correct. When iterating the trie, we then use the snapshot-trie-database for resolving hashes.
When doing so, we can read 99% of the leaves the from the memory db instead of resolving from disk.

This is still work in progress, needs to be tidied up. It can also be implemented differently. The upside of this particular way to implement it is that most of the modifications are performed in the node iterator, and does not touch the trie database.

* core/state/snapshot: refactor * core/state/snapshot: tiny fix and polish Co-authored-by: rjl493456442 <garyrong0905@gmail.com>

core/state/snapshot: less copy core/state/snapshot: revert split loop core/state/snapshot: handle storage becoming empty, improve test robustness core/state: test modified codehash core/state/snapshot: polish

…eeded

holiman · 2021-03-30T06:30:36Z

Generating without this PR:

Generating with this PR:

^ Note that the two runs are on the same machine at different times, and the latter image shows a much shorter time segment

rjl493456442

Code wise I think it's OK, just some nitpicks

core/state/snapshot/generate.go

rjl493456442 · 2021-03-30T11:07:37Z

core/state/snapshot/generate.go

 	var (
 		trieMore       bool
-		iter           = trie.NewIterator(tr.NodeIterator(origin))
+		nodeIt         = tr.NodeIterator(origin)


I think we can pass the auxiliary database here. iter = trie.NewIterator(tr.NodeIteratorWithDatabase(origin))

rjl493456442 · 2021-03-30T11:08:07Z

core/state/snapshot/generate.go

@@ -428,6 +445,7 @@ func (dl *diskLayer) generateRange(root common.Hash, prefix []byte, kind string,
 		istart   time.Time
 		internal time.Duration
 	)
+	nodeIt.AddResolver(snapTrieDb)


We can pass the db directly when we construct the iterator

holiman · 2021-03-30T12:08:36Z

Code wise I think it's OK, just some nitpicks

Yeah, well, one thing that would be kind of neat, or at least interesting to check, is to

not do a second round of trie:ing after the proof has failed, but rather use the db from the proof-phase. Right now I wasn't sue what trie to use where, so I just created a new one for the proof-of-concept,
and if that works, replace the stacktrie with a regular empty trie, so we can use it if the proof is unsuccessful.

rjl493456442 · 2021-03-31T02:10:24Z

Code wise I think it's OK, just some nitpicks

Yeah, well, one thing that would be kind of neat, or at least interesting to check, is to

not do a second round of trie:ing after the proof has failed, but rather use the db from the proof-phase. Right now I wasn't sue what trie to use where, so I just created a new one for the proof-of-concept,

and if that works, replace the stacktrie with a regular empty trie, so we can use it if the proof is unsuccessful.

Yes sure, I can do it. We need to modify the range prover a bit, let me see how to fix it.

holiman · 2021-04-14T20:51:47Z

Closing this in favour of ethereum#22667

[R4R] modify params for Parlia consensus with 21 validators

rjl493456442 and others added 30 commits March 24, 2021 18:44

eth/protocols: persist received state segments

d1eb592

core: initial implementation

48db911

core/state/snapshot: add tests

1b947e6

core, eth: updates

66b6b2b

eth/protocols/snapshot: count flat state size

6dd40fd

core/state: add metrics

7ab41e9

core/state/snapshot: skip unnecessary deletion

656e92d

core/state/snapshot: rename

6546af7

core/state/snapshot: use the global batch

510c00b

core/state/snapshot: add logs and fix wiping

756f0c8

core/state/snapshot: fix

db37731

core/state/snapshot: save generation progress even if the batch is empty

0a7562d

core/state/snapshot: fixes

961ea1a

core/state/snapshot: fix initial account range length

d92e8b5

core/state/snapshot: fix initial account range

554071f

eth/protocols/snap: store flat states during the healing

2f14a15

eth/protocols/snap: print logs

7e2094a

core/state/snapshot: refactor (#4)

84b15a8

* core/state/snapshot: refactor * core/state/snapshot: tiny fix and polish Co-authored-by: rjl493456442 <garyrong0905@gmail.com>

core, eth: fixes

fe5bfb2

core, eth: fix healing writer

05eb370

core, trie, eth: fix paths

820467d

eth/protocols/snap: fix encoding

8fec107

eth, core: add debug log

90c342b

core/state/generate: release iterator asap (#5)

0362f6a

core/state/snapshot: less copy core/state/snapshot: revert split loop core/state/snapshot: handle storage becoming empty, improve test robustness core/state: test modified codehash core/state/snapshot: polish

core/state/snapshot: optimize stats counter

69301f9

core, eth: add metric

6b2a4d4

core/state/snapshot: update comments

3bf34fa

core/state/snapshot: improve tests

81ed2d2

core/state/snapshot: replace secure trie with standard trie

c184620

core/state/snapshot: wrap return as the struct

c50637a

rjl493456442 and others added 18 commits March 24, 2021 18:47

core/state/snapshot: fix abort

2c26c79

core/state/snapshot: more tests (plus failing testcase)

93472dd

core/state/snapshot: more testcases + fix for failing test

d4d89d2

core/state/snapshot: testcase for malformed data

20e19ec

core/state/snapshot: some test nitpicks

2bd65b6

core/state/snapshot: improvements to logging

f412992

core/state/snapshot: testcase to demo error in abortion

16de86c

core/state/snapshot: fix abortion

764969d

cmd/geth: make verify-state report the root

cae0cf2

trie: fix failing test

0c7cd77

core/state/snapshot: add timer metrics

f549b74

core/state/snapshot: fix metrics

ef79890

core/state/snapshot: udpate tests

23aefe3

eth/protocols/snap: write snapshot account even if code or state is n…

0ca10ea

…eeded

core/state/snapshot: fix diskmore check

b0fd55b

core/state/snapshot: review fixes

56059df

poc: another attempt at reducing the lookups

9d2ec41

squashme: remove some debug output

1ea8978

holiman requested a review from rjl493456442 as a code owner March 30, 2021 06:29

rjl493456442 reviewed Mar 30, 2021

View reviewed changes

core/state/snapshot: some minor fixes

0501b13

rjl493456442 force-pushed the fill-snap-exp branch from 9c7813d to 46fc218 Compare April 9, 2021 08:31

holiman mentioned this pull request Apr 14, 2021

core/state/snapshot: reuse memory data instead of hitting disk when generating ethereum/go-ethereum#22667

Merged

holiman closed this Apr 14, 2021

rjl493456442 pushed a commit that referenced this pull request May 28, 2021

Merge pull request #6 from guagualvcha/stale_depth

ce14f2c

[R4R] modify params for Parlia consensus with 21 validators

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core/state/snapshot: less disk reads #6

core/state/snapshot: less disk reads #6

holiman commented Mar 30, 2021

holiman commented Mar 30, 2021

rjl493456442 left a comment

rjl493456442 Mar 30, 2021

rjl493456442 Mar 30, 2021

holiman commented Mar 30, 2021

rjl493456442 commented Mar 31, 2021

holiman commented Apr 14, 2021

core/state/snapshot: less disk reads #6

core/state/snapshot: less disk reads #6

Conversation

holiman commented Mar 30, 2021

holiman commented Mar 30, 2021

rjl493456442 left a comment

Choose a reason for hiding this comment

rjl493456442 Mar 30, 2021

Choose a reason for hiding this comment

rjl493456442 Mar 30, 2021

Choose a reason for hiding this comment

holiman commented Mar 30, 2021

rjl493456442 commented Mar 31, 2021

holiman commented Apr 14, 2021