Append overlay refactor proposal #13940

cheme · 2023-04-18T10:05:23Z

This branch propose to avoid clones in append by storing offset and size in previous overlay depth.
That way on rollback we can just truncate and change size of existing value.
To avoid copy it also means that :

append on new overlay layer if there is an existing value: create a new Append entry with previous offsets, and take memory of previous overlay value.
rollback on append: restore value by applying offsets and put it back in previous overlay value
commit on append: appended value overwrite previous value (is an empty vec as the memory was taken). offsets of commited layer are dropped, if there is offset in previous overlay layer they are maintained.
set value (or remove) when append offsets are present: current appended value is moved back to previous overlay value with offset applied and current empty entry is overwrite (no offsets kept).

The modify mechanism is not needed anymore.
This branch lacks testing and break some existing genericity (bit of duplicated code), but good to have to check direction.

Generally I am not sure if it is worth or we just should favor differents directions (transients blob storage for instance), as the current append mechanism is a bit tricky (having a variable length in first position means we sometime need to insert in front of a vector).

cheme · 2023-04-25T07:28:20Z

Please don't review yet (will change design a bit to be closest to paritytech/polkadot-sdk#30)

- append always set an append item - move data only between two consecutive append - store size and offset at current depth

ggwpez · 2023-04-25T17:42:29Z

Generally I am not sure if it is worth or we just should favor differents directions (transients blob storage for instance), as the current append mechanism is a bit tricky (having a variable length in first position means we sometime need to insert in front of a vector).

So this applies to all appendable storage structus?
Not sure if it is worth it, but could we selectively add this with to a StorageValue with an append_only attribute?
AFAIK we only need this fast truncate on append-only vectors like System::Events, but the advantage is that they are never read.

Otherwise we should probably add a test to check that this solution has a low memory footprint on deeply nested reverts (as expected).

cheme · 2023-04-26T07:37:02Z

So this applies to all appendable storage structus?

yes all storage item containing a value. But as I did comment a bit in the code, using a same key value with sp_io::set and later with sp_io::append is really bad and no sensible runtime should allow it. (if you got a value that is encoded Vec of length 3 so starting with a compact, call to append with a 4 byte encoded vec will change the current byte length to 4 and append the content, then reading will get nothing sensible.

Not sure if it is worth it, but could we selectively add this with to a StorageValue with an append_only attribute?

So yes in practice it could be in a different key space, but that is basically what transient storage does (need to copy final value in some finalize to a standard state value though).
Alternatively (or extending the transient storage) a child trie/state could be exclusive to append value.

AFAIK we only need this fast truncate on append-only vectors like System::Events, but the advantage is that they are never read.

yes, but we cannot ensure it. In my next change, the storage will make this usecase better (with paritytech/polkadot-sdk#30, if only writes followed by a single read (root calculation), then we will not have anymore the costy resize each time the number of element Compact change in size (or a single one)).

cheme · 2023-04-26T14:08:43Z

I think I did align with paritytech/polkadot-sdk#30 , should be reviewable (requires more test, especially since most test access on every transaction change and thus render the data).

cheme · 2023-05-16T16:35:17Z

If this works out, then we maybe dont need #14120

I really think storing big value is a mistake so at some point paged list really make sense.
(ability to produce way smaller proofs at small cost)

ggwpez · 2023-05-16T17:39:55Z

If this works out, then we maybe dont need #14120

I really think storing big value is a mistake so at some point paged list really make sense. (ability to produce way smaller proofs at small cost)

Ah right. So it depends on the use-case. For System::Events, we can just use your MR. But for para-chain cases we maybe rather want to use the paged list.

cheme · 2023-05-16T17:44:04Z

For events we should use transient storage (only write root of an event trie in state and still index data in an external db), but this is probably not here soon, so yes short term this is more pragmatic.

cheme · 2023-05-17T09:22:55Z

I went through a bit of fuzzing and fix a few issue, think this would be good to review now.

paritytech-cicd-pr · 2023-05-17T09:23:25Z

The CI pipeline was cancelled due to failure one of the required jobs.
Job name: test-linux-stable-int
Logs: https://gitlab.parity.io/parity/mirrors/substrate/-/jobs/2850187

…on removal.

cheme added 5 commits April 17, 2023 15:43

append variant with initial sizes.

03beafd

adapt test

50cb263

append twice code path

444f54c

a bit more code path

8e6c421

rename

d77ac22

cheme added the A3-in_progress Pull request is in progress. No review needed at this stage. label Apr 25, 2023

cheme added 3 commits April 25, 2023 15:38

use fields in enum append

431d5f5

simplify logic:

81ec941

- append always set an append item - move data only between two consecutive append - store size and offset at current depth

renamings

4a6c7d5

cheme added 3 commits April 26, 2023 11:30

non read access logic

fb2b703

style

cf6da1a

Mutable read access and render on read.

fd59ad6

cheme requested a review from koute as a code owner April 26, 2023 14:05

cheme added A0-please_review Pull request needs code review. and removed A3-in_progress Pull request is in progress. No review needed at this stage. labels Apr 26, 2023

cheme added 10 commits April 26, 2023 16:47

fix set in same overlay logic

757e475

Fix commit

cc67a05

additional assert

509e544

clippying

1dc8fa7

Merge branch 'master' into append

732e85e

initial fuzzing impl

9282ed8

fuzzer and fixes

f10b873

fix reference impl

0311300

fix depth monitoring

ef4d370

fix bug

4a898ab

cheme requested a review from a team May 16, 2023 15:41

fix similar corner case

68e3ff4

cheme added 3 commits May 16, 2023 20:14

new item to have variance in compact len encoding and break again

4a54331

Decreasing size can actuallly happen

5b48e97

factor

70c896d

cheme added B0-silent Changes should not be mentioned in any release notes C1-low PR touches the given topic and has a low impact on builders. D9-needsaudit 👮 PR contains changes to fund-managing logic that should be properly reviewed and externally audited labels May 17, 2023

no std

a5f0704

kianenigma self-requested a review June 19, 2023 12:27

Merge branch 'master' into append

d5b0db4

cheme requested a review from a team July 13, 2023 07:15

cheme mentioned this pull request Jul 30, 2023

Child trie and state machine refactors #13006

Open

cheme added 3 commits July 31, 2023 11:05

Merge branch 'master' into append

1f58a09

missing one

4b718c0

fmt

c5d084e

cheme requested a review from andresilva as a code owner July 31, 2023 09:08

cheme added 9 commits August 2, 2023 09:12

Merge branch 'master' into append

aa496e6

Merge branch 'master' into append

ec4bbe0

doc

b85347b

slightly enhance append function readability.

c780830

_simple -> _offchain

017d2e9

slight commit readability changes.

ef3abee

Using enum to make logic explicit when reading.

283d79a

from_parent is necessary

f24acbd

Merge branch 'master' into append, make changes for backend transacti…

03e520b

…on removal.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Append overlay refactor proposal #13940

Append overlay refactor proposal #13940

cheme commented Apr 18, 2023

cheme commented Apr 25, 2023

ggwpez commented Apr 25, 2023

cheme commented Apr 26, 2023 •

edited

Loading

cheme commented Apr 26, 2023

cheme commented May 16, 2023

ggwpez commented May 16, 2023

cheme commented May 16, 2023

cheme commented May 17, 2023

paritytech-cicd-pr commented May 17, 2023

Append overlay refactor proposal #13940

Are you sure you want to change the base?

Append overlay refactor proposal #13940

Conversation

cheme commented Apr 18, 2023

cheme commented Apr 25, 2023

ggwpez commented Apr 25, 2023

cheme commented Apr 26, 2023 • edited Loading

cheme commented Apr 26, 2023

cheme commented May 16, 2023

ggwpez commented May 16, 2023

cheme commented May 16, 2023

cheme commented May 17, 2023

paritytech-cicd-pr commented May 17, 2023

cheme commented Apr 26, 2023 •

edited

Loading