Transient storage runtime host function #359

gavofyork · 2020-03-25T10:58:19Z

Right now we abuse storage for intra-block data such as block number, parent hash and block author as well as various housekeeping information and flags like whether we set the uncles Authorship::DidSetUncles.

When initially writing, this incurs an extra trie lookup, which is slow. Instead there should be another host API, which works exactly like set_storage/get_storage but has no trie backing, so it never tries to lookup the value in the trie, nor does it write the value at the end of the block.

The text was updated successfully, but these errors were encountered:

NikVolf · 2020-04-03T12:30:57Z

So I've seen potential users of this api:

in authorship (Author: Option<AccountId>, DidSetUncles: bool)
in babe (Initialized: Option<schnorrkel::RawVRFOutput>)
contracts (GasSpent: u64)
finality-tracker (Update: BlockNumber)
timestamp (DidUpdate: bool)

we also need to change decl_storge! and add "transient" option for code generation, since other modules do readings of those values

bkchr · 2020-04-03T12:47:24Z

Everything in system that is removed in on_finalize.

NikVolf · 2020-04-07T12:40:59Z

@bkchr in finalize and initialize (events)

bkchr · 2020-04-07T12:58:06Z

No, events are deleted in the next block and need to stay in storage to be inspectable.

NikVolf · 2020-04-07T13:03:09Z

But isn't it a hack?
If the lifetime of transient storage data is extended by client or whatever is using event data, this might not be required?

bkchr · 2020-04-07T13:37:49Z

I would call it sort of hack. It would need to be stored in the database anyway to make it accessible for the UI. So, any change here probably just requires a ton of changes to make it compatible.

NikVolf · 2020-04-08T06:13:30Z

I wonder if we can avoid calls into the host at all and just keep everything in the runtime memory

bkchr · 2020-04-08T07:44:53Z

No. Memory is resetted between calls.

NikVolf · 2020-04-08T07:48:50Z

But for block execution it does not matter?

bkchr · 2020-04-08T08:12:10Z

But for Block production.

kianenigma · 2020-04-08T18:58:50Z

So I've seen potential users of this api:

in authorship (Author: Option<AccountId>, DidSetUncles: bool)

in babe (Initialized: Option<schnorrkel::RawVRFOutput>)

contracts (GasSpent: u64)

finality-tracker (Update: BlockNumber)

timestamp (DidUpdate: bool)

we also need to change decl_storge! and add "transient" option for code generation, since other modules do readings of those values

transaction_payment::NextWeightMultiplier.

(cc @shawntabrizi this is very similar to an idea that you also had)

shawntabrizi · 2020-04-08T19:50:46Z

My idea was to have some hook in executive/system that reads values from storage (like block number, block author, etc..) and places them in memory and allows all pallets quick access to them without incurring any storage costs.

There should additionally be hooks so that any pallet may introduce some data to be added to this hook, so if someone writes a "system critical pallet" where values from it are used in every block, they can instead take advantage of this in-memory storage.

xlc · 2022-10-04T11:06:57Z

This helps with #278 and also have many other use cases. Can we have this prioritized.

cheme · 2022-10-04T14:19:34Z

In itself transient storage do not look like a lot of work to me (I just did draft it here paritytech/substrate@master...cheme:transient_storage), but it did add some host function and can make a few thing a bit more complex (I remember a discussion about making proof of execution for individual extrinsic where we could call storage_root between extrinsic to allow it: this would not be possible anymore). Also it open some question about limiting memory usage.

bkchr · 2022-10-04T14:22:48Z

this would not be possible anymore

Why?

Also it open some question about limiting memory usage.

In what way?

cheme · 2022-10-04T14:27:17Z

It would be possible but you would need to either attach the transient storage state at start of extrinsic or a root of it, but anyway I don't think it is a use case we want to support (proof of execution of a single extrinsic of a block).

I just fear that the host function would be use to store big blobs, but that was a bit silly (a runtime should not allow doing so).

burdges · 2022-10-04T14:47:26Z

We'll want other storage options with other proving semantics ala KZG, so I'm worried if merely doing a hash map is a problem. It's also simpler if you know some storage type simply goes away, but..

If you want a transparent option, then you could mark state writes as transient in the block, and then declare the block invalid if not removed when the block concludes. In this way, the block producer could add these transient markers automatically.

pepyakin · 2022-10-04T18:06:24Z

This helps with #278 and also have many other use cases. Can we have this prioritized.

Since then, we also had some discussions which may be useful to contextualize this a bit.

The discussion about custom storage tries. I am not able to find the issue to link, but the general idea is to give the Substrate Runtime ability to manipulate data structures other than the vanilla key-value storage available right now. Such a structure would be similar to the existing child-tries (and in fact, CTs were designed to accommodate different types of tries already), but the allowed data patterns or rules will be different. Besides different tree types, one of the use cases was transient data structures. Another would be append-only data structures. Each type of the trie would be manipulated by a specifically designed API.
There is another angle that storage tries could be attacked: Block building within the same wasm memory? substrate#10557. Right now, during block building, we assume that each call/extrinsic has a pristine memory space untouched by prior runtime calls. If we lift that constraint, it will be possible to have a similar construction to transient storage described here, but not equivalent. Still worth keeping this option in mind.

bkchr · 2022-10-04T20:24:40Z

It would be possible but you would need to either attach the transient storage state at start of extrinsic or a root of it, but anyway I don't think it is a use case we want to support (proof of execution of a single extrinsic of a block).

I just fear that the host function would be use to store big blobs, but that was a bit silly (a runtime should not allow doing so).

As these elements are not part of the trie, do we really need to add them to the storage root? I don't think so.

cheme · 2022-10-05T07:05:21Z

It would be possible but you would need to either attach the transient storage state at start of extrinsic or a root of it, but anyway I don't think it is a use case we want to support (proof of execution of a single extrinsic of a block).
I just fear that the host function would be use to store big blobs, but that was a bit silly (a runtime should not allow doing so).

As these elements are not part of the trie, do we really need to add them to the storage root? I don't think so.

As part of this issue of course not, as part of the use case where a modified system runtime would call and store storage_root between each extrinsic (to create proof for single extrinsic), it may indeed not be needed and in case it is (let's say to harden stuff or for legal reason) the transient storage could still be stored to storage before intermediary storage_root calls (as long as the transient storage provide an iterator).

bkchr · 2022-10-05T10:05:52Z

You should only store information in transient storage that are not important for the state, IMO. Aka you would also not need it for any intermediate proof. Data that isn't being able to be accessed later, doesn't need to be part of the storage root.

arkpar · 2022-10-05T10:21:15Z

Related: paritytech/substrate#9170

A relatively simple solution that I suggested there is to add a revert function. The actual trie lookup is happening when the transient values are deleted from storage. If there was a function that deleted them only from the memory overlay, that would be enough to prevent the trie lookup.

bkchr · 2022-11-08T13:11:43Z

Something semi related would be to have some kind of storage item that you can write in a block, but reading it in a block would always return the old value. Currently such a behavior could be achieved by having some intermediate storage item that you "move" in on_finalize to the correct storage item. This could be used in Aura for example for the authorities. Currently when there is a new session we directly overwrite the authorities. This leads to things like FindAuthor returning the author based on the new set (which is clearly wrong).

cheme · 2022-11-08T13:16:02Z

Something semi related would be to have some kind of storage item that you can write in a block, but reading it in a block would always return the old value

I did implement it in the past (as a way to avoid lock in an experimental thread branch), it is pretty straightforward to do (just don't query the change overlay in state-machine).

bkchr · 2022-11-08T13:36:44Z

Yeah, it shouldn't be too hard to implement. Just will require some new host function.

Polkadot-Forum · 2023-06-16T13:30:59Z

This issue has been mentioned on Polkadot Forum. There might be relevant details there:

https://forum.polkadot.network/t/generalized-storage-proofs/1315/5

shawntabrizi · 2023-06-16T15:19:32Z

Just FYI this is one of the features Cosmos SDK already provides to their users:

https://docs.cosmos.network/v0.46/core/store.html#transient-store

burdges · 2023-06-16T17:21:30Z

Anyone know if statics already just work in substrate?

bkchr · 2023-06-20T22:03:59Z

Anyone know if statics already just work in substrate?

No.

JoshOrndorff · 2023-08-11T16:46:49Z

Doesn't the storage overlay optimization prevent things that are stored and also removed before the end of the block from going to the db anyway?

burdges · 2023-08-15T05:59:53Z

A PoV block needs the copath elements in the PoV data to prove there was nothing there already.

It obviously makes no sense to use storage for ephemeral stuff, instead execute_block should be passing some &mut T between the components it calls.

bkchr · 2023-08-16T11:43:59Z

Doesn't the storage overlay optimization prevent things that are stored and also removed before the end of the block from going to the db anyway?

The point is that you should not be required to clear this data at the end and it is thrown away automatically for you.

* relay-ethereum-client * use relay-ethereum-client from ethereum-poa-relay * cargo fmt --all * #![warn(missing_docs)] * EthereumRpcClient -> EthereumClient * make EthereumHeadersSyncPipeline private * return concrete type from crate::new * cleanup dependencies * *self -> self * remove trait Client * sort deps

gavofyork added Z2-medium I9-optimisation An enhancement to provide better overall performance in terms of time-to-completion for a task. labels Mar 25, 2020

NikVolf self-assigned this Mar 25, 2020

NikVolf closed this as completed Apr 3, 2020

NikVolf reopened this Apr 3, 2020

bkchr unassigned NikVolf Mar 17, 2022

shawntabrizi mentioned this issue Aug 24, 2023

[FRAME Core] Remove without_storage_info on pallets #323

Open

40 tasks

bkchr mentioned this issue Oct 4, 2022

In block in memory storage paritytech/substrate#12415

Closed

kianenigma mentioned this issue Oct 5, 2022

Get rid of junk in storage proofs. paritytech/substrate#9170

Open

gavofyork mentioned this issue Oct 28, 2022

Vision: Host functions and database support for non-Merklised-persistent data structures #245

Open

juangirini added the T1-runtime label Jun 8, 2023

juangirini transferred this issue from paritytech/substrate Aug 24, 2023

the-right-joyce added T1-FRAME This PR/Issue is related to core FRAME, the framework. D1-medium Can be fixed by a coder with good Rust knowledge but little knowledge of the codebase. and removed T1-runtime labels Aug 25, 2023

JoshOrndorff mentioned this issue Sep 29, 2023

Accumulators for intra-block book-keeping Off-Narrative-Labs/Tuxedo#105

Closed

This was referenced Jun 5, 2024

Update polkadot-sdk from v1.7.0 to v1.11.0 moondance-labs/tanssi#573

Closed

Update polkadot-sdk from v1.10.0 to v1.11.0 moondance-labs/tanssi#577

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Transient storage runtime host function #359

Transient storage runtime host function #359

gavofyork commented Mar 25, 2020

NikVolf commented Apr 3, 2020 •

edited

Loading

bkchr commented Apr 3, 2020

NikVolf commented Apr 7, 2020

bkchr commented Apr 7, 2020

NikVolf commented Apr 7, 2020

bkchr commented Apr 7, 2020

NikVolf commented Apr 8, 2020

bkchr commented Apr 8, 2020

NikVolf commented Apr 8, 2020

bkchr commented Apr 8, 2020

kianenigma commented Apr 8, 2020

shawntabrizi commented Apr 8, 2020

xlc commented Oct 4, 2022 •

edited

Loading

cheme commented Oct 4, 2022

bkchr commented Oct 4, 2022

cheme commented Oct 4, 2022

burdges commented Oct 4, 2022

pepyakin commented Oct 4, 2022

bkchr commented Oct 4, 2022 •

edited

Loading

cheme commented Oct 5, 2022

bkchr commented Oct 5, 2022

arkpar commented Oct 5, 2022 •

edited

Loading

bkchr commented Nov 8, 2022

cheme commented Nov 8, 2022 •

edited

Loading

bkchr commented Nov 8, 2022

Polkadot-Forum commented Jun 16, 2023

shawntabrizi commented Jun 16, 2023 •

edited

Loading

burdges commented Jun 16, 2023

bkchr commented Jun 20, 2023

JoshOrndorff commented Aug 11, 2023

burdges commented Aug 15, 2023

bkchr commented Aug 16, 2023

Transient storage runtime host function #359

Transient storage runtime host function #359

Comments

gavofyork commented Mar 25, 2020

NikVolf commented Apr 3, 2020 • edited Loading

bkchr commented Apr 3, 2020

NikVolf commented Apr 7, 2020

bkchr commented Apr 7, 2020

NikVolf commented Apr 7, 2020

bkchr commented Apr 7, 2020

NikVolf commented Apr 8, 2020

bkchr commented Apr 8, 2020

NikVolf commented Apr 8, 2020

bkchr commented Apr 8, 2020

kianenigma commented Apr 8, 2020

shawntabrizi commented Apr 8, 2020

xlc commented Oct 4, 2022 • edited Loading

cheme commented Oct 4, 2022

bkchr commented Oct 4, 2022

cheme commented Oct 4, 2022

burdges commented Oct 4, 2022

pepyakin commented Oct 4, 2022

bkchr commented Oct 4, 2022 • edited Loading

cheme commented Oct 5, 2022

bkchr commented Oct 5, 2022

arkpar commented Oct 5, 2022 • edited Loading

bkchr commented Nov 8, 2022

cheme commented Nov 8, 2022 • edited Loading

bkchr commented Nov 8, 2022

Polkadot-Forum commented Jun 16, 2023

shawntabrizi commented Jun 16, 2023 • edited Loading

burdges commented Jun 16, 2023

bkchr commented Jun 20, 2023

JoshOrndorff commented Aug 11, 2023

burdges commented Aug 15, 2023

bkchr commented Aug 16, 2023

NikVolf commented Apr 3, 2020 •

edited

Loading

xlc commented Oct 4, 2022 •

edited

Loading

bkchr commented Oct 4, 2022 •

edited

Loading

arkpar commented Oct 5, 2022 •

edited

Loading

cheme commented Nov 8, 2022 •

edited

Loading

shawntabrizi commented Jun 16, 2023 •

edited

Loading