
try-runtime-cli: Lazy Download #13562

Closed
kianenigma opened this issue Mar 8, 2023 · 24 comments
Labels
- I9-optimisation: An enhancement to provide better overall performance in terms of time-to-completion for a task.
- T1-runtime: This PR/Issue is related to the topic "runtime".
- Z4-involved: Can be fixed by an expert coder with good knowledge of the codebase.

Comments

@kianenigma
Contributor

see: #13109

@kianenigma kianenigma added the Z4-involved Can be fixed by an expert coder with good knowledge of the codebase. label Mar 8, 2023
@liamaharon liamaharon self-assigned this Apr 21, 2023
@liamaharon liamaharon added I9-optimisation An enhancement to provide better overall performance in terms of time-to-completion for a task. U2-some_time_soon Issue is worth doing soon. labels Apr 21, 2023
@ggwpez
Member

ggwpez commented Apr 21, 2023

I tried out subway to see if it could handle the batch requests that you are currently fixing in one MR, but it did not work. However, it is good for caching repeated queries.

@liamaharon
Contributor

> I tried out subway to see if it could handle the batch requests that you are currently fixing in one MR, but it did not work. However, it is good for caching repeated queries.

This could be useful, but worth keeping in mind that (unless using a remote node) the biggest bottleneck is merkleizing the keys when loading them into `TestExternalities`, rather than fetching them from the node.

@ggwpez
Member

ggwpez commented Apr 21, 2023

Yes, there could also be a much more optimized way to do this, like having an export-state RPC or something that encodes and sends the whole trie. But yeah, the lazy approach should also help a lot.

@kianenigma
Contributor Author

Also, I think Centrifuge's fudge might have the right inspirational piece of code that lets you have Ext be backed by a database path: https://github.com/centrifuge/fudge

@xlc
Contributor

xlc commented Apr 21, 2023

> I tried out subway to see if it could handle the batch requests that you are currently fixing in one MR, but it did not work. However, it is good for caching repeated queries.

Batch requests are not yet implemented, but they are trivial to support.

@bkchr
Member

bkchr commented Apr 21, 2023

> something that encodes and sends the whole trie

This is state sync. This downloads the entire trie at a given block. It is used by warp sync.

> Also, I think Centrifuge's fudge might have the right inspirational piece of code that lets you have Ext be backed by a database path: https://github.com/centrifuge/fudge

What do you mean by "backed by a database path"? In "normal mode", Ext is always operating on a db, but I assume you want a db to cache your queries?

@liamaharon
Contributor

> This is state sync. This downloads the entire trie at a given block. It is used by warp sync.

Thanks @bkchr, this sounds very worth looking into. Perhaps not for lazy download, but for saving the entire state to disk so it can quickly be loaded up again on subsequent runs (existing snapshots created by `create-snapshot` still take 300+ seconds to load on my M2 Max MacBook Pro, because building the trie is so expensive).

I've been meaning to create an issue about improving the performance of snapshot mode; I'll mention this as an option to consider.

@xlc
Contributor

xlc commented Apr 22, 2023

Why does it need to build the trie? There is (almost) no such logic in Chopsticks.

@liamaharon
Contributor

liamaharon commented Apr 24, 2023

> Why does it need to build the trie? There is (almost) no such logic in Chopsticks.

This is a good question. The trie may even be disadvantageous: it slows things down a lot, and it's unlikely that many keys will need to be cached in memory at any one time.

I'll start hacking on a simple storage layer that accesses storage from a node and stashes results in a flat in-memory LRU cache. No trie, just flat key-value.

If this works I'll try different workloads on it and see how it performs, both in terms of memory and speed.
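
For illustration, a minimal sketch of the kind of layer described above, assuming node access is abstracted behind a closure (the `LazyStorage` name and the closure-based fetcher are made up for this example, and eviction/LRU bookkeeping is left out):

```rust
use std::collections::HashMap;

/// Flat, lazy storage layer: look a key up locally first, otherwise ask the
/// (remote) node once and remember the answer. No trie involved.
struct LazyStorage<F> {
    cache: HashMap<Vec<u8>, Option<Vec<u8>>>,
    // How a value is actually fetched (e.g. a `state_getStorage` RPC call)
    // is abstracted behind this closure for the sake of the sketch.
    fetch: F,
}

impl<F> LazyStorage<F>
where
    F: FnMut(&[u8]) -> Option<Vec<u8>>,
{
    fn new(fetch: F) -> Self {
        Self { cache: HashMap::new(), fetch }
    }

    /// Return the value for `key`, hitting the node at most once per key.
    /// "Known to be absent" is cached as well, hence the `Option` values.
    fn get(&mut self, key: &[u8]) -> Option<Vec<u8>> {
        if let Some(cached) = self.cache.get(key) {
            return cached.clone();
        }
        let fetched = (self.fetch)(key);
        self.cache.insert(key.to_vec(), fetched.clone());
        fetched
    }
}

fn main() {
    // A fixed map stands in for the node here.
    let mut node = HashMap::new();
    node.insert(b"key".to_vec(), b"value".to_vec());

    let mut storage = LazyStorage::new(move |k: &[u8]| node.get(k).cloned());
    assert_eq!(storage.get(b"key"), Some(b"value".to_vec())); // fetched once
    assert_eq!(storage.get(b"key"), Some(b"value".to_vec())); // served from cache
    assert_eq!(storage.get(b"missing"), None); // misses are cached too
}
```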

@bkchr
Member

bkchr commented Apr 24, 2023

> I'll start hacking on a simple storage layer that accesses storage from a node and stashes results in a flat in-memory LRU cache. No trie, just flat key-value.

You can just use `OverlayedChanges`?

@arkpar
Member

arkpar commented Apr 24, 2023

> (existing snapshots created by `create-snapshot` still take 300+ seconds

If I'm reading the code correctly, this could be because key-values are inserted in the trie one at a time here:

`self.backend.insert(vec![(None, vec![(k, Some(v))])], self.state_version);`

If you pass multiple/all values at once, it should be much faster because the trie root would not need to be recalculated after each insertion.
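
To make the difference concrete, here is a toy illustration of why batching helps (the `ToyBackend` type is invented for this example and only counts root recomputations; the real cost sits in the trie backend that the line above calls into):

```rust
use std::collections::BTreeMap;

/// Toy stand-in for the trie backend: it keeps a sorted key-value map and
/// recomputes a "root" once per `insert` call, mirroring why per-key
/// insertion is so much slower than a single batched insertion.
struct ToyBackend {
    entries: BTreeMap<Vec<u8>, Vec<u8>>,
    root_recomputations: usize,
}

impl ToyBackend {
    fn new() -> Self {
        Self { entries: BTreeMap::new(), root_recomputations: 0 }
    }

    /// Accepts any number of key-value pairs, but pays the (here symbolic)
    /// root recomputation cost once per call.
    fn insert(&mut self, kvs: Vec<(Vec<u8>, Vec<u8>)>) {
        self.entries.extend(kvs);
        self.root_recomputations += 1; // the expensive part in the real trie
    }
}

fn main() {
    let kvs: Vec<_> = (0u32..1000)
        .map(|i| (i.to_le_bytes().to_vec(), vec![0u8; 32]))
        .collect();

    // One `insert` call per key: the root is recomputed 1000 times.
    let mut per_key = ToyBackend::new();
    for (k, v) in kvs.clone() {
        per_key.insert(vec![(k, v)]);
    }

    // One batched `insert` call: the root is recomputed once.
    let mut batched = ToyBackend::new();
    batched.insert(kvs);

    assert_eq!(per_key.root_recomputations, 1000);
    assert_eq!(batched.root_recomputations, 1);
}
```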

@liamaharon
Contributor

liamaharon commented Apr 25, 2023

> > (existing snapshots created by `create-snapshot` still take 300+ seconds
>
> If I'm reading the code correctly, this could be because key-values are inserted in the trie one at a time here:
>
> `self.backend.insert(vec![(None, vec![(k, Some(v))])], self.state_version);`
>
> If you pass multiple/all values at once, it should be much faster because the trie root would not need to be recalculated after each insertion.

I assumed they were being inserted one at a time for a reason. I'll try batch inserting; if it works it'll speed things up dramatically. Thanks for the advice.

Edit: this had a big impact. PR with before/after performance benchmarks: #14004

@kianenigma
Contributor Author

> Why does it need to build the trie?

Ideally, we want the try-runtime-enabled runtime to be able to operate in a normal mode. In that case, one requirement would be to be able to respond to the `storage::root` host function correctly, ergo using a trie.

If we remove this assumption, a whole lot of things become easier, and as @liamaharon pointed out in a comment, a flat key -> value map would suffice. We could achieve this by implementing `trait Externalities` for a new type that works mostly the same, is backed by a `HashMap`, and panics if you ask it to calculate the state root. As @bkchr said, this might very well just be `OverlayedChanges`.

> In "normal mode", Ext is always operating on a db, but I assume you want a db to cache your queries?

Not just for caching, but also as the first point of response. The scenario I have in mind is that you run try-runtime-cli with some sub-command, and instead of your state being an RPC endpoint or a snapshot file, you give the path to an existing (RocksDB/ParityDB) polkadot/substrate database; it would initialize an Ext based on that and execute the commands there.
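
As a rough illustration of that flat-map idea (the `FlatExt` name and its methods are invented for this sketch; the real change would implement the actual `Externalities` trait, or reuse `OverlayedChanges` as suggested above):

```rust
use std::collections::HashMap;

/// Trie-less "externalities": reads and writes go to a flat map, and asking
/// for a state root is simply refused. This is not the real
/// `sp_externalities::Externalities` trait (which has many more methods),
/// only the shape of the idea.
#[derive(Default)]
struct FlatExt {
    map: HashMap<Vec<u8>, Vec<u8>>,
}

impl FlatExt {
    fn set_storage(&mut self, key: Vec<u8>, value: Vec<u8>) {
        self.map.insert(key, value);
    }

    fn storage(&self, key: &[u8]) -> Option<Vec<u8>> {
        self.map.get(key).cloned()
    }

    /// There is no cheap answer to `storage::root` without a trie, so this
    /// variant refuses to serve it.
    #[allow(dead_code)]
    fn storage_root(&self) -> Vec<u8> {
        panic!("state root is not supported by the flat (trie-less) externalities")
    }
}

fn main() {
    let mut ext = FlatExt::default();
    ext.set_storage(b"key".to_vec(), b"value".to_vec());
    assert_eq!(ext.storage(b"key"), Some(b"value".to_vec()));
    // Calling `ext.storage_root()` here would panic, by design.
}
```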

@bkchr
Member

bkchr commented Apr 27, 2023

> and panics if you ask it to calculate the state root

That isn't required, you can just take all the key/value pairs and calculate the storage root :P No problem.

> Not just for caching, but also as the first point of response. The scenario I have in mind is that you run try-runtime-cli with some sub-command, and instead of your state being an RPC endpoint or a snapshot file, you give the path to an existing (RocksDB/ParityDB) polkadot/substrate database; it would initialize an Ext based on that and execute the commands there.

This is basically an enhanced `OverlayedChanges`. `OverlayedChanges` already caches writes and only goes to the state if the key has not yet been written. Basically, it should be enough for you to keep the `OverlayedChanges` around. It also already supports calculating a storage root etc.
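
For readers unfamiliar with the pattern, a toy version of the overlay-over-backend idea described here (this is not the actual `OverlayedChanges` API from sp-state-machine, just the principle):

```rust
use std::collections::HashMap;

/// Writes go into the overlay; reads check the overlay first and fall back
/// to the backend (a db, a snapshot, or an RPC-backed store) only on a miss.
struct Overlay<'a> {
    changes: HashMap<Vec<u8>, Option<Vec<u8>>>, // `None` = key deleted
    backend: &'a HashMap<Vec<u8>, Vec<u8>>,
}

impl<'a> Overlay<'a> {
    fn new(backend: &'a HashMap<Vec<u8>, Vec<u8>>) -> Self {
        Self { changes: HashMap::new(), backend }
    }

    fn set(&mut self, key: Vec<u8>, value: Option<Vec<u8>>) {
        self.changes.insert(key, value);
    }

    fn get(&self, key: &[u8]) -> Option<Vec<u8>> {
        match self.changes.get(key) {
            Some(overlayed) => overlayed.clone(),   // written (or deleted) locally
            None => self.backend.get(key).cloned(), // fall through to the backend
        }
    }
}

fn main() {
    let mut backend = HashMap::new();
    backend.insert(b"a".to_vec(), b"old".to_vec());

    let mut overlay = Overlay::new(&backend);
    assert_eq!(overlay.get(b"a"), Some(b"old".to_vec())); // read-through
    overlay.set(b"a".to_vec(), Some(b"new".to_vec()));
    assert_eq!(overlay.get(b"a"), Some(b"new".to_vec())); // overlay wins
    overlay.set(b"a".to_vec(), None);
    assert_eq!(overlay.get(b"a"), None);                  // deletion tracked
}
```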

@gpestana
Contributor

> instead of your state being an RPC endpoint or a snapshot file, you give the path to an existing (RocksDB/ParityDB) polkadot/substrate database; it would initialize an Ext based on that and execute the commands there.

FWIW, ➕ for this approach. It would be very useful for data gathering for analysis, and for replaying blocks with different state for testing purposes.

@liamaharon
Contributor

> > instead of your state being an RPC endpoint or a snapshot file, you give the path to an existing (RocksDB/ParityDB) polkadot/substrate database; it would initialize an Ext based on that and execute the commands there.
>
> FWIW, ➕ for this approach. It would be very useful for data gathering for analysis, and for replaying blocks with different state for testing purposes.

@gpestana are there cases where reading from a DB path would be helpful in ways that reading from a snapshot would not? Snapshots can be created from arbitrary block numbers and (shortly) will be able to be loaded almost instantly.

Update on this: I've had success writing directly to the ext `OverlayedChanges` (rather than the backend), which allows for close to instant snapshot loading. WIP PR: #14057

@liamaharon liamaharon changed the title try-runtime: Lazy Download try-runtime-cli: Lazy Download May 4, 2023
@liamaharon
Contributor

FYI, I have created a dedicated issue to track the progress of allowing db-backed externalities to be used with the try-runtime-cli. Let's keep this issue focused on lazy download.

Once a DB path option is introduced, I think lazy download is likely to be the preferred mode only when using a remote node and the state cannot fit in memory.

| Node Location | State Size | Preferred Mode |
| --- | --- | --- |
| local | reasonable | snapshot |
| remote | reasonable | snapshot |
| local | large | db-backed |
| remote | large | lazy-download |

Although we should definitely implement lazy download eventually, I wanted to note that it may not be as high priority as some other try-runtime-cli tasks, because most bases will be covered and I'm uncertain how prevalent the use case of a large remote node will be.

@juangirini juangirini removed the U2-some_time_soon Issue is worth doing soon. label May 23, 2023
@juangirini juangirini added the T1-runtime This PR/Issue is related to the topic “runtime”. label Jun 7, 2023
@crystalin
Contributor

Hi, I'm curious whether there is currently any progress on this.

@ggwpez
Member

ggwpez commented Jul 17, 2023

I think it's still a high-value issue, just currently resource-starved. We instead prioritized the versioned runtime upgrades to reduce the number of error sources when migrating.
Hopefully once that is done, @liamaharon (OOO) can pick it up or hand it off to someone.

@liamaharon
Contributor

liamaharon commented Jul 19, 2023

Yeah, I've made decent progress on this and have a working PoC. I'm planning to pick it up again once the higher-priority runtime upgrade stuff is shipped.

@crystalin FYI, I have actually run the PoC with Moonbeam state and runtime, testing a runtime migration, and it executed in just a few seconds :)

@crystalin
Contributor

That's awesome. I'll definitely help test it.

@liamaharon liamaharon removed their assignment Aug 2, 2023
@liamaharon
Contributor

Hey @crystalin, we've decided to deprioritise this for now to focus on some higher-priority features.

I have a PoC branch here, which you're welcome to pick up if you need the feature.

@liamaharon
Contributor

Moved: paritytech/try-runtime-cli#8
