Implement FlatStorageState #7663

mzhangmzz · 2022-09-22T04:17:03Z

This PR implements FlatStorageState, the struct that manages deltas in flat storage. After this PR, the main implementation of flat storage is mostly ready. The next step is to add more tests.

jakmeier

Looks good to me! Your choice if you want to wait for Aleksandr to be back and have a look as well or if you want to merge before that. :)

jakmeier · 2022-09-23T07:49:46Z

chain/chain/src/chain.rs

                shard_id,
+                store.head().unwrap().height,


Is there a reason this changed from ? to .unwrap()?

Ah good call, changed back.
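To illustrate why the reviewer flagged this, here is a minimal sketch of the difference between propagating an error with `?` and panicking with `.unwrap()`. `Store`, `StoreError`, and both helper functions are hypothetical stand-ins, not the types used in chain.rs.

```rust
// Hypothetical error type; the real code uses the chain's own error type.
#[derive(Debug, PartialEq)]
struct StoreError(String);

// Hypothetical stand-in for a chain store that may or may not have a head.
struct Store {
    head_height: Option<u64>,
}

impl Store {
    fn head(&self) -> Result<u64, StoreError> {
        self.head_height
            .ok_or_else(|| StoreError("head not found".to_string()))
    }
}

// With `?`, a missing head becomes an `Err` that the caller can handle.
fn height_with_question_mark(store: &Store) -> Result<u64, StoreError> {
    let height = store.head()?;
    Ok(height)
}

// With `.unwrap()`, a missing head panics and unwinds the current thread.
fn height_with_unwrap(store: &Store) -> u64 {
    store.head().unwrap()
}
```

In a function that already returns a `Result`, `?` keeps the failure recoverable, which is presumably why the change was reverted.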

jakmeier · 2022-09-23T08:23:35Z

chain/chain/src/chain.rs

                // Right now, we don't implement flat storage for catchup, so we only store
                // the delta if we are not catching up
-                if !is_catching_up {
-                    if let Some(chain_flat_storage) =
-                        self.runtime_adapter.get_flat_storage_state_for_shard(shard_id)
-                    {
-                        let delta = FlatStateDelta::from_state_changes(
-                            &apply_result.trie_changes.state_changes(),
-                        );
-                        let store_update = chain_flat_storage.add_delta(&block_hash, delta)?;
-                        self.chain_store_update.merge(store_update);
-                    }
-                }
+                self.save_flat_state_changes(


Is the comment about catchup still relevant here?

Yes, I'll move it inside save_flat_state_changes.

Longarithm · 2022-09-26T19:20:23Z

core/store/src/flat_state.rs

+        let mut chain = MockChain::linear_chain(10);
+        let store = create_test_store();
+        let mut store_update = store.store_update();
+        store_helper::set_flat_head(&mut store_update, 0, &chain.get_block_hash(0));


Can we check in set_flat_head that the flat state head has not been set previously, or add a comment that it must be called only once? Though I am not sure how that would work with the catchup logic.

Adding comment here because set_flat_head is unchanged in the PR.

I'm not sure I understand what you are referring to. set_flat_head can be called many times; it is called every time flat_head is updated. Are you worried that the function set_flat_head can be arbitrarily called in the code, not through update_flat_head?

> Are you worried that the function set_flat_head can be arbitrarily called in the code, not through update_flat_head?

Yeah, that's right. I'm thinking about making set_flat_head private, or explicitly saying that it must be called only inside update_flat_head or on initialization.

I made it private and added another function set_flat_storage_state_for_genesis
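The resolution of this thread can be sketched with Rust module privacy. This is only an illustration under assumptions: the module layout, `StoreUpdate`, the type aliases, the error type, and the function bodies are hypothetical. Only the names `set_flat_head`, `update_flat_head`, and `set_flat_storage_state_for_genesis` come from the discussion, and the "already set" check is an assumption inspired by the review comment rather than confirmed behavior.

```rust
mod store_helper {
    use std::collections::HashMap;

    pub type BlockHash = [u8; 32];
    pub type ShardId = u64;

    // Hypothetical in-memory stand-in for a store update.
    #[derive(Default)]
    pub struct StoreUpdate {
        flat_heads: HashMap<ShardId, BlockHash>,
    }

    impl StoreUpdate {
        pub fn flat_head(&self, shard_id: ShardId) -> Option<&BlockHash> {
            self.flat_heads.get(&shard_id)
        }
    }

    // Private: code outside this module cannot call it directly.
    fn set_flat_head(update: &mut StoreUpdate, shard_id: ShardId, hash: &BlockHash) {
        update.flat_heads.insert(shard_id, *hash);
    }

    // Initialization entry point. Rejecting a second call is an assumption.
    pub fn set_flat_storage_state_for_genesis(
        update: &mut StoreUpdate,
        shard_id: ShardId,
        genesis_hash: &BlockHash,
    ) -> Result<(), String> {
        if update.flat_heads.contains_key(&shard_id) {
            return Err(format!("flat head already set for shard {}", shard_id));
        }
        set_flat_head(update, shard_id, genesis_hash);
        Ok(())
    }

    // Regular entry point: moves an existing head forward.
    pub fn update_flat_head(
        update: &mut StoreUpdate,
        shard_id: ShardId,
        new_head: &BlockHash,
    ) -> Result<(), String> {
        if !update.flat_heads.contains_key(&shard_id) {
            return Err(format!("flat head not initialized for shard {}", shard_id));
        }
        set_flat_head(update, shard_id, new_head);
        Ok(())
    }
}
```

With this layout, a call such as `store_helper::set_flat_head(...)` from another module fails to compile, so every write goes through one of the two public functions.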

robin-near · 2022-09-26T23:40:57Z

core/store/src/flat_state.rs

-    // TODO (#7327): implement garbage collection of old deltas.
-    // TODO (#7327): cache deltas to speed up multiple DB reads.
+    /// Get deltas between blocks `target_block_hash`(inclusive) to flat head(inclusive),
+    /// in backwards chain order. Returns an error if there is no path between these two them.


I think it would be helpful to say what "backwards chain order" means; it's not clear what "chain order" exactly refers to.
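One possible reading of "backwards chain order", sketched under assumptions: start at the target block and follow parent links until the flat head is reached, so the newest delta comes first. All types and field names below are hypothetical and simplified (block hashes are plain integers), and including both endpoints follows the doc comment quoted above rather than any verified behavior.

```rust
use std::collections::HashMap;

type BlockHash = u64; // simplified stand-in for a real hash type

#[derive(Clone, Debug, PartialEq)]
struct Delta(&'static str);

struct BlockInfo {
    prev_hash: BlockHash,
}

struct FlatStorageSketch {
    flat_head: BlockHash,
    blocks: HashMap<BlockHash, BlockInfo>,
    deltas: HashMap<BlockHash, Delta>,
}

impl FlatStorageSketch {
    /// Collects deltas from `target` back to the flat head, newest first.
    fn deltas_between(&self, target: BlockHash) -> Result<Vec<Delta>, String> {
        let mut result = Vec::new();
        let mut current = target;
        loop {
            let delta = self
                .deltas
                .get(&current)
                .ok_or_else(|| format!("no delta for block {}", current))?;
            result.push(delta.clone());
            if current == self.flat_head {
                return Ok(result);
            }
            // Step to the parent; a missing parent means no path to the head.
            let info = self
                .blocks
                .get(&current)
                .ok_or_else(|| format!("no path from block {} to flat head", target))?;
            current = info.prev_hash;
        }
    }
}
```

For a chain 1 <- 2 <- 3 with flat head 1, asking for block 3 yields the deltas of blocks 3, 2, 1 in that order, which is the opposite of the order in which the blocks were produced.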

robin-near · 2022-09-26T23:44:10Z

core/store/src/flat_state.rs

@@ -392,38 +413,41 @@ struct FlatStorageStateInner {
    /// State deltas for all blocks supported by this flat storage.
    /// All these deltas here are stored on disk too.
    #[allow(unused)]
-    deltas: HashMap<CryptoHash, FlatStateDelta>,
+    deltas: HashMap<CryptoHash, Arc<FlatStateDelta>>,


Should we combine blocks and deltas into a single hash map, so we don't need to look up twice?
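A rough sketch of what the combined map could look like, assuming a per-block entry that holds both the block metadata and its delta. `BlockEntry`, `BlockInfo`, `Delta`, and `CombinedSketch` are hypothetical names, not part of the PR.

```rust
use std::collections::HashMap;
use std::sync::Arc;

type BlockHash = u64; // simplified stand-in for CryptoHash

struct BlockInfo {
    prev_hash: BlockHash,
}

// Hypothetical delta: key -> optional new value.
struct Delta(Vec<(Vec<u8>, Option<Vec<u8>>)>);

// One entry per block, so a single lookup yields both pieces of data.
struct BlockEntry {
    info: BlockInfo,
    delta: Arc<Delta>,
}

struct CombinedSketch {
    entries: HashMap<BlockHash, BlockEntry>,
}

impl CombinedSketch {
    /// Returns the parent hash and the delta of `hash` with one map access.
    fn step_back(&self, hash: &BlockHash) -> Option<(BlockHash, Arc<Delta>)> {
        let entry = self.entries.get(hash)?;
        Some((entry.info.prev_hash, Arc::clone(&entry.delta)))
    }
}
```

The trade-off is that the two pieces of data can no longer be inserted or dropped independently, which may or may not matter for the actual implementation.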

robin-near · 2022-09-26T23:48:57Z

core/store/src/flat_state.rs

-    pub fn merge(&mut self, other: Self) {
-        self.0.extend(other.0)
+    pub fn merge(&mut self, other: &Self) {
+        self.0.extend(other.0.iter().map(|(k, v)| (k.clone(), v.clone())))


nit: other.0.iter().cloned() should work?
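A note on this nit, assuming the inner field is a `HashMap` (the PR excerpt does not show its type): `HashMap::iter` yields `(&K, &V)` tuples, while `Iterator::cloned` requires items of the form `&T`, so `other.0.iter().cloned()` would not compile in that case. The sketch below shows two variants that do work; `ValueRef` and the key and value types are simplified placeholders.

```rust
use std::collections::HashMap;

// Hypothetical placeholder for the real value reference type.
#[derive(Clone, Debug, PartialEq)]
struct ValueRef(u32);

#[derive(Clone, Debug, Default, PartialEq)]
struct FlatStateDelta(HashMap<Vec<u8>, Option<ValueRef>>);

impl FlatStateDelta {
    /// Variant from the diff: clone each key and value while extending.
    fn merge(&mut self, other: &Self) {
        self.0
            .extend(other.0.iter().map(|(k, v)| (k.clone(), v.clone())));
    }

    /// Shorter alternative: clone the whole map, then extend by value.
    fn merge_via_clone(&mut self, other: &Self) {
        self.0.extend(other.0.clone());
    }
}
```

Both variants let entries from `other` overwrite entries with the same key in `self`. The second allocates a temporary map, so the first may be preferable for large deltas.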

robin-near · 2022-09-26T23:54:10Z

core/store/src/flat_state.rs

@@ -81,7 +100,7 @@ mod imp {
        /// could charge users for the value length before loading the value.
        // TODO (#7327): support different roots (or block hashes).
        // TODO (#7327): consider inlining small values, so we could use only one db access.
-        pub fn get_ref(&self, key: &[u8]) -> Result<Option<ValueRef>, StorageError> {
+        pub fn get_ref(&self, key: &[u8]) -> Result<Option<ValueRef>, crate::StorageError> {
            // Take deltas ordered from `self.block_hash` to flat state head.
            // In other words, order of deltas is the opposite of the order of blocks in chain.
            let deltas = self.flat_storage_state.get_deltas_between_blocks(&self.block_hash)?;


Will we move this to the (maybe lazy) initialization of FlatState? I don't imagine computing this for every single key would be very efficient.

Yes, but flat_head could change during the lifetime of FlatState, so we can't remove the need for calling get_deltas_between_blocks altogether. One optimization we could do is to store the value of flat_head and the path of deltas from the last call to get_deltas_between_blocks, and recompute the path only if flat_head has changed. I'll create a JIRA issue to track this.

I see, thanks. As general feedback, I find the names a bit overloaded. Maybe some documentation on how these structures differ from and relate to each other would be helpful. For example, it's almost impossible to tell the difference between FlatStorageState and FlatState just by looking at the names (it's still fuzzy to me even after reading the code).
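The caching idea proposed earlier in this thread could look roughly like the following. This is a sketch under assumptions, not the planned implementation: `FlatStateSketch`, `CachedPath`, and the `recompute` callback (standing in for get_deltas_between_blocks) are hypothetical.

```rust
type BlockHash = u64; // simplified stand-in for a real hash type

#[derive(Clone, Debug, PartialEq)]
struct Delta(u32);

// Path of deltas remembered together with the flat head it was computed for.
struct CachedPath {
    flat_head: BlockHash,
    deltas: Vec<Delta>,
}

struct FlatStateSketch {
    block_hash: BlockHash,
    cache: Option<CachedPath>,
}

impl FlatStateSketch {
    fn new(block_hash: BlockHash) -> Self {
        Self { block_hash, cache: None }
    }

    /// Returns the cached path, recomputing only if the flat head moved.
    fn deltas<F>(&mut self, current_flat_head: BlockHash, recompute: F) -> Result<&[Delta], String>
    where
        F: FnOnce(BlockHash) -> Result<Vec<Delta>, String>,
    {
        let stale = match &self.cache {
            Some(cached) => cached.flat_head != current_flat_head,
            None => true,
        };
        if stale {
            let deltas = recompute(self.block_hash)?;
            self.cache = Some(CachedPath { flat_head: current_flat_head, deltas });
        }
        Ok(self.cache.as_ref().expect("cache populated above").deltas.as_slice())
    }
}
```

Repeated key lookups against an unchanged flat head then reuse the stored path instead of walking the chain again.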

Min Zhang added 3 commits September 22, 2022 00:16

draft implementation of flatstoragestate

af8a092

fix a compilation error

87a3afe

fix tests

329576a

mzhangmzz marked this pull request as ready for review September 22, 2022 20:57

mzhangmzz requested a review from a team as a code owner September 22, 2022 20:57

mzhangmzz requested review from matklad, Longarithm, jakmeier and akhi3030 and removed request for matklad September 22, 2022 20:57

add comments to test and test flat state as well. Also fixed get_deltas_between_blocks

1c573b9

jakmeier approved these changes Sep 23, 2022

Longarithm approved these changes Sep 26, 2022

Longarithm reviewed Sep 26, 2022

mzhangmzz and others added 4 commits September 26, 2022 17:05

Merge branch 'master' into flat_storage_state

bb6e503

address comments

c067ff5

Merge branch 'master' into flat_storage_state

e4d8105

make set_flat_head private

fc59577

mzhangmzz added the S-automerge label Sep 26, 2022

fix compilcation

73465f0

near-bulldozer bot merged commit 55a3b4d into master Sep 26, 2022

near-bulldozer bot deleted the flat_storage_state branch September 26, 2022 22:27

robin-near reviewed Sep 26, 2022
