feat: finish code for background flat storage creation #8053

Longarithm · 2022-11-15T15:08:25Z

Implement two remaining steps for background flat storage creation:

Fetching state

We split the state into several parts and fetch them one by one. The number of parts is derived from the state part size limit, which I set to 10 MiB. Fetching is executed in several steps so that we can save intermediate results. Currently each step includes 20 state parts. They are fetched on threads of a separate rayon pool, whose size is limited to 4 threads so that it doesn't affect block processing.
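The part and step arithmetic above can be sketched as follows. This is only an illustration under stated assumptions: the constants come from this description, while `num_parts`, `fetch_part`, `fetch_step` and `fetch_all` are hypothetical names, and plain scoped `std` threads stand in for the rayon pool so that the sketch has no dependencies.

```rust
use std::sync::atomic::{AtomicU64, Ordering};
use std::thread;

// Values taken from the PR description.
const STATE_PART_MEMORY_LIMIT: u64 = 10 * 1024 * 1024; // 10 MiB
const PARTS_PER_STEP: u64 = 20;
const NUM_WORKERS: u64 = 4;

/// Number of parts needed to cover `state_size` bytes (at least one).
fn num_parts(state_size: u64) -> u64 {
    ((state_size + STATE_PART_MEMORY_LIMIT - 1) / STATE_PART_MEMORY_LIMIT).max(1)
}

/// Hypothetical stand-in for reading one state part.
/// Returns the number of items "fetched" for that part.
fn fetch_part(part_id: u64) -> u64 {
    part_id % 3 + 1
}

/// Fetches parts `[start, end)` on at most `NUM_WORKERS` threads
/// and returns the total number of items fetched.
fn fetch_step(start: u64, end: u64) -> u64 {
    let fetched = AtomicU64::new(0);
    thread::scope(|s| {
        for worker in 0..NUM_WORKERS {
            let fetched = &fetched;
            s.spawn(move || {
                // Worker `w` handles parts start + w, start + w + NUM_WORKERS, ...
                let mut part_id = start + worker;
                while part_id < end {
                    fetched.fetch_add(fetch_part(part_id), Ordering::Relaxed);
                    part_id += NUM_WORKERS;
                }
            });
        }
    });
    fetched.load(Ordering::Relaxed)
}

/// Runs all steps in order and returns (number of steps, total items).
fn fetch_all(state_size: u64) -> (u64, u64) {
    let total_parts = num_parts(state_size);
    let (mut steps, mut items, mut start) = (0, 0, 0);
    while start < total_parts {
        let end = (start + PARTS_PER_STEP).min(total_parts);
        items += fetch_step(start, end);
        // A real implementation would persist the progress here,
        // so that a restart resumes from `end` instead of from 0.
        steps += 1;
        start = end;
    }
    (steps, items)
}
```

For example, a state of 45 parts would be fetched in three steps of 20, 20 and 5 parts.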

Here I also introduce FetchingStateStatus, which describes the current progress. It makes testing more convenient: for lightweight tests it is enough to fetch only one part, although in production we need thousands of parts.
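A rough sketch of how such a status could track progress is shown below. The field names and the `current_step` and `advance` helpers are assumptions made for illustration and are not taken from the diff.

```rust
/// Hypothetical shape of the progress record; field names are assumptions.
#[derive(Debug, Clone, PartialEq, Eq)]
struct FetchingStateStatus {
    /// First part that has not been fetched yet.
    part_id: u64,
    /// Number of parts fetched in one step.
    num_parts_in_step: u64,
    /// Total number of parts.
    num_parts: u64,
}

impl FetchingStateStatus {
    /// Half-open range of part ids covered by the current step.
    fn current_step(&self) -> std::ops::Range<u64> {
        let end = (self.part_id + self.num_parts_in_step).min(self.num_parts);
        self.part_id..end
    }

    /// Status after the current step, or `None` once all parts are fetched.
    fn advance(&self) -> Option<Self> {
        let next = self.current_step().end;
        if next >= self.num_parts {
            None
        } else {
            Some(Self { part_id: next, ..self.clone() })
        }
    }
}
```

With this shape, a lightweight test can start from a status with a single part, while a production run would use thousands of parts with 20 parts per step.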

After the state for a shard is fully fetched, we start catching up: we move the flat storage head forward and apply all saved deltas, processing at most 50 blocks at once.
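The catchup loop can be illustrated with the sketch below. It is a simplification under stated assumptions: the `Delta` type, `merge` and `catchup_step` are hypothetical, block heights index directly into a slice, and forks and skipped heights are ignored. Only the limit of 50 blocks comes from this description.

```rust
use std::collections::HashMap;

// Limit taken from the PR description.
const MAX_BLOCKS_PER_CATCHUP_STEP: u64 = 50;

/// Hypothetical delta: key -> new value, with `None` meaning deletion.
type Delta = HashMap<Vec<u8>, Option<Vec<u8>>>;

/// Merges `newer` into `merged`; later changes overwrite earlier ones.
fn merge(merged: &mut Delta, newer: &Delta) {
    for (key, value) in newer {
        merged.insert(key.clone(), value.clone());
    }
}

/// Moves the flat head from `flat_head` towards `final_head`, by at most
/// `MAX_BLOCKS_PER_CATCHUP_STEP` blocks. `deltas[h]` is the delta of the
/// block at height `h`. Returns the new head and the merged delta to apply.
fn catchup_step(flat_head: u64, final_head: u64, deltas: &[Delta]) -> (u64, Delta) {
    let new_head = final_head
        .min(flat_head + MAX_BLOCKS_PER_CATCHUP_STEP)
        .max(flat_head); // never move the head backwards
    let mut merged = Delta::new();
    for height in (flat_head + 1)..=new_head {
        merge(&mut merged, &deltas[height as usize]);
    }
    (new_head, merged)
}
```

Calling `catchup_step` repeatedly until the returned head equals the final head mirrors the loop described above; once the head has caught up, a further call returns an empty delta and leaves the head unchanged.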

After we have fully caught up, we finally create the flat storage state.

Testing

Complete scenario in test_flat_storage_creation.
https://buildkite.com/nearprotocol/nearcore-flat-state/builds/108

jakmeier

Looks solid to me, great to see this progress!

@mzhangmzz I think you should also take a look before we merge, feels like your expertise around the chain and flat state is required here.

@Longarithm Do you plan to add tests to this PR or will it be a follow-up? I would tend towards a separate follow-up PR but it's up to you.

chain/chain/src/flat_storage_creator.rs

core/store/src/columns.rs

chain/chain/src/flat_storage_creator.rs


Longarithm · 2022-11-16T20:50:32Z

Do you plan to add tests to this PR or will it be a follow-up? I would tend towards a separate follow-up PR but it's up to you.

Yeah, they will be in follow-up PRs.

mzhangmzz

The PR looks great! Thanks for the awesome work and sorry for the super late review.

Most of my comments are about adding more comments :) Let's try to make the code easier to read for other people and the future us. I approved it to unblock, but please address the comments.

mzhangmzz · 2022-11-18T23:31:51Z

chain/chain/src/flat_storage_creator.rs

+use near_store::Store;
+use near_store::{Trie, TrieDBStorage, TrieTraversalItem};
+use std::sync::atomic::AtomicU64;
+use std::sync::Arc;
 use tracing::debug;
 use tracing::info;



Maybe add a paragraph at the beginning of this file to describe at a high level how flat state is migrated, what the steps are, and how FlatStateCreator is used in the code. I know that you already have comments throughout the code, but I think it is still valuable to have an overview that links everything together and gives readers a general picture before they dive into the code.

mzhangmzz · 2022-11-18T23:39:08Z

chain/chain/src/flat_storage_creator.rs

+                    merged_delta.merge(delta.as_ref());
+                }
+
+                if old_flat_head != flat_head {


Is it possible that old_flat_head == flat_head == final_head?

Longarithm · 2022-11-21

In practice, no, because the final head should move forward at least once during the FetchingState step.
And even if that happens, it is fine to wait at this step until the final head moves forward.

mzhangmzz · 2022-11-18T23:41:37Z

chain/chain/src/flat_storage_creator.rs

+                        // If we reached chain final head, we can finish catchup and finally create flat storage.
+                        store_helper::finish_catchup(&mut store_update, shard_id);
+                        store_update.commit()?;
+                        debug!(target: "chain", %shard_id, %flat_head, %height, "Creating flat storage");


Is this debug statement intended? Should it print something like "flat storage done"?

Longarithm · 2022-11-21

Yeah, it is intended.
I agree that "Flat storage creation done" sounds better. I also changed debug! to info! in places where a major part of the work is finished, so users are aware of the status without much spam in the logs.

mzhangmzz · 2022-11-18T23:47:09Z

chain/chain/src/flat_storage_creator.rs

                    let mut store_update = chain_store.store().store_update();
                    store_helper::set_flat_head(&mut store_update, shard_id, &block_hash);
-                    store_helper::set_fetching_state_step(&mut store_update, shard_id, 0u64);
+                    store_helper::set_fetching_state_status(&mut store_update, shard_id, status);


Should we store the flat head here under a different prefix, with something like set_temporary_flat_head, to distinguish it from the case when the flat head is actually set and the flat state is ready to use? This way, the code cannot accidentally create a flat storage when the flat state is not ready.

Longarithm · 2022-11-21

We probably should. Let's do it separately, because this change is already quite big, and also write a unit test checking that flat storage can't be created accidentally.
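One possible reading of this suggestion, as a minimal sketch only: keep the in-progress head under its own key prefix and promote it once catchup has finished. Everything here is hypothetical, including the key prefixes, the `KvStore` alias (a plain `HashMap` standing in for the real store) and the function names other than `set_temporary_flat_head`, which is the name proposed in the comment.

```rust
use std::collections::HashMap;

// Hypothetical key prefixes; the names are assumptions.
const TEMPORARY_FLAT_HEAD: &str = "temporary_flat_head";
const FLAT_HEAD: &str = "flat_head";

/// Plain map standing in for the real key-value store.
type KvStore = HashMap<String, String>;

fn key(prefix: &str, shard_id: u64) -> String {
    format!("{prefix}:{shard_id}")
}

/// Records the head used while flat storage creation is still in progress.
fn set_temporary_flat_head(store: &mut KvStore, shard_id: u64, block_hash: &str) {
    store.insert(key(TEMPORARY_FLAT_HEAD, shard_id), block_hash.to_string());
}

/// Promotes the temporary head to the real one once catchup has finished.
/// Returns `false` if there was no temporary head to promote.
fn finish_creation(store: &mut KvStore, shard_id: u64) -> bool {
    match store.remove(&key(TEMPORARY_FLAT_HEAD, shard_id)) {
        Some(hash) => {
            store.insert(key(FLAT_HEAD, shard_id), hash);
            true
        }
        None => false,
    }
}

/// Returns the flat head only if flat storage is ready for this shard.
fn ready_flat_head(store: &KvStore, shard_id: u64) -> Option<&String> {
    store.get(&key(FLAT_HEAD, shard_id))
}
```

Because readers only ever look at the `flat_head` key, a shard whose creation is still in progress has no visible head, which is the property the proposed unit test would check.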


Longarithm self-assigned this Nov 15, 2022

finish code for flat storage creation

e5a4638

Longarithm force-pushed the fs-spawn branch from 3b1a669 to e5a4638 Compare November 15, 2022 15:18

return logger

ffa9d0e

Longarithm marked this pull request as ready for review November 15, 2022 15:28

Longarithm requested a review from a team as a code owner November 15, 2022 15:28

Longarithm requested review from matklad, jakmeier and mzhangmzz November 15, 2022 15:28

Longarithm changed the title ~~draft: finish code for background flat storage creation~~ feat: finish code for background flat storage creation Nov 15, 2022

matklad removed their request for review November 15, 2022 18:05

Looogarithm and others added 3 commits November 15, 2022 22:33

add final check

4b53093

add comments

a6c5f5b

Merge branch 'master' into fs-spawn

6bfa77c

jakmeier reviewed Nov 16, 2022


jakmeier mentioned this pull request Nov 16, 2022

Flat storage MVP #7327

Closed

26 tasks

Longarithm and others added 3 commits November 17, 2022 00:07

Update chain/chain/src/flat_storage_creator.rs

4c8a637

Co-authored-by: Jakob Meier <mail@jakobmeier.ch>

address comments

258046d

8 threads

615f1d7

mzhangmzz approved these changes Nov 18, 2022


Looogarithm added 2 commits November 21, 2022 23:24

apply suggestions

28fd5e5

Merge branch 'master' into fs-spawn

4d181a7

Longarithm merged commit e20a8f7 into master Nov 21, 2022

Longarithm deleted the fs-spawn branch November 21, 2022 19:46

nikurt pushed a commit that referenced this pull request Nov 22, 2022

feat: finish code for background flat storage creation (#8053)

53a2440

Co-authored-by: Jakob Meier <mail@jakobmeier.ch>

This was referenced Jan 18, 2023

DRAFT: parallel flat storage migration v2 #7901

Closed

DRAFT: parallel flat storage migration v3 #7933

Closed
