Update badger sync settings to optimize memory usage during hypersync #636
Conversation
lib/server.go
func (srv *Server) updateDbOpts(opts badger.Options) {
	// Make sure that a mempool process doesn't try to access the DB while we're closing and re-opening it.
	srv.mempool.mtx.RLock()
	defer srv.mempool.mtx.RUnlock()
You may actually want Lock() here, not RLock(). RLock() is non-exclusive, meaning that other threads could still be reading the mempool db (but not writing to it). If you do Lock(), it kicks out readers in addition to writers.
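(For illustration, a minimal sketch with sync.RWMutex, not code from this PR: RLock() only excludes writers, while Lock() excludes readers as well, which is what you want while the handle is being swapped out.)

package lib

import "sync"

// Illustrative only: readers take RLock() and can proceed concurrently,
// so a swap that only takes RLock() would not keep them out.
type store struct {
	mtx sync.RWMutex
	db  map[string]string
}

func (s *store) get(key string) string {
	s.mtx.RLock() // shared: other readers may hold RLock() at the same time
	defer s.mtx.RUnlock()
	return s.db[key]
}

func (s *store) swap(newDB map[string]string) {
	s.mtx.Lock() // exclusive: waits for all readers and writers to drain
	defer s.mtx.Unlock()
	s.db = newDB
}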
	defer srv.mempool.mtx.RUnlock()
	// Make sure that a server process doesn't try to access the DB while we're closing and re-opening it.
	srv.DbMutex.Lock()
	defer srv.DbMutex.Unlock()
Hmm.. I'm a bit confused. I think DbMutex is only used here, which means that if anything else has a handle on the db then it won't exclude it from reading from that handle. So does this really do anything? It seems like the only thing it prevents is calling updateDbOpts in multiple threads, but nothing else.
This is utilized in the backend in a few recurring jobs, such as the hot feed routine.
Hmm, I see it's used in the backend.
	srv.snapshot.mainDb = srv.blockchain.db
	srv.mempool.bc.db = srv.blockchain.db
	srv.mempool.backupUniversalUtxoView.Handle = srv.blockchain.db
	srv.mempool.universalUtxoView.Handle = srv.blockchain.db
This is dirty but I'm trying to think about how we would clean it up. We could have a wrapper on the DB that this thing can update... but that's dirty too. Ideally you could change the db without having to update all of these handles manually... I'm noodling.
Agreed, not exactly a pretty solution, but the only other option I could think of was the wrapper struct, which you've mentioned.
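For posterity, a rough sketch of the wrapper idea (hypothetical names, assuming badger v3; not code from this repo). Every component would hold the wrapper and call Get() rather than caching its own *badger.DB, so the swap happens in exactly one place:

package lib

import (
	"sync"

	"github.com/dgraph-io/badger/v3"
)

// DBHandle is a hypothetical wrapper that owns the badger handle.
type DBHandle struct {
	mtx sync.RWMutex
	db  *badger.DB
}

// Get returns the current handle; callers should not cache the result.
func (h *DBHandle) Get() *badger.DB {
	h.mtx.RLock()
	defer h.mtx.RUnlock()
	return h.db
}

// Swap closes the current handle and reopens badger with new options,
// so no caller is left holding a stale pointer.
func (h *DBHandle) Swap(opts badger.Options) error {
	h.mtx.Lock()
	defer h.mtx.Unlock()
	if err := h.db.Close(); err != nil {
		return err
	}
	newDB, err := badger.Open(opts)
	if err != nil {
		return err
	}
	h.db = newDB
	return nil
}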
	// DbMutex protects the badger database from concurrent access when it's being closed & re-opened.
	// This is necessary because the database is closed & re-opened when the node finishes hypersyncing in order
	// to change the database options from Default options to Performance options.
	DbMutex deadlock.Mutex
So we're doing a couple of things here:
1. We reduce the hypersync queue size. Seems like a strict win.
2. We swap the db options on badger after we finish syncing. This one creates some messiness.
Is (2) a significant improvement to memory usage? Like, if we merge (1) but not (2), does memory usage go above 32GB?
(2) does win us a substantial amount of memory. The intense I/O requirements of hypersync, combined with our very high performance settings, result in a massive amount of memory usage. To stay reliably under 32GB of usage, we do need to be using the default settings during hypersync.
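For context, the swap is roughly this shape (a sketch, not the exact code in the PR; performanceOptions and the tuning values are placeholders for whatever the node normally runs with, assuming badger v3):

package lib

import "github.com/dgraph-io/badger/v3"

// performanceOptions is a hypothetical stand-in for the node's tuned,
// memory-hungry settings; the real values used by the node may differ.
func performanceOptions(dir string) badger.Options {
	return badger.DefaultOptions(dir).
		WithNumMemtables(16).
		WithBlockCacheSize(1 << 30)
}

// switchToPerformanceSettings closes the handle that was opened with
// badger.DefaultOptions(dir) for hypersync and reopens it with the
// tuned settings for normal operation.
func switchToPerformanceSettings(dir string, db *badger.DB) (*badger.DB, error) {
	if err := db.Close(); err != nil {
		return nil, err
	}
	return badger.Open(performanceOptions(dir))
}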
An alternate approach here would be to fix the way we store utxo ops in the backend db. My hunch is that if we changed that index to store only a single utxo op per record, we would be able to run the default settings during block sync as well, meaning we could avoid the ugliness of stopping and restarting badger to switch settings. This would involve moving the block and transaction into the key of that index, rather than having a single record that contains a bundle with every op for every transaction in a block.
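A rough sketch of the key layout that suggestion implies (the prefix, widths, and encoding here are hypothetical, not the repo's actual index format): one key per op instead of one bundled record per block.

package lib

import "encoding/binary"

// utxoOpKey builds a hypothetical per-op key:
//   <prefix><blockHash><txnIndex><opIndex> -> one encoded UtxoOperation
// versus today's rough layout of
//   <prefix><blockHash> -> bundle of every op for every txn in the block.
func utxoOpKey(prefix byte, blockHash [32]byte, txnIndex uint32, opIndex uint32) []byte {
	key := make([]byte, 0, 1+32+4+4)
	key = append(key, prefix)
	key = append(key, blockHash[:]...)
	var idx [4]byte
	binary.BigEndian.PutUint32(idx[:], txnIndex)
	key = append(key, idx[:]...)
	binary.BigEndian.PutUint32(idx[:], opIndex)
	key = append(key, idx[:]...)
	return key
}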
I didn't run it locally, but the overall change looks good with the commentary that I added.
Update hypersync to use default badger settings, switch to performance settings once hypersync completes