[Ledger] Replace LRU cache with a FIFO queue (circular buffer) #2893

ramtinms · 2022-07-29T16:10:14Z

This PR

replaces the LRU cache used by the forest with a FIFO queue implemented with a circular buffer.
implements an adjusted copy of the great buffer @fxamacker wrote for the checkpointer, with lookup and mutex functionality added.
updates the ledger so that it can still read trie removal entries from the WAL but ignores them, and no longer writes entries of this type going forward (backward compatibility with WALs from older versions).
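The backward-compatibility behaviour in the last point can be sketched roughly as below. This is only an illustration under assumptions: the `Record` type, the `Replay` function, and the numeric values of the operation tags are hypothetical and are not taken from the actual WAL code in this PR.

```go
package main

import "fmt"

// WALOp is a hypothetical operation tag for a WAL record.
type WALOp uint8

const (
	WALUpdate WALOp = 1 // trie update: still written and replayed
	WALDelete WALOp = 2 // trie removal: only present in WALs from older versions
)

// Record is a simplified, hypothetical WAL record.
type Record struct {
	Op       WALOp
	RootHash string
}

// Replay applies update records and reads past removal records without
// acting on them. It reports how many records were applied and skipped.
func Replay(records []Record, applyUpdate func(rootHash string) error) (applied, skipped int, err error) {
	for _, r := range records {
		switch r.Op {
		case WALUpdate:
			if applyErr := applyUpdate(r.RootHash); applyErr != nil {
				return applied, skipped, fmt.Errorf("cannot apply update %q: %w", r.RootHash, applyErr)
			}
			applied++
		case WALDelete:
			// Old WALs may still contain removals; accept them but do nothing.
			skipped++
		default:
			return applied, skipped, fmt.Errorf("unknown WAL operation %d", r.Op)
		}
	}
	return applied, skipped, nil
}

func main() {
	records := []Record{{WALUpdate, "a"}, {WALDelete, "a"}, {WALUpdate, "b"}}
	applied, skipped, err := Replay(records, func(string) error { return nil })
	fmt.Println(applied, skipped, err)
}
```

Keeping the removal case in the reader, while never producing it in the writer, is what lets old WAL files stay readable.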

But why?
The initial decision to use an LRU cache was rooted in a different design that was once considered for the ledger. Ideally, the ledger should only care about purging the oldest trie added to the buffer, and get-trie calls should not have any impact on which trie is evicted. This also brings the checkpointer and the forest into harmony in how they deal with trie updates.

Why was an LRU cache a bad idea? In the LRU version, scripts would change the order of the tries kept in the cache, which could result in losing the head trie we need to build blocks. In other words, someone could send a lot of scripts targeting older tries in the hope of purging a necessary trie from memory and halting execution.
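To make the difference concrete, here is a minimal sketch of a FIFO cache built on a circular buffer with a map for lookup. It is not the code from this PR: `FIFOCache` and its fields are hypothetical, and strings stand in for tries to keep the example self-contained. The point it illustrates is that reads never change the eviction order.

```go
package main

import (
	"fmt"
	"sync"
)

// FIFOCache is a hypothetical fixed-capacity cache: a circular buffer fixes
// the eviction order and a map gives O(1) lookup by key.
type FIFOCache struct {
	lock   sync.RWMutex
	keys   []string          // circular buffer of keys in insertion order
	values map[string]string // key -> value lookup
	tail   int               // index of the oldest entry
	count  int               // number of stored entries
}

func NewFIFOCache(capacity int) *FIFOCache {
	if capacity < 1 {
		capacity = 1 // keep the modulo arithmetic below well defined
	}
	return &FIFOCache{
		keys:   make([]string, capacity),
		values: make(map[string]string, capacity),
	}
}

// Get only reads; it never changes which entry is evicted next.
func (c *FIFOCache) Get(key string) (string, bool) {
	c.lock.RLock()
	defer c.lock.RUnlock()
	v, ok := c.values[key]
	return v, ok
}

// Push adds an entry and, when the buffer is full, evicts the oldest one.
func (c *FIFOCache) Push(key, value string) {
	c.lock.Lock()
	defer c.lock.Unlock()
	if _, exists := c.values[key]; exists {
		return // already cached; keep its original position
	}
	if c.count == len(c.keys) {
		delete(c.values, c.keys[c.tail])
		c.tail = (c.tail + 1) % len(c.keys)
		c.count--
	}
	head := (c.tail + c.count) % len(c.keys)
	c.keys[head] = key
	c.values[key] = value
	c.count++
}

func main() {
	c := NewFIFOCache(2)
	c.Push("trie1", "old")
	c.Push("trie2", "head")
	for i := 0; i < 1000; i++ {
		c.Get("trie1") // many reads of the older entry
	}
	c.Push("trie3", "new head")
	_, hasOld := c.Get("trie1")
	_, hasHead := c.Get("trie2")
	fmt.Println(hasOld, hasHead) // false true
}
```

With an LRU policy the repeated reads of `trie1` would have made `trie2` the eviction candidate instead; with FIFO the oldest insertion is always the one that goes.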

fxamacker

Great work and amazing how insanely fast you created this PR, right after talking about it!

Some suggestions:

Replace sync.Mutex with sync.RWMutex in buffer.
Add forest.PurgeCacheExcept() test in forest_test.go.
Replace references of LRU cache to FIFO queue in docs in ledger.go and forest.go.

The first item is also in the code comments, but items 2 and 3 are listed only here because the relevant code is not modified by this PR.

Also, maybe rename Buffer to be more specific if you'd like. I don't have a better name for it 😆

ledger/complete/mtrie/forest.go

ledger/complete/mtrie/buffer.go

ledger/complete/mtrie/trieCache.go

SaveTheRbtz · 2022-07-29T20:57:57Z

ledger/complete/mtrie/trieCache.go

+
+	tries := make([]*trie.MTrie, tc.count)
+
+	if tc.isFull() {


Just a random thought: if we used a doubly linked list, would the logic here be a bit simpler?

An array is much cheaper than a linked list.

Agree! That said, given that we use a map for O(1) access, it should not affect perf too much (especially given how infrequently we should update the forest).

There is going to be a follow-up PR for improvements; I'm going to consider your suggestions in that PR.
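For context on the trade-off discussed in this thread, the sketch below shows the kind of wrap-around handling an array-backed circular buffer needs when returning its contents from oldest to newest. It is an assumption-laden illustration, not the PR's implementation: `Ring`, `NewRing`, `Push`, and `Snapshot` are hypothetical names, and strings stand in for `*trie.MTrie`.

```go
package main

import "fmt"

// Ring is a hypothetical fixed-size circular buffer.
type Ring struct {
	items []string
	tail  int // index of the oldest item
	count int // number of stored items
}

func NewRing(capacity int) *Ring {
	if capacity < 1 {
		capacity = 1 // avoid modulo by zero
	}
	return &Ring{items: make([]string, capacity)}
}

// Push appends an item; when the ring is full it overwrites the oldest one.
func (r *Ring) Push(s string) {
	if r.count == len(r.items) {
		r.items[r.tail] = s
		r.tail = (r.tail + 1) % len(r.items)
		return
	}
	r.items[(r.tail+r.count)%len(r.items)] = s
	r.count++
}

// Snapshot returns the stored items ordered from oldest to newest.
func (r *Ring) Snapshot() []string {
	out := make([]string, r.count)
	end := r.tail + r.count
	if end > len(r.items) {
		end = len(r.items)
	}
	// First copy: from the oldest item up to the end of the backing array.
	n := copy(out, r.items[r.tail:end])
	// Second copy: the part that wrapped around to the front, if any.
	copy(out[n:], r.items[:r.count-n])
	return out
}

func main() {
	r := NewRing(3)
	for _, s := range []string{"a", "b", "c", "d"} {
		r.Push(s)
	}
	fmt.Println(r.Snapshot()) // [b c d]
}
```

A linked list would avoid the two-part copy, at the cost of one allocation and extra pointers per element; the array keeps everything in one contiguous allocation.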

ledger/complete/mtrie/trieCache.go

m4ksio · 2022-07-29T21:21:27Z

ledger/complete/checkpoint_benchmark_test.go

@@ -145,7 +145,6 @@ func BenchmarkLoadCheckpointAndWALs(b *testing.B) {
 			return err
 		},
 		func(rootHash ledger.RootHash) error {
-			forest.RemoveTrie(rootHash)


With the removal of removals, maybe we can get rid of it completely: remove WALDelete (and WALUpdate, since we now have only one operation) and all related code.

I think we need it for reading old WALs when we spork; we could get rid of it after the next spork.

ledger/complete/mtrie/forest_test.go

ledger/complete/mtrie/trieCache.go

ledger/complete/mtrie/trieCache_test.go

ramtinms · 2022-07-29T23:13:37Z

ledger/complete/ledger_test.go

-		// test deletion
-		s := led2.ForestSize()
-		assert.Equal(t, s, size)
-


We don't do removals anymore, so this part is no longer relevant.

taabodim · 2022-07-31T00:50:29Z

ledger/complete/mtrie/trieCache.go

+	defer tc.lock.RUnlock()
+
+	if tc.count == 0 {
+		return nil


Suggestion: if you return an empty slice, it would make the caller's job easier. They won't have to check for nil anymore.

Hey @taabodim, we are going to have a follow-up PR for improvements and will include your suggestions in that PR.
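As background for this suggestion: in Go, `len` and `range` treat a nil slice and an empty slice the same way, so the difference shows up mainly in explicit nil comparisons and in encoders such as `encoding/json`. The sketch below uses two hypothetical helper functions, not code from the PR, to show where the two differ.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// itemsOrNil returns nil when there is nothing to return.
func itemsOrNil(count int) []string {
	if count == 0 {
		return nil
	}
	return make([]string, count)
}

// itemsOrEmpty always returns a non-nil slice, possibly of length zero.
func itemsOrEmpty(count int) []string {
	return make([]string, count)
}

func main() {
	a, b := itemsOrNil(0), itemsOrEmpty(0)

	fmt.Println(len(a), len(b))     // 0 0
	fmt.Println(a == nil, b == nil) // true false

	ja, _ := json.Marshal(a)
	jb, _ := json.Marshal(b)
	fmt.Println(string(ja), string(jb)) // null []
}
```

Returning the empty slice removes one special case for callers that compare against nil or serialise the result.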

ramtinms added 2 commits July 29, 2022 08:59

replace lru cache with fifo buffer

955c1f5

no need to capture trie remove wals

eac058c

ramtinms requested review from m4ksio and AlexHentschel as code owners July 29, 2022 16:10

ramtinms requested a review from fxamacker July 29, 2022 16:10

ramtinms changed the title ~~[Ledger] Replace LRU cache with a FIFO circular buffer~~ [Ledger] Replace LRU cache with a FIFO queue (circular buffer) Jul 29, 2022

fxamacker reviewed Jul 29, 2022

ramtinms added 4 commits July 29, 2022 13:15

Applying PR's comment

0f18591

rename files

9079607

doc update

46c81ba

add test for PurgeCacheExcept

8fb31d4

ramtinms requested a review from fxamacker July 29, 2022 20:29

SaveTheRbtz reviewed Jul 29, 2022

ledger/complete/mtrie/trieCache.go

SaveTheRbtz reviewed Jul 29, 2022

SaveTheRbtz requested a review from pattyshack July 29, 2022 20:58

pattyshack reviewed Jul 29, 2022

ledger/complete/mtrie/trieCache.go Outdated

ledger/complete/mtrie/trieCache.go Outdated

m4ksio approved these changes Jul 29, 2022

apply PR's comments

c36ddc3

fxamacker approved these changes Jul 29, 2022

ramtinms added 4 commits July 29, 2022 15:05

add more tests for purge

c1dc91f

add a simple test for concurrent access

02fc84c

code optimization

a9efa31

remove unrelavant tests

77f4e55

ramtinms commented Jul 29, 2022

fxamacker mentioned this pull request Jul 30, 2022

[EN Performance] Reuse ledger state for about -200GB peak RAM, -160GB disk i/o, and about -32 minutes duration #2792

Merged

14 tasks

taabodim reviewed Jul 31, 2022

ramtinms merged commit ec132df into fxamacker/reuse-mtrie-state-for-checkpointing-2 Aug 2, 2022

ramtinms deleted the ramtin/replace-ledger-forest-lru-cache branch August 2, 2022 16:11
