ethdb/pebble: switch to increasing level sizes #30602
Merged
We've been using 2MB database files ever since the LevelDB era. We've occasionally attempted to change them to something different, but compaction-induced db writes always blew disk IO up tenfold.

A lot of time has passed since then, however, and nowadays we're using a completely different data scheme (path-based vs. hash-based), which has much more localized writes. Out of curiosity, I ran a benchmark with the levels switched back to exponentially increasing sizes.
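For reference, the knob in question is Pebble's per-level `TargetFileSize`, set through `pebble.Options.Levels` when the database is opened. The sketch below shows what "exponentially increasing levels" means in practice; the concrete sizes, level count, filter settings and path are illustrative assumptions, not a copy of the actual diff.

```go
package main

import (
	"log"

	"github.com/cockroachdb/pebble"
	"github.com/cockroachdb/pebble/bloom"
)

func main() {
	opts := &pebble.Options{
		// Previously every level targeted the same 2MB files (a LevelDB-era
		// default). Here the target file size doubles with each deeper level,
		// so the bulk of the data ends up in far fewer, much larger files.
		Levels: []pebble.LevelOptions{
			{TargetFileSize: 2 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 4 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 8 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 16 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 32 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 64 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
			{TargetFileSize: 128 * 1024 * 1024, FilterPolicy: bloom.FilterPolicy(10)},
		},
	}
	db, err := pebble.Open("/tmp/chaindata-demo", opts) // placeholder path
	if err != nil {
		log.Fatal(err)
	}
	defer db.Close()
}
```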
Naturally, the obvious effect is the number of files:
- `master` branch, with 2MB files across all levels, has about 160K data files in the datastore.
- `pr` branch has about 9.2K data files in the datastore.
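A quick back-of-the-envelope check on those counts (assuming both runs end up with a roughly comparable amount of table data, which is plausible for two full syncs of the same chain but wasn't measured directly):

```go
package main

import "fmt"

func main() {
	// Illustrative arithmetic only: derive the total table data from the
	// master numbers and see what average file size the PR's count implies.
	const (
		masterFiles  = 160_000 // ~160K files on master
		masterFileMB = 2       // flat 2MB target on every level
		prFiles      = 9_200   // ~9.2K files on the PR branch
	)
	totalMB := float64(masterFiles * masterFileMB)
	fmt.Printf("implied table data:           ~%.0f GB\n", totalMB/1024)    // ~312 GB
	fmt.Printf("implied average PR file size: ~%.0f MB\n", totalMB/prFiles) // ~35 MB
}
```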
These numbers were expected; the more interesting question is how the change affects performance (charts: green == master, yellow == PR). Full syncing the chain is ever so slightly faster with the PR: approximately 12h saved across a 13-day sync. Not bad, but nothing special either. It could just be differences between the machines, but even if real, 12h is always welcome.
CPU-wise, the increasing-level database (after some initial data pileup) uses about half a CPU core less, most probably due to less compaction shuffling. This is a surprise, but a welcome one. Probably not hugely relevant, but it's never bad to use fewer resources.
IO wait is as expected: with larger files, a compaction has more data to move, so more time is spent waiting on the disk in general. That said, we never hit disk limits during the entire full sync, so it seems an acceptable compromise.
The stat I was most worried about, though, is disk writes, since historically this is what blew up beyond acceptable levels. Pleasantly, the PR's disk write overhead is only about 5% after an entire full sync. That is amazing.
All in all, this change seems to have negligible performance implications, but in exchange reduces the number of database files from ~160K to ~10K. My 2c is that reducing the file count is very valuable: on OSes where the file system doesn't handle large numbers of files gracefully, this could be the difference between very fast and unusably slow.