quantification of compaction algorithms #7770
`test_gc_feedback`: duplicate & adapt to measure / regress-test point-in-time space efficiency (5 tasks)
This week:
problame pushed a commit that referenced this issue on Jun 10, 2024:
A simple API to collect some statistics after compaction, to easily understand the result. The tool reads the layer map and analyzes it range by range instead of doing single-key operations, which is more efficient than running a benchmark to collect the result. It currently computes two key metrics:

* Latest data access efficiency: how many delta layers / image layers the system needs to iterate before returning any key in a key range.
* (Approximate) PiTR efficiency, as in #7770: simply the number of delta files in the range. The reasoning: assuming no image layer is created, PiTR efficiency is simply the cost of collecting records from the delta layers plus the replay time, so the number of delta files (or, in the future, the estimated size of reads) is a simple yet effective way of estimating how much effort the pageserver needs to reconstruct a page.

Signed-off-by: Alex Chi Z <chi@neon.tech>
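A rough Python sketch of the two metrics described in that commit message, against a toy layer-map representation (the real tool operates on the pageserver's layer map in Rust; the `Layer` type and its fields below are hypothetical simplifications, not the actual API):

```python
from dataclasses import dataclass

@dataclass
class Layer:
    kind: str                   # "delta" or "image"
    key_range: tuple[int, int]  # [start, end) key range covered by the layer
    lsn: int                    # image layer: snapshot LSN; delta layer: end LSN

def overlaps(layer: Layer, key_range: tuple[int, int]) -> bool:
    return layer.key_range[0] < key_range[1] and key_range[0] < layer.key_range[1]

def latest_data_access_efficiency(layers: list[Layer], key_range: tuple[int, int]) -> int:
    """How many layers a read of the latest data must visit (newest first)
    before an image layer fully answers keys in this range."""
    visited = 0
    for layer in sorted(layers, key=lambda l: l.lsn, reverse=True):
        if not overlaps(layer, key_range):
            continue
        visited += 1
        if layer.kind == "image":
            break  # an image layer terminates the read path for these keys
    return visited

def approx_pitr_efficiency(layers: list[Layer], key_range: tuple[int, int]) -> int:
    """Approximate PiTR cost: number of delta layers overlapping the range,
    i.e. how many delta files WAL replay would have to consult."""
    return sum(1 for l in layers if l.kind == "delta" and overlaps(l, key_range))
```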
This week, @problame to address his follow-up requests from #7867 (review).
This issue was part of the 2024Q2 compaction work (#8001). In the end, that work expanded into Q3 and we focused solely on bottommost compaction. Bottommost compaction is very deterministic and hence the existing quantification work in …
Child of 2024Q2 compaction work: #8001
This epic tracks the efforts to quantify any compaction algorithm's outcomes.
We had a brainstorming session some time back to come up with an (incomplete) set of potentially useful metrics: https://www.notion.so/neondatabase/Productionize-Tiered-Compaction-eca9b06aa1ae4c62bdf6cf40ab002eb6?pvs=4
Meeting notes / ideas:

- (logical size + WAL in PITR window) = synthetic size, can just use that
- sum(all layer files in index_part.json) => just that (see the sketch below)

Demo test case to adapt / apply the Python helpers to: `test_gc_feedback`
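A minimal sketch of the physical-size idea above, assuming `index_part.json` records per-layer file sizes under a `layer_metadata` map with a `file_size` field (field names may differ between index_part format versions); the (logical size + WAL in PITR window) figure is supplied by the caller:

```python
import json

def physical_size_from_index_part(path: str) -> int:
    """Sum of all layer file sizes recorded in index_part.json."""
    with open(path) as f:
        index_part = json.load(f)
    # Assumption: per-layer sizes live under layer_metadata[<layer name>]["file_size"].
    return sum(meta["file_size"] for meta in index_part.get("layer_metadata", {}).values())

def space_amplification(physical_size: int, logical_plus_pitr_wal: int) -> float:
    """Physical bytes stored per byte of (logical size + WAL in the PITR window),
    i.e. roughly physical size / synthetic size."""
    return physical_size / logical_plus_pitr_wal
```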
Refs