admission: control snapshot ingest disk write bandwidth #120708

Closed
Tracked by #121779
sumeerbhola opened this issue Mar 19, 2024 · 1 comment · Fixed by #131243
Assignees: aadityasondhi
Labels: A-admission-control, C-enhancement (solution expected to add code/behavior + preserve backward-compat; pg compat issues are exception)

Comments

@sumeerbhola (Collaborator) commented Mar 19, 2024

See the thread starting at #80607 (comment) for context.

@aadityasondhi @andrewbaptist

Jira issue: CRDB-36837

Epic: CRDB-37479

@sumeerbhola added the C-enhancement and A-admission-control labels Mar 19, 2024
@aadityasondhi (Collaborator) commented

Prerequisite: #86857

@ajstorm added and then removed the O-testcluster (issue found or occurred on a test cluster, i.e. a long-running internal cluster) and P-2 (issues/test failures with a fix SLA of 3 months) labels Jul 18, 2024
@aadityasondhi self-assigned this Aug 6, 2024
aadityasondhi added a commit to aadityasondhi/cockroach that referenced this issue Oct 7, 2024
This patch integrates raft snapshot ingestion with the disk write
mechanism in admission control. The following internal machinery changes
were made to make that possible:
- `SnapshotQueue` was added as an implementation of the `requester`
  interface. Internally, it is a simple FIFO queue, unlike the other work
  queues, since we can assume that all snapshots have the same priority
  and are processed as system-tenant requests (see the sketch after this
  list).
- A new `kvStoreTokenChildGranter` was created to grant tokens to
  snapshot requests.
- We now have a `StoreWorkType` that differentiates `regular`,
  `elastic`, and `snapshot` work for the store granters. This was
  necessary because snapshots do not incur the same write amplification
  as the other work types; they land in L6 of the LSM due to excises.
  We also want these requests to be subject only to pacing based on
  disk bandwidth.
- We now prioritize store writes in the following order: `regular`,
  `snapshot`, `elastic`.
- The `demuxHandle` of the `GrantCoordinator` now uses `StoreWorkType`.
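
For illustration only, a minimal Go sketch of such a FIFO queue behind a
requester-style interface follows. The `requester`, `snapshotQueue`,
`admit`, and `granted` names here are hypothetical stand-ins, not the
actual CockroachDB admission-control types:

```go
package main

import (
	"fmt"
	"time"
)

// requester is a hypothetical stand-in for the interface a work queue
// implements toward its token granter.
type requester interface {
	hasWaitingRequests() bool
	granted(tokens int64)
}

// snapshotQueue is a plain FIFO: all snapshots share one priority and run
// as system-tenant work, so no priority heap is needed.
type snapshotQueue struct {
	waiting chan chan int64 // each waiter receives its granted tokens
}

func newSnapshotQueue() *snapshotQueue {
	return &snapshotQueue{waiting: make(chan chan int64, 128)}
}

func (q *snapshotQueue) hasWaitingRequests() bool { return len(q.waiting) > 0 }

// granted hands tokens to the oldest waiting request, if any.
func (q *snapshotQueue) granted(tokens int64) {
	select {
	case head := <-q.waiting:
		head <- tokens
	default: // nobody is waiting
	}
}

// admit enqueues a request and blocks until tokens are granted.
func (q *snapshotQueue) admit() int64 {
	ch := make(chan int64, 1)
	q.waiting <- ch
	return <-ch
}

func main() {
	var _ requester = (*snapshotQueue)(nil) // snapshotQueue satisfies requester

	q := newSnapshotQueue()
	go func() {
		// A granter would call granted() when disk-bandwidth tokens free up.
		for !q.hasWaitingRequests() {
			time.Sleep(time.Millisecond)
		}
		q.granted(1 << 20)
	}()
	fmt.Println("admitted with tokens:", q.admit()) // blocks until granted
}
```

Because admission order is plain FIFO, the queue needs none of the
priority or tenant-fairness machinery of the regular work queues.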

The integration point for the `SnapshotQueue` is in `Receive()`, where we
use a pacing mechanism to process incoming snapshots, as sketched below.
A snapshot may incur up to `snapshotBurstSize` of disk writes before it
must ask for admission of another burst of the same size. The
`multiSSTWriter` uses Pebble's SST size estimates to maintain a running
count of the disk writes incurred by the snapshot ingest. Once the SST is
finalized, we deduct or return any outstanding tokens.
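
As a rough sketch of that pacing loop, assuming a simplified admission
handle (the `admitter` interface, `snapshotBurstSize` value, and
`writeSnapshot` helper are hypothetical, not the actual `Receive()`
code):

```go
package main

import "fmt"

// snapshotBurstSize is the amount of disk writes allowed per admission
// request; the real value and plumbing live in CockroachDB, this one is
// illustrative.
const snapshotBurstSize = 1 << 20 // 1 MiB

// admitter is a hypothetical stand-in for the admission handle: Admit
// blocks until the caller may write n more bytes; Return hands back
// unused tokens.
type admitter interface {
	Admit(n int64)
	Return(n int64)
}

// noopAdmitter admits everything immediately; a real implementation
// would block based on disk-bandwidth tokens.
type noopAdmitter struct{}

func (noopAdmitter) Admit(n int64)  {}
func (noopAdmitter) Return(n int64) {}

// writeSnapshot writes totalBytes of snapshot data, asking for admission
// once per snapshotBurstSize of disk writes and returning the unused
// remainder of the final burst, mirroring the deduct/return step when an
// SST is finalized.
func writeSnapshot(a admitter, totalBytes int64) {
	var admitted, written int64
	for written < totalBytes {
		if written == admitted {
			a.Admit(snapshotBurstSize) // block until the next burst is allowed
			admitted += snapshotBurstSize
		}
		chunk := min64(snapshotBurstSize, totalBytes-written)
		// ... perform the actual disk writes for this chunk here ...
		written += chunk
	}
	if leftover := admitted - written; leftover > 0 {
		a.Return(leftover)
	}
	fmt.Printf("wrote %d bytes, admitted %d bytes\n", written, admitted)
}

func min64(a, b int64) int64 {
	if a < b {
		return a
	}
	return b
}

func main() {
	writeSnapshot(noopAdmitter{}, 2_500_000) // a ~2.4 MiB snapshot
}
```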

Closes cockroachdb#120708.

Release note (ops change): Admission control now has an integration for
pacing snapshot ingest traffic based on disk bandwidth. The cluster
setting `kvadmission.store.snapshot_ingest_bandwidth_control.enabled`
turns on this integration. Note that provisioned bandwidth must be set
for the store (or for the cluster through the cluster setting) for it to
take effect.
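
The gating implied by the release note can be summarized in a tiny sketch;
the function and parameter names below are hypothetical, not the actual
CockroachDB code:

```go
package main

import "fmt"

// snapshotPacingActive sketches the condition under which the integration
// takes effect: the enabled setting is on AND a provisioned bandwidth has
// been configured for the store (or cluster).
func snapshotPacingActive(settingEnabled bool, provisionedBandwidthBytesPerSec int64) bool {
	return settingEnabled && provisionedBandwidthBytesPerSec > 0
}

func main() {
	fmt.Println(snapshotPacingActive(true, 0))       // false: no provisioned bandwidth set
	fmt.Println(snapshotPacingActive(true, 500<<20)) // true: enabled, 500 MiB/s provisioned
}
```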
craig bot pushed a commit that referenced this issue Oct 11, 2024
131243: admission, kvserver: snapshot integration for disk bandwidth r=sumeerbhola a=aadityasondhi

Co-authored-by: Aaditya Sondhi <20070511+aadityasondhi@users.noreply.github.com>
@craig (craig bot) closed this as completed in 7afa154 Oct 11, 2024
annrpom pushed a commit to annrpom/cockroach that referenced this issue Oct 14, 2024