storage: Snapshot bandwidth "priority inversion" #15274

bdarnell · 2017-04-23T01:13:15Z

Snapshots are currently placed into two categories for bandwidth management, which effectively act as priorities. However, since we also only allow only one snapshot at a time (per target node), we have problems with priority inversion - a high-priority operation is not allowed to interrupt an existing low-priority operation that may take a while to finish. We should introduce some way to interrupt low-priority rebalance operations when they compete with high-priority repairs (unless this entire mechanism is reworked as discussed in #14768)

petermattis · 2017-04-24T13:55:46Z

An alternative to interrupting a low-priority snapshot would be to adjust its bandwidth dynamically. I'm wondering if this is a real problem to solve, though. Recovery operations are already prioritized over rebalance operations, so the most a recovery operation will have to wait is for one rebalance operation to finish.

bdarnell · 2017-04-24T14:43:55Z

Yeah, I think this is probably a theoretical concern for now. It will be a bigger issue when/if we increase the max range size since "one rebalance operation" could take longer.

tbg · 2018-10-11T10:51:11Z

Folding into #14768.

bdarnell added this to the 1.1 milestone Apr 23, 2017

bdarnell modified the milestones: Later, 1.1 Aug 14, 2017

bdarnell added C-performance Perf of queries or internals. Solution not expected to change functional behavior. A-kv-replication Relating to Raft, consensus, and coordination. labels Apr 26, 2018

petermattis removed this from the Later milestone Oct 5, 2018

tbg closed this as completed Oct 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

storage: Snapshot bandwidth "priority inversion" #15274

storage: Snapshot bandwidth "priority inversion" #15274

bdarnell commented Apr 23, 2017

petermattis commented Apr 24, 2017

bdarnell commented Apr 24, 2017

tbg commented Oct 11, 2018

storage: Snapshot bandwidth "priority inversion" #15274

storage: Snapshot bandwidth "priority inversion" #15274

Comments

bdarnell commented Apr 23, 2017

petermattis commented Apr 24, 2017

bdarnell commented Apr 24, 2017

tbg commented Oct 11, 2018