kvserver: avoid lease transfers to restarted nodes that could become overloaded #96521
Closed
2 tasks done
Labels
A-kv-distribution
Relating to rebalancing and leasing.
branch-release-23.1
Used to mark GA and release blockers, technical advisories, and bugs for 23.1
C-enhancement
Solution expected to add code/behavior + preserve backward-compat (pg compat issues are exception)
GA-blocker
T-kv
KV Team
Milestone
We have seen that a node restarting in the cluster can have very negative effects on request latency.
A reproduction test was added in #95161
Allowing Lease Transfers
data:image/s3,"s3://crabby-images/5d088/5d088366d47aea478b44f8b4d64e80962a078fdf" alt="image"
data:image/s3,"s3://crabby-images/1a9c5/1a9c54d0833a43e79c96bec4759771a52eb6b103" alt="image"
Extremely High P99 SQL Latency
Blocking Lease Transfers
data:image/s3,"s3://crabby-images/03dab/03dab632d6a1fd0874588fdaa2ba1f6860c5f1dc" alt="image"
data:image/s3,"s3://crabby-images/3ab55/3ab55cfa8660ba387c137c95cd76fe1fd7e00f1c" alt="image"
No impact on P99 SQL Latency
This issue tracks the improvements to crdb in 23.1 to prevent a restarted node impacting end latency. The improvements are narrowly scoped but have been proven to have a significant effect in keeping request latency normal.
Jira issue: CRDB-24164
The text was updated successfully, but these errors were encountered: