Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update alert rule for low space (#6376) #6401

Merged
Merged
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
13 changes: 11 additions & 2 deletions alert-rules.md
Original file line number Diff line number Diff line change
Expand Up @@ -751,11 +751,20 @@ This section gives the alert rules for the TiKV component.

Check which kind of tasks has a higher value. You can normally find a solution to the Coprocessor and apply worker tasks from other metrics.

#### `TiKV_low_space_and_add_region`
#### `TiKV_low_space`

* Alert rule:

`count((sum(tikv_store_size_bytes{type="available"}) by (instance) / sum(tikv_store_size_bytes{type="capacity"}) by (instance) < 0.2) and (sum(tikv_raftstore_snapshot_traffic_total{type="applying"}) by (instance) > 0)) > 0`
`sum(tikv_store_size_bytes{type="available"}) by (instance) / sum(tikv_store_size_bytes{type="capacity"}) by (instance) < 0.2`

* Description:

The data volume of TiKV exceeds 80% of the configured node capacity or the disk capacity of the machine.

* Solution:

* Check the balance condition of node space.
* Make a plan to increase the disk capacity or delete some data or increase cluster node depending on different situations.

#### `TiKV_approximate_region_size`

Expand Down