Speed up balance leader #4610

nolouch · 2022-01-25T08:05:26Z

Development Task

Currently, balance leader only have max ops 100 op/s. we want to increase the speed when a big cluster restart (2M regions).

one way like: #4008

nolouch · 2022-01-25T08:25:00Z

cc @CabinfeverB, Would you like to take a look? ptal @rleungx

CabinfeverB · 2022-01-25T08:47:13Z

I will take a look

nolouch · 2022-02-08T06:19:15Z

/assign @CabinfeverB

CabinfeverB · 2022-02-18T05:29:27Z

Motivation

Currently, the MinScheduleInterval param determines the balance-leader speed. According to MinScheduleInterval equals 10 ms, balance-leader only has max ops 100 op/s.

If there are 100K regions that need to balance leader when a big cluster restart (2M regions), it will take 30 minutes. This is an unacceptable time cost

Detailed Design

Considering that the trigger frequency of the scheduler should not be too fast, we decided to add a batch field in the balance-leader scheduler to speed up balance leader by increasing the number of operators generated every scheduling.

In the TiKV, since we believe that the performance overhead of transferring leader in a raft group is small, the transfer-leader operator does not consume the store limit. This means that regions can be repeatedly selected from a store, so a priority-queue-like idea can be adopted. Operators are extracted from the store which has the highest/lowest leader score and we calculate the influence to adjust this top of 'heap', unless it is really impossible to extract. Then extract the next highest/timer low, and so on.

Usage Desc

Since we think the balance leader is an urgent scheduler, we set the Batch parameter of the balance-leader to 4 by default. But considering the potential scheduler competition scenario, we have added an api for configuration. At the same time, related configuration functions will be added into pd-ctl.

Development Plan

Subtasks

#4652 must be involved

Test Plan

Under the same cluster size, it should be possible to obtain an approximate linear optimization by testing the time to reach the equilibrium state when the Batch is equal to different values.

In order to test whether the original goal can be achieved, it is best to have a large cluster to do the test.

CabinfeverB · 2022-03-08T03:09:42Z

cc @mayjiang0203

…rs (#4652) ref #4008, ref #4610 speed up balance leader by batch Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

ref #4610, ref #4652 Add `balance-leader-leader` config API. Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

ref #4610, ref #4652, ref #4655 pdctl supports update balance-leader-scheduler config Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

close #4610 add lock to avoid data race in balance-leader-scheduler Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

ref tikv#4610, ref tikv#4652, ref tikv#4655 pdctl supports update balance-leader-scheduler config Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

close tikv#4610 add lock to avoid data race in balance-leader-scheduler Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com> Co-authored-by: Ti Chi Robot <ti-community-prow-bot@tidb.io>

…4747) close #4610 adjust `Batch` size when created by `ConfigJSONDecoder` Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com>

nolouch added the type/development The issue belongs to a development tasks label Jan 25, 2022

ti-chi-bot assigned CabinfeverB Feb 8, 2022

This was referenced Feb 18, 2022

scheduler: allow balance-leader-scheduler generate multiple operators #4652

Merged

pdctl: support config balance-leader #4656

Merged

api, scheduler: add balance-leader config handler #4655

Merged

CabinfeverB mentioned this issue Mar 17, 2022

scheduler: Add lock for balance leader config #4742

Merged

ti-chi-bot closed this as completed in #4742 Mar 17, 2022

rleungx mentioned this issue Mar 17, 2022

Improve the speed of the balance-leader-scheduler #4454

Closed

CabinfeverB mentioned this issue Mar 17, 2022

scheduler: adjust Batch size when created by ConfigJSONDecoder #4747

Merged

ti-chi-bot pushed a commit that referenced this issue Mar 17, 2022

scheduler: adjust Batch size when created by ConfigJSONDecoder (#…

f1b8f80

…4747) close #4610 adjust `Batch` size when created by `ConfigJSONDecoder` Signed-off-by: Cabinfever_B <cabinfeveroier@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Speed up balance leader #4610

Speed up balance leader #4610

nolouch commented Jan 25, 2022 •

edited

Loading

nolouch commented Jan 25, 2022

CabinfeverB commented Jan 25, 2022

nolouch commented Feb 8, 2022

CabinfeverB commented Feb 18, 2022 •

edited

Loading

CabinfeverB commented Mar 8, 2022

Speed up balance leader #4610

Speed up balance leader #4610

Comments

nolouch commented Jan 25, 2022 • edited Loading

Development Task

nolouch commented Jan 25, 2022

CabinfeverB commented Jan 25, 2022

nolouch commented Feb 8, 2022

CabinfeverB commented Feb 18, 2022 • edited Loading

Motivation

Detailed Design

Usage Desc

Development Plan

Subtasks

Test Plan

CabinfeverB commented Mar 8, 2022

nolouch commented Jan 25, 2022 •

edited

Loading

CabinfeverB commented Feb 18, 2022 •

edited

Loading