-
Notifications
You must be signed in to change notification settings - Fork 720
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Lightning block at not full replicaed #6426
Labels
Comments
6 tasks
#6420 also add quick fix commit for this issue. |
ti-chi-bot bot
added a commit
that referenced
this issue
May 10, 2023
close #6426 allow the `schedule=deny` label can do rule constraints check Signed-off-by: nolouch <nolouch@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>
ti-chi-bot
pushed a commit
to ti-chi-bot/pd
that referenced
this issue
May 10, 2023
close tikv#6426 Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
nolouch
added a commit
to nolouch/pd
that referenced
this issue
May 15, 2023
close tikv#6426 allow the `schedule=deny` label can do rule constraints check Signed-off-by: nolouch <nolouch@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>
16 tasks
ti-chi-bot bot
added a commit
that referenced
this issue
May 25, 2023
close #6426, ref #6427 allow the `schedule=deny` label can do rule constraints check Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io> Signed-off-by: nolouch <nolouch@gmail.com> Signed-off-by: Ryan Leung <rleungx@gmail.com> Co-authored-by: ShuNing <nolouch@gmail.com> Co-authored-by: nolouch <nolouch@gmail.com> Co-authored-by: Ryan Leung <rleungx@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>
ti-chi-bot
pushed a commit
to ti-chi-bot/pd
that referenced
this issue
Feb 18, 2024
close tikv#6426 Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io>
ti-chi-bot bot
pushed a commit
that referenced
this issue
Feb 26, 2024
close #6426 allow the `schedule=deny` label can do rule constraints check Signed-off-by: ti-chi-bot <ti-community-prow-bot@tidb.io> Signed-off-by: nolouch <nolouch@gmail.com> Co-authored-by: ShuNing <nolouch@gmail.com> Co-authored-by: nolouch <nolouch@gmail.com>
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Labels
Bug Report
Problem
In concurrent import, we may have replicas that do not satisfy the replicas constraints, such as:
the lightning split the region during the scatter, it may cause abnormal regions. We can see the logs in pd like, it has 6 replicas :
And metrics like:
That's say the lightning tries to scatter the region, but the region is not satisfied by the replicas count of the
rule constraint
, the replica is less or large than the expect. currently, PD should fix this region, then lighting can retry request success to scatter this region. but from the log, PD is always not trying to fix it.Analyze
After the investigation, we found lightning use the new interface, want to pauses the scheduler of a key range. but the function introduced by #4649 will stop all scheduler and operator for the key range if the key range label as
schedule=deny
,It will block the
RuleChecker
to fix the region, it is unexpected during the import. details:pd/pkg/schedule/checker/checker_controller.go
Lines 74 to 89 in d3ff9d6
in before, lighting will timeout in 3mins, and then the label of pause scheduler will be clear, and then pd can continue fix the region. but currently, lighting increase the timeout, so it's easier to observe occur the block issue.
How to fix
Do not deny the placement rule operator during the import, we may consider a new label like
schedule=importing
or directly let deny allow rule check.What version of PD are you using (
pd-server -V
)?master
The text was updated successfully, but these errors were encountered: