You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
There is a leader shuffler used to shuffle the leader fo the leader region(leader region is the region containing the key ""). As you can see from the log, region #11920 is the source region and #11860 is the target region. After 10s, it still think the leader region is #11920.
[case-id: 210657] 2020/02/21 18:00:48 leader_shuffler.go:383: �[0;37m[info] [leader shuffler] [leader=6] Merge leader region #11920 and #11860�[0m
[case-id: 210657] 2020/02/21 18:00:58 leader_shuffler.go:282: �[0;37m[info] [leader shuffler] [leader=6] Transfer leader region #11920 from [5 4 6] to [1 4 5]�[0m
It's the log of the leader of region #11860. The region heartbeat is sent successfully.
Log of PD. PD received the heartbeat of region #11860 and updated its key range.
[2020/02/21 10:00:48.494 +00:00] [INFO] [operator_controller.go:293] ["add operator"] [region-id=11860] [operator="\"admin-merge-region (kind:admin,merge, region:11860(4755,434), createAt:2020-02-21 10:00:48.494316899 +0000 UTC m=+91025.692975359, startAt:0001-01-01 00:00:00 +0000 UTC, currentStep:0, steps:[merge region 11920 into region 11860]) \""]
[2020/02/21 10:00:48.498 +00:00] [INFO] [operator_controller.go:103] ["operator finish"] [region-id=11860] [operator="\"admin-merge-region (kind:admin,merge, region:11860(4755,434), createAt:2020-02-21 10:00:48.494316899 +0000 UTC m=+91025.692975359, startAt:2020-02-21 10:00:48.494511375 +0000 UTC m=+91025.693169846, currentStep:1, steps:[merge region 11920 into region 11860]) finished\""]
[2020/02/21 10:00:48.497 +00:00] [INFO] [cluster_info.go:551] ["region Version changed"] [region-id=11920] [detail="StartKey:{}, EndKey:{7480000000000000FF0500000000000000F8}"] [old-version=4776] [new-version=4777]
[2020/02/21 10:00:48.498 +00:00] [INFO] [cluster_info.go:551] ["region Version changed"] [region-id=11860] [detail="StartKey Changed:{7480000000000000FF0500000000000000F8} -> {}, EndKey:{7480000000000000FF0F00000000000000F8}"] [old-version=4755] [new-version=4778]
Log of TiDB. You can see the region 11920 was destroyed at 10:00:48, but after 20s, it still existed in PD.
[2020/02/21 10:01:09.016 +00:00] [WARN] [backoff.go:305] ["regionMiss backoffer.maxSleep 20000ms is exceeded, errors:\nmessage:\"region 11920 is missing\" region_not_found:<region_id:11920 > at 2020-02-21T10:01:08.01143584Z\nmessage:\"region 11920 is missing\" region_not_found:<region_id:11920 > at 2020-02-21T10:01:08.512857692Z\nmessage:\"region 11920 is missing\" region_not_found:<region_id:11920 > at 2020-02-21T10:01:09.016322964Z"]
Please answer these questions before submitting your issue. Thanks!
In the Schrodinger test of the deadlock detector, I found it spent about 40s to find the true leader of the leader region from PD:
There is a leader shuffler used to shuffle the leader fo the leader region(leader region is the region containing the key ""). As you can see from the log, region #11920 is the source region and #11860 is the target region. After 10s, it still think the leader region is #11920.
It's the log of the leader of region #11860. The region heartbeat is sent successfully.
Log of PD. PD received the heartbeat of region #11860 and updated its key range.
Log of TiDB. You can see the region 11920 was destroyed at 10:00:48, but after 20s, it still existed in PD.
You can see all logs here: https://schrodinger.pingcap.net/?#/experimentdetail?experimentid=408178&boxid=210001
What did you expect to see?
PD update region info correctly and timely.
What version of PD are you using (
pd-server -V
)?branch: release-3.0
hash: 1b67bbb
The text was updated successfully, but these errors were encountered: