-
Notifications
You must be signed in to change notification settings - Fork 725
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
TestRegionStatistics is unstable #8319
Comments
close #8319 Because this test has only 2 pd's and five ResignLeader times, whenever there is a network or disk problem will cause the lease to expire, and result in one more leader resigning. And then there is a high probability of being recognized as frequent campaign times to transfer leader failed Signed-off-by: husharp <jinhao.hu@pingcap.com>
https://github.com/tikv/pd/actions/runs/9936180084/job/27443847817 |
https://github.com/tikv/pd/actions/runs/9967757112/job/27541904077 |
Conclude: Because frequently change(There are many resign leader in this test and slow disk can make frequently happened in high probability :(
leaderName = leaderServer.GetServer().Name()
leaderServer.ResignLeader()
re.NotEqual(tc.WaitLeader(), leaderName) Let's check the log: // now leader is pd3
[2024/07/17 03:38:56.625 +00:00] [INFO] [server.go:1737] ["campaign PD leader ok"] [campaign-leader-name=pd3]
[2024/07/17 03:38:57.143 +00:00] [INFO] [member.go:356] ["try to resign etcd leader to next pd-server"] [from=pd3] [to=]
// due to frequently transfer leader to pd3 as well
[2024/07/17 03:38:58.149 +00:00] [INFO] [member.go:356] ["try to resign etcd leader to next pd-server"] [from=pd2] [to=]
[2024/07/17 03:38:58.190 +00:00] [INFO] [server.go:1737] ["campaign PD leader ok"] [campaign-leader-name=pd3]
[2024/07/17 03:38:58.650 +00:00] [ERROR] [server.go:1717] ["campaign PD leader meets error due to etcd error"] [campaign-leader-name=pd2] [error="[PD:server:ErrLeaderFrequentlyChange]leader pd2 frequently changed, leader-key is [/pd/7392444153452143012/leader]"] |
close #8319 Signed-off-by: husharp <ihusharp@gmail.com> Co-authored-by: ti-chi-bot[bot] <108142056+ti-chi-bot[bot]@users.noreply.github.com>
Flaky Test
Which jobs are failing
CI link
https://github.com/tikv/pd/actions/runs/9611005214/job/26508694960?pr=7756
Reason for failure (if possible)
Because this test has only 2 pd's and five ResignLeader times, whenever there is a network or disk problem will cause the lease to expire, and result in one more leader resigning.
There is a high probability of being recognized as frequent campaign times.
Anything else
The text was updated successfully, but these errors were encountered: