test_pfcwd_wb is flaky due to missing log #8490
Comments
@bingwang-ms does this reproduce only on SN2700, or do you see failures on other platforms as well? This is important: if you're seeing it on other platforms, the TC can be modified.
Hi @SavchukRomanLv, I only saw this failure on SN2700. I also checked the test results on the SN4600 testbed for the past 30 days, and it was pretty stable.
Hi @bingwang-ms, can you please share which fanout was used? Thank you!
Hi @SavchukRomanLv, the fanout switch for the testbed is an Arista-7260. But I don't think this issue is related to the leaf fanout, because we can confirm from the syslog that the PFC storm was triggered successfully after warm-boot. The only issue is the log pattern
Hi @bingwang-ms. Most probably the message is not detected due to 15076, 11180. I see that 15080 was also included in the 202205 branch two days ago. Can you update the sonic-buildimage pointer and monitor whether the TC is still randomly failing?
Thanks @SavchukRomanLv. I manually updated the file as PR#15080 did and ran the test 3 times; all runs passed. It looks like the issue can be addressed by PR#15080.
Description
The test case test_pfcwd_wb is flaky on the SN2700 platform. The reason is that the log below didn't show up after warm-reboot.
Actually, the warm-reboot is completed, and PFC watchdog is triggered after warm-reboot as expected.
The missing log is used to locate the start point after warm-reboot.
Because the expected log for the most recent warm-reboot is missing, LogAnalyzer searches the entire syslog until the log pattern is found. Eventually, LogAnalyzer finds the log pattern of an earlier warm-reboot in the history. As a result, more log messages are parsed, and more PFC watchdog logs are found.
There are two possible reasons for the missing log:
test_disable_rsyslog_rate_limit in test_pretest.py doesn't work as expected
Steps to reproduce the issue:
Run test_pfcwd_wb. The failing rate is around 20%.
Describe the results you received:
Describe the results you expected:
Additional information you deem important:
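To illustrate the failure mode described above, here is a minimal sketch of how a missing start marker widens the search window. This is not the actual LogAnalyzer implementation; the patterns `WARM_BOOT_START` and `PFC_STORM_DETECTED` and both helper functions are hypothetical placeholders, not real SONiC syslog messages or test APIs.

```python
import re

# Hypothetical placeholder patterns; NOT the real SONiC syslog messages.
START_MARKER = re.compile(r"WARM_BOOT_START")
PFCWD_EVENT = re.compile(r"PFC_STORM_DETECTED")


def lines_after_last_marker(lines):
    """Return the lines that follow the most recent start marker.

    If no marker is present at all, the whole log is returned.
    """
    # Walk backwards so the first hit is the most recent marker.
    for idx in range(len(lines) - 1, -1, -1):
        if START_MARKER.search(lines[idx]):
            return lines[idx + 1:]
    return list(lines)


def count_pfcwd_events(lines):
    """Count PFC watchdog events found after the most recent start marker."""
    return sum(
        1 for line in lines_after_last_marker(lines) if PFCWD_EVENT.search(line)
    )


# An earlier warm-reboot whose marker and PFC watchdog event were both logged.
old_boot = ["WARM_BOOT_START", "PFC_STORM_DETECTED port=1"]
# The most recent warm-reboot, with and without its start marker.
new_boot_logged = ["WARM_BOOT_START", "PFC_STORM_DETECTED port=1"]
new_boot_marker_lost = ["PFC_STORM_DETECTED port=1"]

healthy = count_pfcwd_events(old_boot + new_boot_logged)
flaky = count_pfcwd_events(old_boot + new_boot_marker_lost)
print(healthy, flaky)
```

In this sketch, the run with the marker present counts only the event from the most recent warm-reboot, while the run with the marker lost also picks up the event from the earlier warm-reboot, which matches the extra PFC watchdog logs seen in the failing runs.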