-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[System-ready] System-ready status sometimes is not reflecting the correct status #15935
Comments
sg893052 is taking a look, will share findings |
@Praveen-Brcm @adyeung @sg893052 The issue now occurs statistically. Here is output from one of our devices
Attaching STATE_DB and syslog here |
@Praveen-Brcm @adyeung @sg893052 Any update on this? |
I will look into this and get back in couple of days. Please let me know if I could try out with the latest master https://sonic-build.azurewebsites.net/ui/sonic/pipelines/138/builds?branchName=master and just loading the image results in this issue? |
@dgsudharsan I couldn't reproduce the issue. Please let me know if there is any repro scenario or platform specific.
|
@sg893052 Does sysready get affected by absence or bad PSU? From the state_db log I find this. If this is the case can't sys ready highlight what the issue is?
|
In another repro, I find that we don't have the SYSTEM_READY|SYSTEM_STATE table in state_db but all app ready status are present. Do we know what could be the root cause of this? |
I was able to reproduce the problem and have a backtrace Mar 13 21:06:25.810087 sonic NOTICE healthd: System is ready |
Closing this issue as the fix is merged |
Description
System ready status is not reflecting the correct status. Below is an example. The telemetry status shows ok even though the app has exited. The overall system status shows not ready but all services are shown as Ok.
This deviates from the output of system health detail
Steps to reproduce the issue:
Describe the results you received:
show system-health sysready-status shows conflicting status.
Describe the results you expected:
show system-health sysready-status should align with actual status of the system
Output of
show version
:Output of
show techsupport
:Additional information you deem important (e.g. issue happens only occasionally):
sonic_dump_qa-eth-vt01-3-2700a0_20230721_183209.tar.gz
The text was updated successfully, but these errors were encountered: