Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[201811][DELL] S6100, Z9100 Last Reboot Reason Watchdog, Thermal Support #3520

Closed

Conversation

sridhar-ravindran
Copy link
Contributor

@sridhar-ravindran sridhar-ravindran commented Sep 26, 2019

- What I did
Added support for Watchdog & Thermal events in Last Reboot Reason "show reboot-cause" command
- How I did it
On branch lrr_watchdog_nov
Your branch is up-to-date with 'upstream/201811'.
Changes to be committed:
(use "git reset HEAD ..." to unstage)

    modified:   files/image_config/platform/rc.local
    modified:   platform/broadcom/sonic-platform-modules-dell/common/platform_reboot
    modified:   platform/broadcom/sonic-platform-modules-dell/debian/platform-modules-s6100.install
    modified:   platform/broadcom/sonic-platform-modules-dell/debian/platform-modules-z9100.install
    new file:   platform/broadcom/sonic-platform-modules-dell/s6100/scripts/fastboot_plugin
    modified:   platform/broadcom/sonic-platform-modules-dell/s6100/scripts/s6100_platform.sh
    new file:   platform/broadcom/sonic-platform-modules-dell/s6100/scripts/track_reboot_reason.sh
    modified:   platform/broadcom/sonic-platform-modules-dell/s6100/sonic_platform/chassis.py
    new file:   platform/broadcom/sonic-platform-modules-dell/z9100/scripts/fastboot_plugin
    new file:   platform/broadcom/sonic-platform-modules-dell/z9100/scripts/track_reboot_reason.sh
    modified:   platform/broadcom/sonic-platform-modules-dell/z9100/scripts/z9100_platform.sh
    modified:   platform/broadcom/sonic-platform-modules-dell/z9100/sonic_platform/chassis.py

Added support to track watchdog events and thermal overload of ASIC & CPU

- How to verify it
Added the Unit Test Results.
- Description for the changelog

[DELL] S6100, Z9100 Last Reboot Reason Watchdog, Thermal Support

LRR_UnitTesting_Nov.txt

- A picture of a cute animal (not mandatory but encouraged)

@jleveque jleveque changed the title [DELL] S6100, Z9100 Last Reboot Reason Watchdog, Thermal Support in N… [201811][DELL] S6100, Z9100 Last Reboot Reason Watchdog, Thermal Support Sep 26, 2019
@sujinmkang sujinmkang self-assigned this Sep 27, 2019
@sujinmkang
Copy link
Collaborator

@sridhar-ravindran watchdog reboot reason doesn't work consistently on this branch.
Can you check why it's not working consistently?
++++++++++++++++++++++++++++++++++++++
admin@str-s6100-acs-2:$ show reboot-cause
Unknown
admin@str-s6100-acs-2:
$ sudo cat /var/log/syslog | grep -a watchdog0
Sep 27 21:08:24.700974 str-s6100-acs-2 CRIT kernel: [ 629.250819] watchdog: watchdog0: watchdog did not stop!
Sep 27 21:13:25.657467 str-s6100-acs-2 CRIT kernel: [ 178.895828] watchdog: watchdog0: watchdog did not stop!
admin@str-s6100-acs-2:$ sudo cat /var/log/syslog | grep -a watchdog
Sep 27 21:08:24.700974 str-s6100-acs-2 CRIT kernel: [ 629.250819] watchdog: watchdog0: watchdog did not stop!
Sep 27 21:10:30 sonic kernel: [ 0.076100] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
Sep 27 21:10:30 sonic kernel: [ 0.076100] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
Sep 27 21:13:25.657467 str-s6100-acs-2 CRIT kernel: [ 178.895828] watchdog: watchdog0: watchdog did not stop!
Sep 27 21:14:24.925744 str-s6100-acs-2 INFO kernel: [ 0.076117] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
Sep 27 21:20:52.822780 str-s6100-acs-2 INFO kernel: [ 0.076068] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
Sep 27 21:26:41.882124 str-s6100-acs-2 INFO kernel: [ 0.076077] NMI watchdog: enabled on all CPUs, permanently consumes one hw-PMU counter.
admin@str-s6100-acs-2:
$ cat /sys/devices/platform/SMF.512/hwmon/hwmon1/mb_poweron_reason
0x99
++++++++++++++++++++++++++++++++

@jleveque
Copy link
Contributor

@sujinmkang: can you please format your comment for readbility? Use a code block (three backticks). Reference: https://guides.github.com/features/mastering-markdown/

@paavaanan
Copy link
Contributor

@sujinmkang will check and get back on this.

@sridhar-ravindran
Copy link
Contributor Author

Hi,
We have raised #3767 to support this feature in master.
Once approved, will raise a new PR for this branch. Request you to review the same.

@sridhar-ravindran sridhar-ravindran deleted the lrr_watchdog_nov branch November 22, 2019 05:36
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants