[Mellanox] use SDK sysfs for module low power mode control #218

Closed
wants to merge 11 commits into from

Conversation

Junchao-Mellanox
Owner

@Junchao-Mellanox Junchao-Mellanox commented Mar 19, 2024

Why I did it

The Nvidia platform API currently uses the SDK API to control module low power mode (LPM). This PR migrates that logic to the sysfs nodes provided by the SDK.

Work item tracking
  • Microsoft ADO (number only):

How I did it

Use SDK-provided sysfs nodes for LPM control.
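
For illustration only, here is a minimal sketch of what sysfs-based LPM control could look like. The PR text does not give the node path or the value encoding, so the `node_path` argument and the "1"/"0" convention below are assumptions, and `get_lpmode`/`set_lpmode` are hypothetical helper names rather than the code in this change.

```python
def get_lpmode(node_path):
    """Return True if the module is in low power mode.

    Assumption: the sysfs node holds "1" for low power mode and "0" otherwise.
    """
    with open(node_path, "r") as node:
        return node.read().strip() == "1"


def set_lpmode(node_path, enable):
    """Request low power mode on or off by writing to the sysfs node.

    Returns True when a read-back of the node matches the requested state.
    """
    with open(node_path, "w") as node:
        node.write("1" if enable else "0")
    return get_lpmode(node_path) == enable
```

Reading the node back after the write is one simple way to confirm the request took effect; a real implementation may need to wait or poll instead.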

How to verify it

Manual test passed
sonic-mgmt regression test passed

Which release branch to backport (provide reason below if selected)

  • 201811
  • 201911
  • 202006
  • 202012
  • 202106
  • 202111
  • 202205
  • 202211
  • 202305

Tested branch (Please provide the tested image version)

Description for the changelog

Link to config_db schema for YANG module changes

A picture of a cute animal (not mandatory but encouraged)

@Junchao-Mellanox Junchao-Mellanox force-pushed the master-lpm branch 2 times, most recently from 4bd4567 to 66638b7 Compare June 11, 2024 03:36
@Junchao-Mellanox Junchao-Mellanox changed the base branch from master to master_test_thermal_updater June 18, 2024 10:01
@Junchao-Mellanox Junchao-Mellanox changed the base branch from master_test_thermal_updater to master June 18, 2024 10:01
@Junchao-Mellanox Junchao-Mellanox changed the base branch from master to master_test_thermal_updater June 18, 2024 10:02
@Junchao-Mellanox Junchao-Mellanox changed the base branch from master_test_thermal_updater to master June 18, 2024 10:02
yuazhe and others added 11 commits June 19, 2024 09:33
…onic-net#18984)

- Why I did it
Enable get_tx_fault support in CMIS cable host mgmt mode on Mellanox platforms

- How I did it
Use community get_tx_fault implementation instead of the dummy one

- How to verify it
API call and run API Test

Signed-off-by: Yuanzhe, Liu <yualiu@nvidia.com>
- Add ASIC_TYPE to force orchagent to run with the given ASIC.
- Add PFC_DLR_INIT_ENABLE to allow test to override PfcDlrInitEnable for VS switch.
  This is required to test PfcWdAclHandler, instead of PfcWdDlrHandler.
* [TSA] Reliable TSA: Addressing pizza box issues

- Why I did it
Implement HLD https://github.com/skeesara-nokia/SONiC/blob/master/doc/voq/Reliable_TSA.md

- How I did it
A new attribute "tsa_enabled" has been added to CHASSIS_APP_DB; its value changes whenever TSA/TSB is issued on the supervisor (the default value is false). bgpcfgd subscribes to CHASSIS_APP_DB to receive updates on the new "tsa_enabled" attribute and, in conjunction with the CONFIG_DB "tsa_enabled" attribute value, determines whether the BGP operational state is TSA or TSB.
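
As a rough sketch of the decision described above: the commit message does not spell out how the two values are combined, so the rule below (TSA if either database reports "true") and the "true"/"false" string encoding are assumptions, and `bgp_operational_state` is a hypothetical name, not a bgpcfgd function.

```python
def bgp_operational_state(config_db_tsa_enabled, chassis_app_db_tsa_enabled="false"):
    """Combine the two "tsa_enabled" values into one operational state.

    Assumption: the device is in TSA when either CONFIG_DB or
    CHASSIS_APP_DB has tsa_enabled set to "true"; otherwise it is in TSB.
    """
    local_tsa = str(config_db_tsa_enabled).lower() == "true"
    chassis_tsa = str(chassis_app_db_tsa_enabled).lower() == "true"
    return "TSA" if (local_tsa or chassis_tsa) else "TSB"
```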

Signed-off-by: fountzou <ioannis.fountzoulas@nokia.com>
… Smart Switch. (sonic-net#19200)

The configuration generated from the template for the Smart Switch contained incorrect data in the "subnet4:id" field. For regular cases, the subnet ID is deduced from the VLAN name. For the Smart Switch, there is always one subnet, and the ID is set to 0.
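
The rule above can be sketched as follows. This assumes the subnet ID is the numeric suffix of a VLAN name of the form `Vlan<N>`; the commit message only says the ID is deduced from the VLAN name, so the exact derivation and the `subnet_id` helper are illustrative.

```python
import re


def subnet_id(vlan_name, is_smart_switch=False):
    """Return the value for the "subnet4:id" field.

    Smart Switch: there is always one subnet, so the ID is 0.
    Regular case (assumed): the ID is the number in a "Vlan<N>" name.
    """
    if is_smart_switch:
        return 0
    match = re.fullmatch(r"Vlan(\d+)", vlan_name)
    if match is None:
        raise ValueError("unexpected VLAN name: %r" % vlan_name)
    return int(match.group(1))
```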
…bookworm and add new platforms support. (sonic-net#18936)

* Update old platform code.
* s6301-56st
* s7801-54xs
* s8901-54xc
* s9110-32x
* s9300-32d

* Update code to support Bookworm
* s6301-56st
* s7801-54xs
* s8901-54dc
* s9110-32x
* s9300-32d

* Add new ufispace platforms.
* s9301-32d
* s9301-32db

* Update s9300-32d and s9301-32db bcm port config
- Why I did it

I observed an issue with fast-reboot: in rare circumstances, a queued FDB event might be written to ASIC_DB by a thread inside syncd after FLUSHDB had already been called on ASIC_DB.
That left ASIC_DB with only a single record, for that FDB entry, and caused syncd to crash at start:

Mar 15 13:28:42.765108 sonic NOTICE syncd#SAI: :- Syncd: syncd started
Mar 15 13:28:42.765268 sonic NOTICE syncd#SAI: :- onSyncdStart: performing hard reinit since COLD start was performed
Mar 15 13:28:42.765451 sonic NOTICE syncd#SAI: :- readAsicState: loaded 1 switches
Mar 15 13:28:42.765465 sonic NOTICE syncd#SAI: :- readAsicState: switch VID: oid:0x21000000000000
Mar 15 13:28:42.765465 sonic NOTICE syncd#SAI: :- readAsicState: read asic state took 0.000205 sec
Mar 15 13:28:42.766364 sonic NOTICE syncd#SAI: :- onSyncdStart: on syncd start took 0.001097 sec
Mar 15 13:28:42.766376 sonic ERR syncd#SAI: :- run: Runtime error during syncd init: map::at
Mar 15 13:28:42.766376 sonic NOTICE syncd#SAI: :- sendShutdownRequest: sending switch_shutdown_request notification to OA for switch: oid:0x0
Mar 15 13:28:42.766518 sonic NOTICE syncd#SAI: :- sendShutdownRequestAfterException: notification send successfully
The fix itself is in the fast-reboot script in sonic-utilities; however, to allow upgrading from a version without the fix, ASIC_DB is also flushed at boot during fast-reboot.

Related to sonic-net/sonic-utilities#3342

- How I did it
Flush ASIC_DB on fast boot.

- How to verify it
Run fast-reboot.

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
In this PR, we add two parameters, STOP_ON_FAILURE: "False" and RETRY_TIMES: 0, to the non-blocking T0 PR checker. This means each script executes only once in a test plan, whether or not it fails, and a failed script does not block the remaining test scripts. The test plan runs all test scripts to completion.
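
To make the effect of the two parameters concrete, here is a small sketch of a runner with the same semantics. It is not the PR checker's code; `run_test_plan` and its arguments are hypothetical and only mirror the STOP_ON_FAILURE and RETRY_TIMES behavior described above.

```python
def run_test_plan(scripts, stop_on_failure=False, retry_times=0):
    """Run (name, callable) pairs in order and return {name: passed}.

    retry_times=0 means each script gets exactly one attempt.
    stop_on_failure=False means a failure never skips later scripts.
    """
    results = {}
    for name, script in scripts:
        passed = False
        for _attempt in range(retry_times + 1):
            try:
                passed = bool(script())
            except Exception:
                # Treat an exception as a failed attempt.
                passed = False
            if passed:
                break
        results[name] = passed
        if not passed and stop_on_failure:
            break  # remaining scripts are not executed
    return results
```

With the defaults, which match the values added in this PR, a failing script is recorded once and the loop moves on to the next script.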
update SAI version to 10.1.25
update dnx debian name by adding "_dnx_" to avoid name conflict with XGS
…lly (sonic-net#19374)

#### Why I did it
src/sonic-swss
```
* 78af15dd - (HEAD -> master, origin/master, origin/HEAD) Use unix socket for redis connection (sonic-net#3190) (2 days ago) [Junchao-Mellanox]
* 4b4aaf91 - [vlanmgr] use IP command for vlan_filtering and no_linklocal_learn, remove echo command (sonic-net#3198) (2 days ago) [Mai Bui]
```
#### How I did it
#### How to verify it
#### Description for the changelog
@Junchao-Mellanox
Owner Author

approved by test
