Interface state is 'down' and not going up when adding and removing it from vlan group #5347
Comments
Can you provide the full output of 'show version', including the platform?
@prsunny I have added the platform from 'show version'. This reproduces on several platforms, not only this one.
@itaibaz, could you please take a look at this issue? |
The error seems to be a SAI redis issue rather than a SAI issue, so I don't understand the connection to SAI.
@anshuv-mfst Ethernet104 does not belong to any other VLAN.
In the function PortsOrch::removeBridgePort there is a comment as below:
It seems that the logic to remove the FDB entry before removing the bridge port is missing. I think this is exactly what we have observed: there is still a reference to the port in the DB, so the remove fails. This is unlikely to be limited to certain platforms. @prsunny, what do you think?
I checked which FDB entry was learned:
2020-09-08.15:12:57.786288|n|fdb_event|[{"fdb_entry":"{"bvid":"oid:0x26000000000013","mac":"24:8A:07:3E:0C:86","switch_id":"oid:0x21000000000000"}","fdb_event":"SAI_FDB_EVENT_LEARNED","list":[{"id":"SAI_FDB_ENTRY_ATTR_BRIDGE_PORT_ID","value":"oid:0x3a000000000662"},{"id":"SAI_FDB_ENTRY_ATTR_TYPE","value":"SAI_FDB_ENTRY_TYPE_DYNAMIC"},{"id":"SAI_FDB_ENTRY_ATTR_PACKET_ACTION","value":"SAI_PACKET_ACTION_FORWARD"}]}]|
There is an error: the port was added to VLAN 3, but the PVID was not set, so the PVID remains 1. We work with ingress VLAN filtering on.
It seems odd to me that the PVID wasn't set to 3 when the port was added to VLAN 3.
Bottom line I think there are 2 issues -
@prsunny When we flush the FDB entries corresponding to the bridge port before calling SAI API remove_bridge_port(), the issue is fixed. Doesn't this fix look good? |
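The ordering problem discussed above can be illustrated with a small toy model. This is only a sketch of the reference-counting idea, not the real SAI metadata layer or orchagent code: the class `ToySwitch` and all of its method names are hypothetical, and the only values taken from this issue are the bridge port OID and MAC address from the logs.

```python
class RefCountError(Exception):
    """Raised when an object that is still referenced is removed."""


class ToySwitch:
    """Hypothetical model: a bridge port cannot be removed while FDB entries point at it."""

    def __init__(self):
        self.refcount = {}  # bridge port OID -> number of FDB entries referencing it
        self.fdb = {}       # MAC address -> bridge port OID

    def create_bridge_port(self, oid):
        self.refcount[oid] = 0

    def learn_fdb(self, mac, bridge_port_oid):
        # A re-learned MAC moves its reference instead of adding a second one.
        old = self.fdb.get(mac)
        if old is not None:
            self.refcount[old] -= 1
        self.fdb[mac] = bridge_port_oid
        self.refcount[bridge_port_oid] += 1

    def flush_fdb(self, bridge_port_oid):
        # Drop every FDB entry learned on this bridge port; return how many were dropped.
        stale = [mac for mac, oid in self.fdb.items() if oid == bridge_port_oid]
        for mac in stale:
            del self.fdb[mac]
            self.refcount[bridge_port_oid] -= 1
        return len(stale)

    def remove_bridge_port(self, oid):
        count = self.refcount[oid]
        if count != 0:
            raise RefCountError(
                f"object {oid:#x} reference count is {count}, can't remove"
            )
        del self.refcount[oid]


BRIDGE_PORT = 0x3A000000000662  # OID seen in the issue's logs
MAC = "24:8A:07:3E:0C:86"       # MAC from the learned FDB event

switch = ToySwitch()
switch.create_bridge_port(BRIDGE_PORT)
switch.learn_fdb(MAC, BRIDGE_PORT)

try:
    switch.remove_bridge_port(BRIDGE_PORT)  # fails: an FDB entry still references the port
    removed_without_flush = True
except RefCountError as err:
    removed_without_flush = False
    failure_message = str(err)

flushed = switch.flush_fdb(BRIDGE_PORT)     # the proposed fix: flush first
switch.remove_bridge_port(BRIDGE_PORT)      # now succeeds
```

In this model the first removal fails in the same way as the reported error, and flushing the entries learned on the bridge port before removing it lets the removal go through.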
Sure, could you help provide the fix?
Yes, I have the fix. I have tested it. I will raise a PR shortly. |
@prsunny the PR for the fix is: |
The issue is fixed and the fix is merged, so closing it.
Description
The interface state changes to 'down' and does not come back up after adding an interface to a VLAN group and then removing it.
This issue is not easy to reproduce. It occurs randomly after repeating the steps below several times or more.
Restoring the interface requires a config reload or a reboot.
Steps to reproduce the issue:
Repeat the following until the issue occurs:
Describe the results you received:
On system log:
NOTICE swss#orchagent: :- removeVlanMember: Remove member Ethernet104 from VLAN Vlan3 lid:3 vmid:27000000000663
NOTICE swss#orchagent: :- setHostIntfsStripTag: Set SAI_HOSTIF_VLAN_TAG_STRIP to host interface: Ethernet104
ERR swss#orchagent: :- meta_generic_validation_remove: object 0x3a000000000662 reference count is 1, can't remove
ERR swss#orchagent: :- removeBridgePort: Failed to remove bridge port Ethernet104 from default 1Q bridge, rv:-5
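The object ID in the "can't remove" error is the same as the bridge port ID in the learned FDB event quoted in the comments. A minimal sketch for checking that correlation in collected logs is shown here. The helper names `blocked_object` and `fdb_bridge_ports` are hypothetical, and the regular expressions assume the line formats exactly as they appear in this issue.

```python
import re

# Matches the metadata validation error as it appears in this issue's syslog.
ERR_RE = re.compile(r"object (0x[0-9a-fA-F]+) reference count is (\d+), can't remove")

# Matches the bridge port attribute inside an fdb_event record.
FDB_PORT_RE = re.compile(
    r'"SAI_FDB_ENTRY_ATTR_BRIDGE_PORT_ID","value":"oid:(0x[0-9a-fA-F]+)"'
)


def blocked_object(syslog_line):
    """Return (object_id, reference_count) from an error line, or None if it does not match."""
    match = ERR_RE.search(syslog_line)
    if match is None:
        return None
    return int(match.group(1), 16), int(match.group(2))


def fdb_bridge_ports(record):
    """Return the set of bridge port OIDs referenced by an fdb_event record."""
    return {int(oid, 16) for oid in FDB_PORT_RE.findall(record)}


# Sample lines copied from this issue (the FDB record is shortened to the relevant attribute).
err_line = (
    "ERR swss#orchagent: :- meta_generic_validation_remove: "
    "object 0x3a000000000662 reference count is 1, can't remove"
)
fdb_record = (
    '"fdb_event":"SAI_FDB_EVENT_LEARNED","list":['
    '{"id":"SAI_FDB_ENTRY_ATTR_BRIDGE_PORT_ID","value":"oid:0x3a000000000662"}]'
)

oid, refs = blocked_object(err_line)
still_referenced_by_fdb = oid in fdb_bridge_ports(fdb_record)
```

With the two sample lines, the blocked object is found among the bridge ports referenced by learned FDB entries, which is consistent with the missing FDB flush discussed in the comments.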
Describe the results you expected:
No error should occur with this flow, and the interface should come back up.
Output of 'show version':
syslog.txt
sai_sdk_dump_09_08_2020_03_23_PM.gz
saidump.txt
full techsupport output:
mstdump.zip
proc.zip
sai_sdk_dump.zip
hw-mgmt.zip
log.zip
log2.zip
log3.zip
dump.zip
etc.zip