-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bpi-r3 branch 6.3-rc missing SFP support / transmit queue timeout #103
Comments
I pulled the 6.3-rc branch. compiled and booted from it and noticed the rj45 sfp stopped working. As I didn't have time to play with it and 6.2-rc has been working fine, I just went back to 6.2. |
which SFP do you use exactly? c686900 2023-03-14 net: sfp: support 2.5g copper SFP v3 should be make 2g5 sfp from aliexpress work...if you have the tplink then daniel prepared a quirk-patch for it |
It's the tplink from aliexpress model TL-SM410U I will give the 6.3-rc another try |
Ok,the tp-link needs patch from daniel i have not yet in my tree... |
Oh ok, got confused, do you have plans to add Daniels patch to your 6.3 tree? |
have added the tp-link quirk, but daniel reports that it is not clean working and they search the issue https://www.spinics.net/lists/netdev/msg892902.html btw. there is 6.2-main...better than rc to use but i wonder why 6.2-rc works for you.... maybe you can try to revert the sfp-patch above...possibly it breaks your sfps |
Thanks for the tp-link quirk patch, the SFP is working now in 6.3-rc. I will test the 6.2-main now, I noticed the fiber SFP is not detecting link. |
I couldn't find the 6.2-main branch |
which fibre sfp? mine (H!Fibre in left cage) is working in 6.3-rc sorry about 6.2-main, had not pushed it...done now |
Thanks, I'll pull 6.3-main. I tested two, I can see them but link is not detected Brocade 57-1000013-01 |
6.2-main...6.3 is not final (sry typo from me...rc branches are till the kernel is released - when only rc are available) which slot do you use? ethtool -m eth1/lan4 dmesg |grep -i 'eth\|sfp' what happens when you do the autoneg workaround? ethtool -s eth1 autoneg off |
Ok, my bad, last time I tried to make the SFP work I played too much with it and let auto negotiation disabled on the other side.. so after enabling it I can detect link and tcpdump traffic. Looks fine so far on 6.3-rc. vlan is not working, if I remember correctly left side SFP does not support vlan. |
Yes,there was a patch series from felix fietkau which cannot be applied...thought patch from vladimir fixed it but maybe it was another issue...felix patch disabled hw offloading on the gmac1 |
This should be the patch fixing the vlan-issue on gmac |
i have pushed the vlan-fix to 6.3-pwm-branch you can cherry-pick to any other branch you want...it works so far on my bananapi-r3 and i send it as RFC/RFT to ML https://patchwork.kernel.org/project/linux-mediatek/patch/20230416091038.54479-1-linux@fw-web.de/ |
I just build 6.3-pwm and booted from it and I can see the vlan tags in the packets. Amazing, thanks! |
I hope there will be no sideeffects...i guess the patch disables rx vlan offload completely but better than broken vlan support |
I've ran into this kernel panic twice now. I need to reboot to clear it. You think it's related to the a
|
I think this is related to felix patch adding the transmi queues...but i have no idea how to fix...there is a issue on openwrt github repo |
Do you know a way to trigger this bug? This is the issue on openwrt repo where i sent an update: openwrt/openwrt#12143 (comment) If we know how to trigger it we have a chance to debug it to root cause |
Let issue open for tracing the transmit error |
I don't, but it hasn't crashed today and yesterday when it crashed I was copying big files through SFP1/SFP2. I'll run iperf3 test and see if it crashes again |
I have not seen it with iperf3 yet...thats my standard-test in both directions,but i guess the problem is when there are multiple streams which are spreaded over different tx queues over time |
Have you seen this with 6.2 too? If not you can try the vlan-patch there and look if this is the cause but imho this cannot be the cause as this is not in tx path |
I havent tested with 6.2, can I just checkout latest 6.2-main and build it? I ran into the issue again yesterday and I think the issue has something to do with the right SFP, where I have the copper tplink SFP. I ran into it three times when traffic is flowing as shown below.
|
Is this reproducable for you (e.g. you can trigger it this way)? Is the other end also 2.5g or do you run at lower speed there? Could you try with another sfp (maybe only 1g) in the lan-sfp cage or towards one of the rj45 ports? Basicly you can use every branch to build :) vlanfix is only applied to the last one (6.3-pwm) but for testing the sfp itself other branches can work too,but maybe need patches for supporting the tplink-sfp |
I can trigger it but unsure exactly how.. it just happens. I noticed it doesn't happen when I just stop using the right sfp2/lan4 Both ends on both SFP's are 2.5g I tried a copper 1g SFP on the right sfp2/lan4 and it wont take it seems like it only allows 2.5g modules
|
My fiber sfp works in both cages and this is 1g only ..maybe your 1g module needs a quirk too...daniel reported from some sfps having checksum errors or having invalid data (e.g. supported modes) in eeprom |
Can you trigger the issue when only working over lan4/sfp2 (writing/reading to/from r3). |
I took a while to reply because I was testing. The good news is that I haven't ran into this issue again. I have both SFP's populated with the same modules and same traffic going through them. The only thing that changed is back then I was doing a lot of NTP/GPS/PPS testing, maybe the kernel panic had something to do with it? I can't think of anything else. |
I loaded 6.3.0-bpi-r3-main 24 hours ago and no crashes. When I initially ran into this issue it could crash after couple hours. |
Have you same connections as before (sfps,rj45 ports,vlan)? Have not seen any fix for these problems |
Something definitely changed, but I can't tell what. |
Hi Frank, Can you add the tp-link quirk patch to 6.6.25-bpi-r3-main? Thanks |
Can you make a pull request or point to the patch? |
I'm actually not sure that patch has the fix. The issue this time is the SFP not detecting link.
|
This looks like the main problem for all 2g5 sfp...look what i have done in 6.9-net-next...i guess you need at least erics phylink patch and the tplink quirk let phylink know about 2g5...the phy-mapping i had done for the oem imho does not work for tplink |
Hi Frank,
Testing the 6.3-rc branch I noticed the SFP's stopped working, any fix for this?
Thanks
The text was updated successfully, but these errors were encountered: