dwc_otg: Call usb_hcd_unlink_urb_from_ep with lock held in completion ta... #312

wmdb · 2013-06-17T18:48:25Z

...sklet

This fixes a regression introduced in c4564d4,
usb_hcd_unlink_urb_from_ep must be called with the lock held. Not doing so
resulted in corruption of the list.

popcornmix · 2013-06-17T23:31:10Z

@P33M, @ghollingworth
Happy with this fix?

wmdb · 2013-06-17T23:42:06Z

Yes, we had an application that involved a lot of powering down/replugging USB devices, sometimes in the middle of serving data over a cdc_ether link. We were hitting kernel oopses due to NULL pointer dereferences in this driver quite frequently. These seem to have gone now, we've done a fair bit of stress testing since this patch went in.

wmdb · 2013-06-18T17:19:51Z

It occurred to me that if this problem was occurring at all then there must be contention in another place in the driver as well, this turned out to be correct. However fixing it reintroduced the problem again, so we worked out a better solution that I believe addresses the underlying problem. It turned out that it went deeper than just the tasklet code, so the NULL pointer dereferences were likely being seen even prior to that change, but the tasklet made the situation more likely to occur.

The update is at: wmdb@521f206

Errors closely resembling this problem, and pre-dating c4564d4, were reported here: http://www.raspberrypi.org/phpBB3/viewtopic.php?t=16280

Let me know your views on this, this is my first time posting commits through github, so if I am doing it all wrong please do say so.

Mike

popcornmix · 2013-06-18T17:23:14Z

@wmdb
You can update the pull request by editing the git commit and force pushing (git push -f).
You can edit with "git commit --amend" or "git rebase -i HEAD~" (amongst many other ways...)

ghollingworth · 2013-06-18T17:26:54Z

I'd suggest you also try this with the fiq_split branch because there are other changes there that probably make this bug occur more (the dequeing fix actually means some of those things make themselves through to the task let)

Gordon

wmdb · 2013-06-18T19:41:32Z

@popcornmix
I thought you might shoot me if I suggested forcing a push, but I have done as you suggested :-)

@ghollingworth
I did try testing against fiq_split but there were too many changes needed in our module environment for me to do this in the time available. Looking at it, I did not see evidence of conflicts with fiq_split, and I agree it does seem possible that code in that branch might have triggered the bug as well.

popcornmix · 2013-06-18T20:12:40Z

You've pulled in some unwanted commits.
git rebase -i HEAD~6
And you should be able to fix that up.

wmdb · 2013-06-18T20:46:30Z

rebase upstream/rpi-3.6.y seemed to do the trick, not sure how those commits ended up with new IDs but they are back the way they were now

Mike

P33M · 2013-06-18T21:59:40Z

Kernel docs say that usb_hcd_unlink_urb_from_ep has to be done with hcd lock held and interrupts disabled locally so the existing implementation does need changing.

To be clear, what was the call stack you were seeing with typical OOPSes? I had seen a few with kill_urbs_in_qh_list popping up but the problem was quite hard to reproduce (disconnects with an active device had a smallish chance to do it).

wmdb · 2013-06-18T23:21:17Z

I didn't have the ability to capture or read the full call stack in my setup but the primary message was always the same, "Unable to handle kernel NULL pointer dereference", in usb_hcd_giveback_urb. As you said it could sometimes take a while to replicate but our setup was such that we could spray data at it and just leave it replugging at random intervals, originally it would only take 5 or 10 minutes to reproduce.

I have dealt with a very similar problem in the past with another embedded host driver, killing urbs from a different context at unplug was the cause there, but I can't be certain if it was the same here.

I wasn't clear in saying that the second version of the patch also put the access to completed_urb_list inside the critical section, clearly this was needed on the insertion side, not just in the tasklet. The original version only bracketed the tasklet function, though it did seem to make the problem go away.

But I knew that couldn't be the end of the story and I bracketed the insertion side, but that made it suddenly easy to replicate again. Unlinking the endpoint prior to adding to to a queue that other parts of the driver know nothing about seemed like a good idea, as did ensuring that something else somewhere hadn't already unlinked it.

We've gone through thousands of replugs with this now without any issues.

P33M · 2013-06-19T14:03:30Z

Is this extra locking absolutely necessary within _complete? (line 359) As far as I can tell, this is called from within the driver during which either interrupts are disabled (as in from xfercomp or error handlers) or the spinlock is already held.

Edit: I see what you're saying now - is there evidence that _complete is being called without the lock being held?

Other than that, the rest makes sense to me.

wmdb · 2013-06-20T03:09:25Z

I asked myself the same question and I wasn't sure of the context it was
being called in at the time. In a single processor system like the current
one it may be sufficient just to guarantee IRQs are locked in _complete,
but in the general case I think you would need the lock to be held. It
would be more efficient just to use spin_lock in _complete() if IRQs are
already locked. It is also more efficient just to use spin_lock_irq in the
tasklet as you know IRQs are not locked there (I think I added a comment to
that effect, but there isn't a macro in this driver for that and I kept it
the way it was for consistency and because it is what we had the most
testing time with).

Or something like that, I don't have the code in front of me right now :-)

Mike

On 19 June 2013 07:03, P33M notifications@github.com wrote:

Is this extra locking absolutely necessary within _complete? (line 359) As
far as I can tell, this is called from within the driver during which
either interrupts are disabled (as in from xfercomp or error handlers) or
the spinlock is already held.

Other than that, the rest makes sense to me.

—
Reply to this email directly or view it on GitHubhttps://github.com//pull/312#issuecomment-19685781
.

Mike Bradley
Inca Networks Inc.
+1 604 653 3360 | mike.bradley@incanetworks.com
Skype: wmd_bradley

P33M · 2013-06-20T15:58:29Z

The HCD lock is claimed on entry to the IRQ handler, so all usual calls to _complete are with the lock held. The only potentially concurrency-unsafe use of _complete is in a HCD disconnect (i.e. shutdown) with multiple CPUs present, because for some reason the disconnect interrupt handler will release the lock before doing kill_all_urbs which then _completes them all with an error.

Before the tasklet implementation, the _complete function unlocked before usb_hcd_giveback_urb in case a completion callback resubmitted itself (common for interrupt or isochronous) avoiding a deadlock. With this split off to the tasklet, there's no need as we are in a softirq context with no locks held anyway.

For a minimal fix, all I can see that needs doing is:

Correct flow of usb_hcd_check_unlink_urb / usb_hcd_unlink_urb_from_ep in _complete
removal of usb_hcd_unlink_urb_from_ep from the tasklet and back into _complete where the lock is already held.

Would it be possible for you to test without the extra locking? I would prefer it if the completion path does not have any extra complexity added as this is a significant factor in performance and reliability of isoc transfers.

wmdb · 2013-06-21T06:04:47Z

I agree with this, assuming we have no concerns at this stage about the multi-processor case. The biggest impact comes from unlinking from the endpoint before kicking it over to the tasklet (the original idea of keeping that in the tasklet but with the lock held didn't go far enough).

Do you have a view on the applicability of usb_hcd_check_unlink_urb here? One fear I had with that was that an urb that was for some reason never linked to an endpoint might cause this call to fail. I don't know if such a situation could occur in the completion handler, or if _complete is responsible for giving the urb back if it did.

Since _complete is always run from the IRQ handler, and as far as we know the lock is already held there, I am running stress tests without the DWC_SPINLOCK_IRQSAVE/DWC_SPINUNLOCK_IRQRESTORE in _complete. So far there have been no problems.

I have left the other changes as they are for now, including the unlink check.

P33M · 2013-06-21T10:57:05Z

In theory check_unlink is called during a dequeue to check that we aren't about to unlink a URB that's been completed (and now sitting in the tasklet queue) or previously dequeued. I suspect that it shouldn't be necessary to check for an unlink in _complete but the original code must have had it in for a reason.

… handler usb_hcd_unlink_urb_from_ep must be called with the HCD lock held. Calling it asynchronously in the tasklet was not safe (regression in c4564d4). This change unlinks it from the endpoint prior to queueing it for handling in the tasklet, and also adds a check to ensure the urb is OK to be unlinked before doing so. NULL pointer dereference kernel oopses had been observed in usb_hcd_giveback_urb when a USB device was unplugged/replugged during data transfer. This effect was reproduced using automated USB port power control, hundreds of replug events were performed during active transfers to confirm that the problem was eliminated.

wmdb · 2013-06-21T15:57:57Z

The commit has been updated with those changes, it tested OK overnight (probably 10,000 replugs in that time).

P33M · 2013-06-22T19:12:06Z

Looks good to me.

I had a trawl through the kernel mailing lists and it seems someone has RFC'd a patch for putting a completion tasklet into EHCI drivers - with some similarly game-changing preemption latency speedups on low-speed systems. There's a few things mentioned that make me want to poke other bits of the driver a bit further, but as far as this bugfix is concerned I've got nothing further to add.

dwc_otg: Call usb_hcd_unlink_urb_from_ep with lock held in completion ta...

popcornmix · 2013-06-23T11:02:20Z

Thanks @wmdb and @P33M.

qtfp · 2013-06-27T16:28:40Z

Hey~
I have problem compiling. I compiled with the page for my touchscreen with the page:

http://engineering-diy.blogspot.tw/2013/01/adding-7inch-display-with-touchscreen.html

Then, I met problem with the ftdi ft232 usb-serial converter hang. Then, I rpi-updated to fix the ftdi ft232 hang problem.
Here I cant find a way to fix these two problem the same time.
How can I compile a version with the newest kernel version?
Thx

…_nlattr Otherwise, we hit a NULL pointer deference since handlers always assume default timeout policy is passed. netlink: 24 bytes leftover after parsing attributes in process `syz-executor2'. kasan: CONFIG_KASAN_INLINE enabled kasan: GPF could be caused by NULL-ptr deref or user memory access general protection fault: 0000 [#1] PREEMPT SMP KASAN CPU: 0 PID: 9575 Comm: syz-executor1 Not tainted 4.19.0+ #312 Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011 RIP: 0010:icmp_timeout_obj_to_nlattr+0x77/0x170 net/netfilter/nf_conntrack_proto_icmp.c:297 Fixes: c779e84 ("netfilter: conntrack: remove get_timeout() indirection") Reported-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org>

…_nlattr commit 8866df9 upstream Otherwise, we hit a NULL pointer deference since handlers always assume default timeout policy is passed. netlink: 24 bytes leftover after parsing attributes in process `syz-executor2'. kasan: CONFIG_KASAN_INLINE enabled kasan: GPF could be caused by NULL-ptr deref or user memory access general protection fault: 0000 [#1] PREEMPT SMP KASAN CPU: 0 PID: 9575 Comm: syz-executor1 Not tainted 4.19.0+ #312 Hardware name: Google Google Compute Engine/Google Compute Engine, BIOS Google 01/01/2011 RIP: 0010:icmp_timeout_obj_to_nlattr+0x77/0x170 net/netfilter/nf_conntrack_proto_icmp.c:297 Fixes: c779e84 ("netfilter: conntrack: remove get_timeout() indirection") Reported-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: Pablo Neira Ayuso <pablo@netfilter.org> Signed-off-by: Zubin Mithra <zsm@chromium.org> Signed-off-by: Sasha Levin (Microsoft) <sashal@kernel.org>

…format [ Upstream commit 06d213d ] For incoming SCO connection with transparent coding format, alt setting of CVSD is getting applied instead of Transparent. Before fix: < HCI Command: Accept Synchron.. (0x01|0x0029) plen 21 #2196 [hci0] 321.342548 Address: 1C:CC:D6:E2:EA:80 (Xiaomi Communications Co Ltd) Transmit bandwidth: 8000 Receive bandwidth: 8000 Max latency: 13 Setting: 0x0003 Input Coding: Linear Input Data Format: 1's complement Input Sample Size: 8-bit # of bits padding at MSB: 0 Air Coding Format: Transparent Data Retransmission effort: Optimize for link quality (0x02) Packet type: 0x003f HV1 may be used HV2 may be used HV3 may be used EV3 may be used EV4 may be used EV5 may be used > HCI Event: Command Status (0x0f) plen 4 #2197 [hci0] 321.343585 Accept Synchronous Connection Request (0x01|0x0029) ncmd 1 Status: Success (0x00) > HCI Event: Synchronous Connect Comp.. (0x2c) plen 17 #2198 [hci0] 321.351666 Status: Success (0x00) Handle: 257 Address: 1C:CC:D6:E2:EA:80 (Xiaomi Communications Co Ltd) Link type: eSCO (0x02) Transmission interval: 0x0c Retransmission window: 0x04 RX packet length: 60 TX packet length: 60 Air mode: Transparent (0x03) ........ > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2336 [hci0] 321.383655 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #2337 [hci0] 321.389558 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2338 [hci0] 321.393615 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2339 [hci0] 321.393618 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2340 [hci0] 321.393618 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #2341 [hci0] 321.397070 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2342 [hci0] 321.403622 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2343 [hci0] 321.403625 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2344 [hci0] 321.403625 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2345 [hci0] 321.403625 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #2346 [hci0] 321.404569 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #2347 [hci0] 321.412091 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2348 [hci0] 321.413626 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2349 [hci0] 321.413630 > SCO Data RX: Handle 257 flags 0x00 dlen 48 #2350 [hci0] 321.413630 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #2351 [hci0] 321.419674 After fix: < HCI Command: Accept Synchronou.. (0x01|0x0029) plen 21 #309 [hci0] 49.439693 Address: 1C:CC:D6:E2:EA:80 (Xiaomi Communications Co Ltd) Transmit bandwidth: 8000 Receive bandwidth: 8000 Max latency: 13 Setting: 0x0003 Input Coding: Linear Input Data Format: 1's complement Input Sample Size: 8-bit # of bits padding at MSB: 0 Air Coding Format: Transparent Data Retransmission effort: Optimize for link quality (0x02) Packet type: 0x003f HV1 may be used HV2 may be used HV3 may be used EV3 may be used EV4 may be used EV5 may be used > HCI Event: Command Status (0x0f) plen 4 #310 [hci0] 49.440308 Accept Synchronous Connection Request (0x01|0x0029) ncmd 1 Status: Success (0x00) > HCI Event: Synchronous Connect Complete (0x2c) plen 17 #311 [hci0] 49.449308 Status: Success (0x00) Handle: 257 Address: 1C:CC:D6:E2:EA:80 (Xiaomi Communications Co Ltd) Link type: eSCO (0x02) Transmission interval: 0x0c Retransmission window: 0x04 RX packet length: 60 TX packet length: 60 Air mode: Transparent (0x03) < SCO Data TX: Handle 257 flags 0x00 dlen 60 #312 [hci0] 49.450421 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #313 [hci0] 49.457927 > HCI Event: Max Slots Change (0x1b) plen 3 #314 [hci0] 49.460345 Handle: 256 Max slots: 5 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #315 [hci0] 49.465453 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #316 [hci0] 49.470502 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #317 [hci0] 49.470519 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #318 [hci0] 49.472996 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #319 [hci0] 49.480412 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #320 [hci0] 49.480492 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #321 [hci0] 49.487989 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #322 [hci0] 49.490303 < SCO Data TX: Handle 257 flags 0x00 dlen 60 #323 [hci0] 49.495496 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #324 [hci0] 49.500304 > SCO Data RX: Handle 257 flags 0x00 dlen 60 #325 [hci0] 49.500311 Signed-off-by: Kiran K <kiran.k@intel.com> Signed-off-by: Lokendra Singh <lokendra.singh@intel.com> Signed-off-by: Marcel Holtmann <marcel@holtmann.org> Signed-off-by: Sasha Levin <sashal@kernel.org>

In case when is64 == 1 in emit(A64_REV32(is64, dst, dst), ctx) the generated insn reverses byte order for both high and low 32-bit words, resuling in an incorrect swap as indicated by the jit test: [ 9757.262607] test_bpf: #312 BSWAP 16: 0x0123456789abcdef -> 0xefcd jited:1 8 PASS [ 9757.264435] test_bpf: #313 BSWAP 32: 0x0123456789abcdef -> 0xefcdab89 jited:1 ret 1460850314 != -271733879 (0x5712ce8a != 0xefcdab89)FAIL (1 times) [ 9757.266260] test_bpf: #314 BSWAP 64: 0x0123456789abcdef -> 0x67452301 jited:1 8 PASS [ 9757.268000] test_bpf: #315 BSWAP 64: 0x0123456789abcdef >> 32 -> 0xefcdab89 jited:1 8 PASS [ 9757.269686] test_bpf: #316 BSWAP 16: 0xfedcba9876543210 -> 0x1032 jited:1 8 PASS [ 9757.271380] test_bpf: #317 BSWAP 32: 0xfedcba9876543210 -> 0x10325476 jited:1 ret -1460850316 != 271733878 (0xa8ed3174 != 0x10325476)FAIL (1 times) [ 9757.273022] test_bpf: #318 BSWAP 64: 0xfedcba9876543210 -> 0x98badcfe jited:1 7 PASS [ 9757.274721] test_bpf: #319 BSWAP 64: 0xfedcba9876543210 >> 32 -> 0x10325476 jited:1 9 PASS Fix this by forcing 32bit variant of rev32. Fixes: 1104247 ("bpf, arm64: Support unconditional bswap") Signed-off-by: Artem Savkov <asavkov@redhat.com> Tested-by: Puranjay Mohan <puranjay12@gmail.com> Acked-by: Puranjay Mohan <puranjay12@gmail.com> Acked-by: Xu Kuohai <xukuohai@huawei.com> Message-ID: <20240321081809.158803-1-asavkov@redhat.com> Signed-off-by: Alexei Starovoitov <ast@kernel.org>

[ Upstream commit a51cd6b ] In case when is64 == 1 in emit(A64_REV32(is64, dst, dst), ctx) the generated insn reverses byte order for both high and low 32-bit words, resuling in an incorrect swap as indicated by the jit test: [ 9757.262607] test_bpf: #312 BSWAP 16: 0x0123456789abcdef -> 0xefcd jited:1 8 PASS [ 9757.264435] test_bpf: #313 BSWAP 32: 0x0123456789abcdef -> 0xefcdab89 jited:1 ret 1460850314 != -271733879 (0x5712ce8a != 0xefcdab89)FAIL (1 times) [ 9757.266260] test_bpf: #314 BSWAP 64: 0x0123456789abcdef -> 0x67452301 jited:1 8 PASS [ 9757.268000] test_bpf: #315 BSWAP 64: 0x0123456789abcdef >> 32 -> 0xefcdab89 jited:1 8 PASS [ 9757.269686] test_bpf: #316 BSWAP 16: 0xfedcba9876543210 -> 0x1032 jited:1 8 PASS [ 9757.271380] test_bpf: #317 BSWAP 32: 0xfedcba9876543210 -> 0x10325476 jited:1 ret -1460850316 != 271733878 (0xa8ed3174 != 0x10325476)FAIL (1 times) [ 9757.273022] test_bpf: #318 BSWAP 64: 0xfedcba9876543210 -> 0x98badcfe jited:1 7 PASS [ 9757.274721] test_bpf: #319 BSWAP 64: 0xfedcba9876543210 >> 32 -> 0x10325476 jited:1 9 PASS Fix this by forcing 32bit variant of rev32. Fixes: 1104247 ("bpf, arm64: Support unconditional bswap") Signed-off-by: Artem Savkov <asavkov@redhat.com> Tested-by: Puranjay Mohan <puranjay12@gmail.com> Acked-by: Puranjay Mohan <puranjay12@gmail.com> Acked-by: Xu Kuohai <xukuohai@huawei.com> Message-ID: <20240321081809.158803-1-asavkov@redhat.com> Signed-off-by: Alexei Starovoitov <ast@kernel.org> Signed-off-by: Sasha Levin <sashal@kernel.org>

popcornmix added a commit that referenced this pull request Jun 23, 2013

Merge pull request #312 from wmdb/rpi-3.6.y

3f3284d

dwc_otg: Call usb_hcd_unlink_urb_from_ep with lock held in completion ta...

popcornmix merged commit 3f3284d into raspberrypi:rpi-3.6.y Jun 23, 2013

dwc_otg: Call usb_hcd_unlink_urb_from_ep with lock held in completion ta... #312

dwc_otg: Call usb_hcd_unlink_urb_from_ep with lock held in completion ta... #312

Uh oh!

Conversation

wmdb commented Jun 17, 2013

Uh oh!

popcornmix commented Jun 17, 2013

Uh oh!

wmdb commented Jun 17, 2013

Uh oh!

wmdb commented Jun 18, 2013

Uh oh!

popcornmix commented Jun 18, 2013

Uh oh!

ghollingworth commented Jun 18, 2013

Uh oh!

wmdb commented Jun 18, 2013

Uh oh!

popcornmix commented Jun 18, 2013

Uh oh!

wmdb commented Jun 18, 2013

Uh oh!

P33M commented Jun 18, 2013

Uh oh!

wmdb commented Jun 18, 2013

Uh oh!

P33M commented Jun 19, 2013

Uh oh!

wmdb commented Jun 20, 2013

Uh oh!

P33M commented Jun 20, 2013

Uh oh!

wmdb commented Jun 21, 2013

Uh oh!

P33M commented Jun 21, 2013

Uh oh!

wmdb commented Jun 21, 2013

Uh oh!

P33M commented Jun 22, 2013

Uh oh!

popcornmix commented Jun 23, 2013

Uh oh!

qtfp commented Jun 27, 2013

Uh oh!

Uh oh!