Bug #1371233 “USB 3.0 connection is unreliable + xHCI xhci_drop_...” : Bugs : linux package : Ubuntu

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-18:

#1

AlsaInfo.txt Edit (30.0 KiB, text/plain; charset="utf-8")
BootDmesg.txt Edit (133.8 KiB, text/plain; charset="utf-8")
CRDA.txt Edit (257 bytes, text/plain; charset="utf-8")
CurrentDmesg.txt Edit (10.1 KiB, text/plain; charset="utf-8")
Dependencies.txt Edit (2.9 KiB, text/plain; charset="utf-8")
IwConfig.txt Edit (613 bytes, text/plain; charset="utf-8")
Lspci.txt Edit (9.4 KiB, text/plain; charset="utf-8")
Lsusb.txt Edit (1.2 KiB, text/plain; charset="utf-8")
ProcCpuinfo.txt Edit (7.5 KiB, text/plain; charset="utf-8")
ProcInterrupts.txt Edit (3.3 KiB, text/plain; charset="utf-8")
ProcModules.txt Edit (7.1 KiB, text/plain; charset="utf-8")
PulseList.txt Edit (10.0 KiB, text/plain; charset="utf-8")
RfKill.txt Edit (246 bytes, text/plain; charset="utf-8")
UdevDb.txt Edit (262.6 KiB, text/plain; charset="utf-8")
UdevLog.txt Edit (494.6 KiB, text/plain; charset="utf-8")
WifiSyslog.txt Edit (741.3 KiB, text/plain; charset="utf-8")

Revision history for this message

Brad Figg (brad-figg) wrote on 2014-09-18: Status changed to Confirmed

#2

This change was made by a bot.

Changed in linux (Ubuntu):
status:	New → Confirmed

Karl-Philipp Richter (krichter722) on 2014-09-18

description:

updated

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2014-09-18:

#3

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.17 kernel[0].

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

[0] http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.17-rc5-utopic/

Changed in linux (Ubuntu):
importance:	Undecided → Medium
tags:	added: kernel-da-key

Joseph Salisbury (jsalisbury) on 2014-09-18

Changed in linux (Ubuntu):
importance:	Medium → High

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-18:

#4

Tested in 3.17-rc5 and experienced the issue. Due to the high number of connection failures one partition used for reproduction has vanished, but this shouldn't matter because data rescue in gparted has been a main reproduction scenario. I had to remove apt packages `multipath-*` in order to make all devices on USB ports being recognized.

penalvch (penalvch) on 2014-09-19

tags:	added: latest-bios-1.21
tags:	added: kernel-bug-exists-upstream-3.15-rc7 utopic

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2014-09-19:

#5

This issue appears to be an upstream bug, since you tested the latest upstream kernel. Would it be possible for you to open an upstream bug report[0]? That will allow the upstream Developers to examine the issue, and may provide a quicker resolution to the bug.

Please follow the instructions on the wiki page[0]. The first step is to email the appropriate mailing list. If no response is received, then a bug may be opened on bugzilla.kernel.org.

Once this bug is reported upstream, please add the tag: 'kernel-bug-reported-upstream'.

[0] https://wiki.ubuntu.com/Bugs/Upstream/kernel

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2014-09-19:

#6

Also, was there a prior kernel version that did not exhibit this bug? If there is, we can perform a bisect to identify the commit that introduced this.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-24:

#7

After obtaining an Orico A3H7 USB 3.0 hub with (sufficient) power supply the reproducability of the bug got limited to the broken HDD (reproducability with it is 100 % (disk is recognized by gparted and doesn't have a /dev/sdxY file)). Failure of of ethernet adapter no longer occurs which makes me think the original issue was related to insufficient power on USB connections and that the error `xHCI xhci_drop_endpoint called with disabled ep` is related rather to a consequence of the power failure rather than being the error of the power failure.

In my point of view it'd make sense to get the HDD working again (there's probably an issue with the partition table) and testing whether it works in 3.13.0-36 and then other versions (including mainline).

Sorry for the delay of my response.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-24:

#8

At the same time of upgrade from 13.10 to 14.04 I changed to btrfs which causes incredible trouble (it can't be said often enough that it's irresponsible to offer it as root filesystem without(!) a warning), so there're so many possible reasons for the bug (most of them are bugs for themselves) that for a non-developer it's impossible to provide accurate reports.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-24:

#9

Found a working kernel: on Ubuntu 14.04 live system with 3.13.0-24-generic I can run ddrescue which found > 50 errors on the HDD so far, but the error doesn't occur and ddrescue proceed - slowly, but at least it does! Here's the thing: running the 3.13.0-24-generic kernel on Ubuntu installation doesn't work, so that I assume it's not just a kernel issue or not a kernel issue at all, but related to an upgraded apt package. How to proceed? How is bisecting done?

Revision history for this message

legolas558 (legolas558) wrote on 2014-09-27:

#10

Running kernel 3.13.0-35-generic here.
I was affected by the same issue. Using "echo -1 >/sys/module/usbcore/parameters/autosuspend" seemed to help a bit, although the reset entries "xHCI xhci_drop_endpoint called with disabled ep *******" were still there and system was blocking because of these continuous drops, leading to an unreliable (and incredibly slow) writing to the disk.

The best way to trigger the issue was for example to run an rsync reading from one partition on the external disk and writing to another partition on the same disk.

Plugging the disk with an USB 2.0 cable would show no problems at all with same tests.

After plugging the external disk to a an USB port on the back, I can confirm it works perfectly (and still in high speed mode, "new SuperSpeed USB device number 2 using xhci_hcd").

Upon inspection, I found that the USB 3.0 cable connecting the front USB ports to the motherboard was a bit loose at connecting the USB 3.0 part of it. USB 2.0 traffic would always work perfectly fine but problems would arise when estabilishing 3.0 links.

So, long story short, please check other USB 3.0 ports and that those you are using are well connected. Although I was suspecting the PSU of the external disk (12V 1.5A) and a bug in the kernel (as I read here and elsewhere), in the end it was just a cable problem.

Hope it helps.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-28:

#11

photo of kernel panic stack trace; ddrescue log file Edit (1.1 MiB, application/octet-stream)

Thanks @legolas558, I checked all cables and exchanged most of my USB equipment (hubs and cables). I tested in combinations which would reveal a faulty USB part. Inspired by your comment I opened the machine and checked the internal cable connections as well. They're all fine.

I `ddrescue` test case causes a kernel panic in 3.17-rc6, now, photo attached[1]. This occurs ~90 % of the time. I attach the log file of `ddrescue` 1.7.0 as well[2], maybe it can serve to produce a test file for you. It definitely works in the non-persistent live system based on 3.13.0-24-generic.

---
[1] I'm currently stuck with `linux-crashtools`, please help me http://askubuntu.com/questions/529452/how-to-cause-a-test-crash-with-kdump for a text output of the panic stack.
[2] Although the wiki states no compressed attachements, there's no way of figuring out how to attach a second file...

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-09-30:

#12

Panic confirmed on 3.17-rc7.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-11-15:

#13

Download full text (24.2 KiB)

In the meantime I tested with 3.2.64, 3.4.104, 3.10.60 and 3.12.32 and had no problem, i.e. GNU `ddrescue ` 1.17 processes the device without kernel panic (1.5 TB with > 7000 errors recognized and copied on a damaged device).

In 3.14.24 I don't get a kernel panic, but `ddrescue` get stuck at reading a damaged block while `dmesg` shows

    [ 2074.174135] sdh: sdh1 sdh9
    [ 2462.587818] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
    [ 2462.603164] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
    [ 2462.603168] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
    [ 2502.601314] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
    [ 2502.616594] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
    [ 2502.616602] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
    [ 2647.252571] INFO: task usb-storage:604 blocked for more than 120 seconds.
    [ 2647.252580] Tainted: PF W O 3.14.24-031424-generic #201411141736
    [ 2647.252591] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2647.252593] usb-storage D ffffffff81811ae0 0 604 2 0x00000000
    [ 2647.252596] ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
    [ 2647.252599] 0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
    [ 2647.252601] 0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
    [ 2647.252603] Call Trace:
    [ 2647.252609] [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2647.252612] [<ffffffff817799e5>] schedule_timeout+0x1e5/0x250
    [ 2647.252616] [<ffffffff8156bd58>] ? usb_hcd_submit_urb+0x88/0x1b0
    [ 2647.252618] [<ffffffff8177b9d7>] wait_for_completion+0xa7/0x160
    [ 2647.252620] [<ffffffff8156cece>] ? usb_alloc_urb+0x1e/0x50
    [ 2647.252624] [<ffffffff810a4da0>] ? try_to_wake_up+0x210/0x210
    [ 2647.252626] [<ffffffff8156f14a>] usb_sg_wait+0x13a/0x1f0
    [ 2647.252646] [<ffffffffa019f531>] usb_stor_bulk_transfer_sglist.part.5+0x51/0xc0 [usb_storage]
    [ 2647.252651] [<ffffffffa019f637>] usb_stor_bulk_transfer_sglist+0x97/0xa0 [usb_storage]
    [ 2647.252655] [<ffffffffa019f66e>] usb_stor_bulk_srb+0x2e/0x50 [usb_storage]
    [ 2647.252659] [<ffffffffa019f7d7>] usb_stor_Bulk_transport+0x147/0x3f0 [usb_storage]
    [ 2647.252662] [<ffffffff817799e5>] ? schedule_timeout+0x1e5/0x250
    [ 2647.252666] [<ffffffffa01a006e>] usb_stor_invoke_transport+0x3e/0x570 [usb_storage]
    [ 2647.252668] [<ffffffff8177b1bd>] ? wait_for_completion_interruptible+0xcd/0x1c0
    [ 2647.252672] [<ffffffffa019ee5e>] usb_stor_transparent_scsi_command+0xe/0x10 [usb_storage]
    [ 2647.252676] [<ffffffffa01a172a>] usb_stor_control_thread+0x1ba/0x310 [usb_storage]
    [ 2647.252681] [<ffffffffa01a1570>] ? fill_inquiry_response+0x20/0x20 [usb_storage]
    [ 2647.252683] [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2647.252685] [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2647.252687] [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2647....

In the meantime I tested with 3.2.64, 3.4.104, 3.10.60 and 3.12.32 and had no problem, i.e. GNU `ddrescue ` 1.17 processes the device without kernel panic (1.5 TB with > 7000 errors recognized and copied on a damaged device).

In 3.14.24 I don't get a kernel panic, but `ddrescue` get stuck at reading a damaged block while `dmesg` shows

[ 2074.174135]  sdh: sdh1 sdh9
    [ 2462.587818] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
    [ 2462.603164] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
    [ 2462.603168] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
    [ 2502.601314] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
    [ 2502.616594] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
    [ 2502.616602] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
    [ 2647.252571] INFO: task usb-storage:604 blocked for more than 120 seconds.
    [ 2647.252580]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2647.252591] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2647.252593] usb-storage     D ffffffff81811ae0     0   604      2 0x00000000
    [ 2647.252596]  ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
    [ 2647.252599]  0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
    [ 2647.252601]  0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
    [ 2647.252603] Call Trace:
    [ 2647.252609]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2647.252612]  [<ffffffff817799e5>] schedule_timeout+0x1e5/0x250
    [ 2647.252616]  [<ffffffff8156bd58>] ? usb_hcd_submit_urb+0x88/0x1b0
    [ 2647.252618]  [<ffffffff8177b9d7>] wait_for_completion+0xa7/0x160
    [ 2647.252620]  [<ffffffff8156cece>] ? usb_alloc_urb+0x1e/0x50
    [ 2647.252624]  [<ffffffff810a4da0>] ? try_to_wake_up+0x210/0x210
    [ 2647.252626]  [<ffffffff8156f14a>] usb_sg_wait+0x13a/0x1f0
    [ 2647.252646]  [<ffffffffa019f531>] usb_stor_bulk_transfer_sglist.part.5+0x51/0xc0 [usb_storage]
    [ 2647.252651]  [<ffffffffa019f637>] usb_stor_bulk_transfer_sglist+0x97/0xa0 [usb_storage]
    [ 2647.252655]  [<ffffffffa019f66e>] usb_stor_bulk_srb+0x2e/0x50 [usb_storage]
    [ 2647.252659]  [<ffffffffa019f7d7>] usb_stor_Bulk_transport+0x147/0x3f0 [usb_storage]
    [ 2647.252662]  [<ffffffff817799e5>] ? schedule_timeout+0x1e5/0x250
    [ 2647.252666]  [<ffffffffa01a006e>] usb_stor_invoke_transport+0x3e/0x570 [usb_storage]
    [ 2647.252668]  [<ffffffff8177b1bd>] ? wait_for_completion_interruptible+0xcd/0x1c0
    [ 2647.252672]  [<ffffffffa019ee5e>] usb_stor_transparent_scsi_command+0xe/0x10 [usb_storage]
    [ 2647.252676]  [<ffffffffa01a172a>] usb_stor_control_thread+0x1ba/0x310 [usb_storage]
    [ 2647.252681]  [<ffffffffa01a1570>] ? fill_inquiry_response+0x20/0x20 [usb_storage]
    [ 2647.252683]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2647.252685]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2647.252687]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2647.252689]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2767.388999] INFO: task scsi_eh_10:602 blocked for more than 120 seconds.
    [ 2767.389003]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2767.389004] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2767.389005] scsi_eh_10      D ffffffff81811ae0     0   602      2 0x00000000
    [ 2767.389008]  ffff88041cb51c78 0000000000000046 0000000000000000 ffff88041cb51fd8
    [ 2767.389011]  0000000000014540 0000000000014540 ffff880428e693e0 ffff88041cb093e0
    [ 2767.389013]  000000000000000e ffff880422b9d6f0 ffff880422b9d6f4 00000000ffffffff
    [ 2767.389015] Call Trace:
    [ 2767.389020]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2767.389022]  [<ffffffff8177aa5e>] schedule_preempt_disabled+0xe/0x10
    [ 2767.389024]  [<ffffffff8177c894>] __mutex_lock_slowpath+0x114/0x1b0
    [ 2767.389026]  [<ffffffff8177c953>] mutex_lock+0x23/0x37
    [ 2767.389043]  [<ffffffffa019e9bb>] device_reset+0x2b/0x60 [usb_storage]
    [ 2767.389046]  [<ffffffff815077ee>] scsi_try_bus_device_reset+0x2e/0x60
    [ 2767.389047]  [<ffffffff8150a37f>] scsi_eh_bus_device_reset+0xdf/0x270
    [ 2767.389049]  [<ffffffff8150a663>] ? scsi_eh_stu+0x153/0x280
    [ 2767.389051]  [<ffffffff8150a7de>] scsi_eh_ready_devs+0x4e/0xa0
    [ 2767.389053]  [<ffffffff8150b81d>] scsi_unjam_host+0x10d/0x1f0
    [ 2767.389055]  [<ffffffff8150ba65>] scsi_error_handler+0x165/0x1d0
    [ 2767.389057]  [<ffffffff8150b900>] ? scsi_unjam_host+0x1f0/0x1f0
    [ 2767.389060]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2767.389061]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2767.389064]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2767.389066]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2767.389067] INFO: task usb-storage:604 blocked for more than 120 seconds.
    [ 2767.389068]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2767.389069] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2767.389070] usb-storage     D ffffffff81811ae0     0   604      2 0x00000000
    [ 2767.389072]  ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
    [ 2767.389073]  0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
    [ 2767.389075]  0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
    [ 2767.389077] Call Trace:
    [ 2767.389079]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2767.389081]  [<ffffffff817799e5>] schedule_timeout+0x1e5/0x250
    [ 2767.389085]  [<ffffffff8156bd58>] ? usb_hcd_submit_urb+0x88/0x1b0
    [ 2767.389086]  [<ffffffff8177b9d7>] wait_for_completion+0xa7/0x160
    [ 2767.389088]  [<ffffffff8156cece>] ? usb_alloc_urb+0x1e/0x50
    [ 2767.389092]  [<ffffffff810a4da0>] ? try_to_wake_up+0x210/0x210
    [ 2767.389093]  [<ffffffff8156f14a>] usb_sg_wait+0x13a/0x1f0
    [ 2767.389098]  [<ffffffffa019f531>] usb_stor_bulk_transfer_sglist.part.5+0x51/0xc0 [usb_storage]
    [ 2767.389102]  [<ffffffffa019f637>] usb_stor_bulk_transfer_sglist+0x97/0xa0 [usb_storage]
    [ 2767.389106]  [<ffffffffa019f66e>] usb_stor_bulk_srb+0x2e/0x50 [usb_storage]
    [ 2767.389110]  [<ffffffffa019f7d7>] usb_stor_Bulk_transport+0x147/0x3f0 [usb_storage]
    [ 2767.389112]  [<ffffffff817799e5>] ? schedule_timeout+0x1e5/0x250
    [ 2767.389116]  [<ffffffffa01a006e>] usb_stor_invoke_transport+0x3e/0x570 [usb_storage]
    [ 2767.389118]  [<ffffffff8177b1bd>] ? wait_for_completion_interruptible+0xcd/0x1c0
    [ 2767.389122]  [<ffffffffa019ee5e>] usb_stor_transparent_scsi_command+0xe/0x10 [usb_storage]
    [ 2767.389126]  [<ffffffffa01a172a>] usb_stor_control_thread+0x1ba/0x310 [usb_storage]
    [ 2767.389131]  [<ffffffffa01a1570>] ? fill_inquiry_response+0x20/0x20 [usb_storage]
    [ 2767.389132]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2767.389134]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2767.389136]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2767.389137]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2887.525506] INFO: task scsi_eh_10:602 blocked for more than 120 seconds.
    [ 2887.525515]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2887.525517] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2887.525521] scsi_eh_10      D ffffffff81811ae0     0   602      2 0x00000000
    [ 2887.525528]  ffff88041cb51c78 0000000000000046 0000000000000000 ffff88041cb51fd8
    [ 2887.525548]  0000000000014540 0000000000014540 ffff880428e693e0 ffff88041cb093e0
    [ 2887.525550]  000000000000000e ffff880422b9d6f0 ffff880422b9d6f4 00000000ffffffff
    [ 2887.525552] Call Trace:
    [ 2887.525558]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2887.525560]  [<ffffffff8177aa5e>] schedule_preempt_disabled+0xe/0x10
    [ 2887.525562]  [<ffffffff8177c894>] __mutex_lock_slowpath+0x114/0x1b0
    [ 2887.525564]  [<ffffffff8177c953>] mutex_lock+0x23/0x37
    [ 2887.525583]  [<ffffffffa019e9bb>] device_reset+0x2b/0x60 [usb_storage]
    [ 2887.525586]  [<ffffffff815077ee>] scsi_try_bus_device_reset+0x2e/0x60
    [ 2887.525588]  [<ffffffff8150a37f>] scsi_eh_bus_device_reset+0xdf/0x270
    [ 2887.525590]  [<ffffffff8150a663>] ? scsi_eh_stu+0x153/0x280
    [ 2887.525592]  [<ffffffff8150a7de>] scsi_eh_ready_devs+0x4e/0xa0
    [ 2887.525594]  [<ffffffff8150b81d>] scsi_unjam_host+0x10d/0x1f0
    [ 2887.525596]  [<ffffffff8150ba65>] scsi_error_handler+0x165/0x1d0
    [ 2887.525598]  [<ffffffff8150b900>] ? scsi_unjam_host+0x1f0/0x1f0
    [ 2887.525602]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2887.525604]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2887.525606]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2887.525608]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2887.525610] INFO: task usb-storage:604 blocked for more than 120 seconds.
    [ 2887.525611]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2887.525612] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2887.525613] usb-storage     D ffffffff81811ae0     0   604      2 0x00000000
    [ 2887.525615]  ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
    [ 2887.525617]  0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
    [ 2887.525619]  0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
    [ 2887.525621] Call Trace:
    [ 2887.525622]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2887.525626]  [<ffffffff817799e5>] schedule_timeout+0x1e5/0x250
    [ 2887.525629]  [<ffffffff8156bd58>] ? usb_hcd_submit_urb+0x88/0x1b0
    [ 2887.525631]  [<ffffffff8177b9d7>] wait_for_completion+0xa7/0x160
    [ 2887.525633]  [<ffffffff8156cece>] ? usb_alloc_urb+0x1e/0x50
    [ 2887.525636]  [<ffffffff810a4da0>] ? try_to_wake_up+0x210/0x210
    [ 2887.525638]  [<ffffffff8156f14a>] usb_sg_wait+0x13a/0x1f0
    [ 2887.525643]  [<ffffffffa019f531>] usb_stor_bulk_transfer_sglist.part.5+0x51/0xc0 [usb_storage]
    [ 2887.525648]  [<ffffffffa019f637>] usb_stor_bulk_transfer_sglist+0x97/0xa0 [usb_storage]
    [ 2887.525652]  [<ffffffffa019f66e>] usb_stor_bulk_srb+0x2e/0x50 [usb_storage]
    [ 2887.525656]  [<ffffffffa019f7d7>] usb_stor_Bulk_transport+0x147/0x3f0 [usb_storage]
    [ 2887.525658]  [<ffffffff817799e5>] ? schedule_timeout+0x1e5/0x250
    [ 2887.525663]  [<ffffffffa01a006e>] usb_stor_invoke_transport+0x3e/0x570 [usb_storage]
    [ 2887.525665]  [<ffffffff8177b1bd>] ? wait_for_completion_interruptible+0xcd/0x1c0
    [ 2887.525669]  [<ffffffffa019ee5e>] usb_stor_transparent_scsi_command+0xe/0x10 [usb_storage]
    [ 2887.525674]  [<ffffffffa01a172a>] usb_stor_control_thread+0x1ba/0x310 [usb_storage]
    [ 2887.525678]  [<ffffffffa01a1570>] ? fill_inquiry_response+0x20/0x20 [usb_storage]
    [ 2887.525680]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 2887.525682]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2887.525684]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 2887.525685]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 2887.525716] INFO: task pool:14006 blocked for more than 120 seconds.
    [ 2887.525717]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 2887.525718] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 2887.525719] pool            D ffffffff81811ae0     0 14006  10901 0x00000000
    [ 2887.525720]  ffff880169c3d778 0000000000000082 0000000000000000 ffff880169c3dfd8
    [ 2887.525722]  0000000000014540 0000000000014540 ffff880428e6cf80 ffff880054a11dd0
    [ 2887.525724]  ffff880169c3d778 ffff88043f3d4e20 ffff880054a11dd0 ffffffff81162050
    [ 2887.525726] Call Trace:
    [ 2887.525730]  [<ffffffff81162050>] ? __lock_page+0x70/0x70
    [ 2887.525731]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 2887.525733]  [<ffffffff8177a80f>] io_schedule+0x8f/0xd0
    [ 2887.525735]  [<ffffffff8116205e>] sleep_on_page+0xe/0x20
    [ 2887.525737]  [<ffffffff8177aed2>] __wait_on_bit+0x62/0x90
    [ 2887.525739]  [<ffffffff811621c0>] wait_on_page_bit+0x80/0x90
    [ 2887.525742]  [<ffffffff810b74d0>] ? wake_atomic_t_function+0x40/0x40
    [ 2887.525758]  [<ffffffffa02c008a>] read_extent_buffer_pages+0x2da/0x310 [btrfs]
    [ 2887.525760]  [<ffffffff81163064>] ? add_to_page_cache_lru+0x34/0x50
    [ 2887.525770]  [<ffffffffa02948d0>] ? verify_parent_transid+0x170/0x170 [btrfs]
    [ 2887.525779]  [<ffffffffa0296956>] btree_read_extent_buffer_pages.constprop.126+0xb6/0x120 [btrfs]
    [ 2887.525789]  [<ffffffffa02982b3>] read_tree_block+0x43/0x70 [btrfs]
    [ 2887.525797]  [<ffffffffa0279160>] read_block_for_search.isra.41+0x150/0x1d0 [btrfs]
    [ 2887.525804]  [<ffffffffa027b484>] btrfs_search_slot+0x304/0x830 [btrfs]
    [ 2887.525813]  [<ffffffffa02937af>] btrfs_lookup_inode+0x2f/0xa0 [btrfs]
    [ 2887.525824]  [<ffffffffa02a3a3c>] btrfs_read_locked_inode+0x7c/0x610 [btrfs]
    [ 2887.525827]  [<ffffffff811f0c6b>] ? inode_sb_list_add+0x5b/0x70
    [ 2887.525829]  [<ffffffff811f25d6>] ? iget5_locked+0x1d6/0x200
    [ 2887.525838]  [<ffffffffa02a1380>] ? btrfs_readpage+0x30/0x30 [btrfs]
    [ 2887.525848]  [<ffffffffa02aaa68>] btrfs_iget+0x78/0xf0 [btrfs]
    [ 2887.525858]  [<ffffffffa02ab14b>] btrfs_lookup_dentry+0x24b/0x280 [btrfs]
    [ 2887.525860]  [<ffffffff811ef9da>] ? __d_alloc+0x14a/0x180
    [ 2887.525869]  [<ffffffffa02ab196>] btrfs_lookup+0x16/0x40 [btrfs]
    [ 2887.525871]  [<ffffffff811e15ad>] lookup_real+0x1d/0x60
    [ 2887.525873]  [<ffffffff811e1bd8>] __lookup_hash+0x38/0x50
    [ 2887.525876]  [<ffffffff81769acc>] lookup_slow+0x45/0xab
    [ 2887.525878]  [<ffffffff811e44f0>] path_lookupat+0x6e0/0x710
    [ 2887.525880]  [<ffffffff811e2de0>] ? getname_flags.part.18+0x30/0x140
    [ 2887.525881]  [<ffffffff811e2de0>] ? getname_flags.part.18+0x30/0x140
    [ 2887.525884]  [<ffffffff811e4554>] filename_lookup+0x34/0xc0
    [ 2887.525885]  [<ffffffff811e2f56>] ? getname_flags+0x66/0x80
    [ 2887.525888]  [<ffffffff811e7ed9>] user_path_at_empty+0x59/0xa0
    [ 2887.525890]  [<ffffffff811e2d86>] ? final_putname+0x26/0x50
    [ 2887.525891]  [<ffffffff811e3059>] ? putname+0x29/0x40
    [ 2887.525893]  [<ffffffff811e7ee3>] ? user_path_at_empty+0x63/0xa0
    [ 2887.525895]  [<ffffffff811e7f31>] user_path_at+0x11/0x20
    [ 2887.525898]  [<ffffffff811dc911>] vfs_fstatat+0x51/0xb0
    [ 2887.525900]  [<ffffffff811dc9be>] vfs_lstat+0x1e/0x20
    [ 2887.525903]  [<ffffffff811dc9d5>] SYSC_newlstat+0x15/0x30
    [ 2887.525904]  [<ffffffff811dcc2b>] ? SyS_readlinkat+0x4b/0x120
    [ 2887.525906]  [<ffffffff811dcbbe>] SyS_newlstat+0xe/0x10
    [ 2887.525908]  [<ffffffff8178766d>] system_call_fastpath+0x1a/0x1f
    [ 3007.661983] INFO: task scsi_eh_10:602 blocked for more than 120 seconds.
    [ 3007.661987]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 3007.661988] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 3007.661990] scsi_eh_10      D ffffffff81811ae0     0   602      2 0x00000000
    [ 3007.661993]  ffff88041cb51c78 0000000000000046 0000000000000000 ffff88041cb51fd8
    [ 3007.661996]  0000000000014540 0000000000014540 ffff880428e693e0 ffff88041cb093e0
    [ 3007.661998]  000000000000000e ffff880422b9d6f0 ffff880422b9d6f4 00000000ffffffff
    [ 3007.662000] Call Trace:
    [ 3007.662006]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 3007.662008]  [<ffffffff8177aa5e>] schedule_preempt_disabled+0xe/0x10
    [ 3007.662010]  [<ffffffff8177c894>] __mutex_lock_slowpath+0x114/0x1b0
    [ 3007.662012]  [<ffffffff8177c953>] mutex_lock+0x23/0x37
    [ 3007.662031]  [<ffffffffa019e9bb>] device_reset+0x2b/0x60 [usb_storage]
    [ 3007.662034]  [<ffffffff815077ee>] scsi_try_bus_device_reset+0x2e/0x60
    [ 3007.662036]  [<ffffffff8150a37f>] scsi_eh_bus_device_reset+0xdf/0x270
    [ 3007.662038]  [<ffffffff8150a663>] ? scsi_eh_stu+0x153/0x280
    [ 3007.662040]  [<ffffffff8150a7de>] scsi_eh_ready_devs+0x4e/0xa0
    [ 3007.662042]  [<ffffffff8150b81d>] scsi_unjam_host+0x10d/0x1f0
    [ 3007.662044]  [<ffffffff8150ba65>] scsi_error_handler+0x165/0x1d0
    [ 3007.662046]  [<ffffffff8150b900>] ? scsi_unjam_host+0x1f0/0x1f0
    [ 3007.662049]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 3007.662051]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 3007.662054]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 3007.662056]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 3007.662058] INFO: task usb-storage:604 blocked for more than 120 seconds.
    [ 3007.662059]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 3007.662060] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 3007.662061] usb-storage     D ffffffff81811ae0     0   604      2 0x00000000
    [ 3007.662063]  ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
    [ 3007.662065]  0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
    [ 3007.662067]  0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
    [ 3007.662068] Call Trace:
    [ 3007.662070]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 3007.662074]  [<ffffffff817799e5>] schedule_timeout+0x1e5/0x250
    [ 3007.662077]  [<ffffffff8156bd58>] ? usb_hcd_submit_urb+0x88/0x1b0
    [ 3007.662079]  [<ffffffff8177b9d7>] wait_for_completion+0xa7/0x160
    [ 3007.662081]  [<ffffffff8156cece>] ? usb_alloc_urb+0x1e/0x50
    [ 3007.662084]  [<ffffffff810a4da0>] ? try_to_wake_up+0x210/0x210
    [ 3007.662086]  [<ffffffff8156f14a>] usb_sg_wait+0x13a/0x1f0
    [ 3007.662091]  [<ffffffffa019f531>] usb_stor_bulk_transfer_sglist.part.5+0x51/0xc0 [usb_storage]
    [ 3007.662096]  [<ffffffffa019f637>] usb_stor_bulk_transfer_sglist+0x97/0xa0 [usb_storage]
    [ 3007.662100]  [<ffffffffa019f66e>] usb_stor_bulk_srb+0x2e/0x50 [usb_storage]
    [ 3007.662104]  [<ffffffffa019f7d7>] usb_stor_Bulk_transport+0x147/0x3f0 [usb_storage]
    [ 3007.662107]  [<ffffffff817799e5>] ? schedule_timeout+0x1e5/0x250
    [ 3007.662111]  [<ffffffffa01a006e>] usb_stor_invoke_transport+0x3e/0x570 [usb_storage]
    [ 3007.662113]  [<ffffffff8177b1bd>] ? wait_for_completion_interruptible+0xcd/0x1c0
    [ 3007.662118]  [<ffffffffa019ee5e>] usb_stor_transparent_scsi_command+0xe/0x10 [usb_storage]
    [ 3007.662123]  [<ffffffffa01a172a>] usb_stor_control_thread+0x1ba/0x310 [usb_storage]
    [ 3007.662127]  [<ffffffffa01a1570>] ? fill_inquiry_response+0x20/0x20 [usb_storage]
    [ 3007.662129]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 3007.662131]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 3007.662133]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 3007.662134]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 3007.662165] INFO: task pool:14006 blocked for more than 120 seconds.
    [ 3007.662166]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 3007.662167] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 3007.662168] pool            D ffffffff81811ae0     0 14006  10901 0x00000000
    [ 3007.662170]  ffff880169c3d778 0000000000000082 0000000000000000 ffff880169c3dfd8
    [ 3007.662171]  0000000000014540 0000000000014540 ffff880428e6cf80 ffff880054a11dd0
    [ 3007.662173]  ffff880169c3d778 ffff88043f3d4e20 ffff880054a11dd0 ffffffff81162050
    [ 3007.662175] Call Trace:
    [ 3007.662179]  [<ffffffff81162050>] ? __lock_page+0x70/0x70
    [ 3007.662180]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 3007.662182]  [<ffffffff8177a80f>] io_schedule+0x8f/0xd0
    [ 3007.662184]  [<ffffffff8116205e>] sleep_on_page+0xe/0x20
    [ 3007.662185]  [<ffffffff8177aed2>] __wait_on_bit+0x62/0x90
    [ 3007.662188]  [<ffffffff811621c0>] wait_on_page_bit+0x80/0x90
    [ 3007.662190]  [<ffffffff810b74d0>] ? wake_atomic_t_function+0x40/0x40
    [ 3007.662207]  [<ffffffffa02c008a>] read_extent_buffer_pages+0x2da/0x310 [btrfs]
    [ 3007.662209]  [<ffffffff81163064>] ? add_to_page_cache_lru+0x34/0x50
    [ 3007.662219]  [<ffffffffa02948d0>] ? verify_parent_transid+0x170/0x170 [btrfs]
    [ 3007.662228]  [<ffffffffa0296956>] btree_read_extent_buffer_pages.constprop.126+0xb6/0x120 [btrfs]
    [ 3007.662238]  [<ffffffffa02982b3>] read_tree_block+0x43/0x70 [btrfs]
    [ 3007.662246]  [<ffffffffa0279160>] read_block_for_search.isra.41+0x150/0x1d0 [btrfs]
    [ 3007.662253]  [<ffffffffa027b484>] btrfs_search_slot+0x304/0x830 [btrfs]
    [ 3007.662263]  [<ffffffffa02937af>] btrfs_lookup_inode+0x2f/0xa0 [btrfs]
    [ 3007.662273]  [<ffffffffa02a3a3c>] btrfs_read_locked_inode+0x7c/0x610 [btrfs]
    [ 3007.662276]  [<ffffffff811f0c6b>] ? inode_sb_list_add+0x5b/0x70
    [ 3007.662278]  [<ffffffff811f25d6>] ? iget5_locked+0x1d6/0x200
    [ 3007.662287]  [<ffffffffa02a1380>] ? btrfs_readpage+0x30/0x30 [btrfs]
    [ 3007.662297]  [<ffffffffa02aaa68>] btrfs_iget+0x78/0xf0 [btrfs]
    [ 3007.662307]  [<ffffffffa02ab14b>] btrfs_lookup_dentry+0x24b/0x280 [btrfs]
    [ 3007.662309]  [<ffffffff811ef9da>] ? __d_alloc+0x14a/0x180
    [ 3007.662318]  [<ffffffffa02ab196>] btrfs_lookup+0x16/0x40 [btrfs]
    [ 3007.662320]  [<ffffffff811e15ad>] lookup_real+0x1d/0x60
    [ 3007.662322]  [<ffffffff811e1bd8>] __lookup_hash+0x38/0x50
    [ 3007.662324]  [<ffffffff81769acc>] lookup_slow+0x45/0xab
    [ 3007.662326]  [<ffffffff811e44f0>] path_lookupat+0x6e0/0x710
    [ 3007.662328]  [<ffffffff811e2de0>] ? getname_flags.part.18+0x30/0x140
    [ 3007.662330]  [<ffffffff811e2de0>] ? getname_flags.part.18+0x30/0x140
    [ 3007.662332]  [<ffffffff811e4554>] filename_lookup+0x34/0xc0
    [ 3007.662334]  [<ffffffff811e2f56>] ? getname_flags+0x66/0x80
    [ 3007.662336]  [<ffffffff811e7ed9>] user_path_at_empty+0x59/0xa0
    [ 3007.662338]  [<ffffffff811e2d86>] ? final_putname+0x26/0x50
    [ 3007.662340]  [<ffffffff811e3059>] ? putname+0x29/0x40
    [ 3007.662342]  [<ffffffff811e7ee3>] ? user_path_at_empty+0x63/0xa0
    [ 3007.662344]  [<ffffffff811e7f31>] user_path_at+0x11/0x20
    [ 3007.662347]  [<ffffffff811dc911>] vfs_fstatat+0x51/0xb0
    [ 3007.662349]  [<ffffffff811dc9be>] vfs_lstat+0x1e/0x20
    [ 3007.662352]  [<ffffffff811dc9d5>] SYSC_newlstat+0x15/0x30
    [ 3007.662353]  [<ffffffff811dcc2b>] ? SyS_readlinkat+0x4b/0x120
    [ 3007.662355]  [<ffffffff811dcbbe>] SyS_newlstat+0xe/0x10
    [ 3007.662357]  [<ffffffff8178766d>] system_call_fastpath+0x1a/0x1f
    [ 3127.765910] INFO: task scsi_eh_10:602 blocked for more than 120 seconds.
    [ 3127.765916]       Tainted: PF       W  O 3.14.24-031424-generic #201411141736
    [ 3127.765917] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
    [ 3127.765919] scsi_eh_10      D ffffffff81811ae0     0   602      2 0x00000000
    [ 3127.765923]  ffff88041cb51c78 0000000000000046 0000000000000000 ffff88041cb51fd8
    [ 3127.765926]  0000000000014540 0000000000014540 ffff880428e693e0 ffff88041cb093e0
    [ 3127.765929]  000000000000000e ffff880422b9d6f0 ffff880422b9d6f4 00000000ffffffff
    [ 3127.765932] Call Trace:
    [ 3127.765938]  [<ffffffff8177a739>] schedule+0x29/0x70
    [ 3127.765941]  [<ffffffff8177aa5e>] schedule_preempt_disabled+0xe/0x10
    [ 3127.765944]  [<ffffffff8177c894>] __mutex_lock_slowpath+0x114/0x1b0
    [ 3127.765946]  [<ffffffff8177c953>] mutex_lock+0x23/0x37
    [ 3127.765968]  [<ffffffffa019e9bb>] device_reset+0x2b/0x60 [usb_storage]
    [ 3127.765972]  [<ffffffff815077ee>] scsi_try_bus_device_reset+0x2e/0x60
    [ 3127.765974]  [<ffffffff8150a37f>] scsi_eh_bus_device_reset+0xdf/0x270
    [ 3127.765977]  [<ffffffff8150a663>] ? scsi_eh_stu+0x153/0x280
    [ 3127.765979]  [<ffffffff8150a7de>] scsi_eh_ready_devs+0x4e/0xa0
    [ 3127.765982]  [<ffffffff8150b81d>] scsi_unjam_host+0x10d/0x1f0
    [ 3127.765985]  [<ffffffff8150ba65>] scsi_error_handler+0x165/0x1d0
    [ 3127.765987]  [<ffffffff8150b900>] ? scsi_unjam_host+0x1f0/0x1f0
    [ 3127.765991]  [<ffffffff81093079>] kthread+0xc9/0xe0
    [ 3127.765994]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0
    [ 3127.765997]  [<ffffffff817875bc>] ret_from_fork+0x7c/0xb0
    [ 3127.765999]  [<ffffffff81092fb0>] ? flush_kthread_worker+0xb0/0xb0

which does not occur on previous versions mentioned above.

3.17.2 still causes kernel panic with complete system crash with the same device.

A current workaround therefore is to switch to 3.12.32 where you'll definitely won't have problems with xhci (the USB driver) and probalby a lot with btrfs (tradeoff).

Revision history for this message

gazhay (gazhay) wrote on 2014-12-09:

#14

I'm finding xhci unreliable across the board since 14.x, worse in 14.10

Scanners (usb2) that previously worked in usb3 ports now do not work, causing seg faults in applications and all kinds of errors in dmesg. (segfault at 0 ip 00007f1f79b9dd3f sp 00007f1fa283ca10 error 4 in libsane-genesys.so.1.0.24[7f1f79b4f000+6e000])

There are also unexpected length errors, and device not found errors.

Revision history for this message

Karl-Philipp Richter (krichter722) wrote on 2014-12-09:

#15

I've found this fixed in 3.12.32 and in 3.17.4 (no issues in 3.17.5 and 3.17.6 as well) in all kernels in between issues occured. I reported more information on different kernels and different error messages occuring when reproducing the issue on them on the same issue reported by another person (Ubuntu guidelines discourage this sort of helpful cross posting), but launchpad linking and search facilities are so bad, that I don't have energy to compensate them now. @gazhay try > 3.17.4 or 3.12.x with x >= 32. Maybe 3.18.0 contains a regression again so that it stopped working...

Revision history for this message

technik007_cz (technik007-cz) wrote on 2014-12-15:

#16

I'm running on kernel 3.13.0-43-lowlatency, laptop Lenovo Ideapad Y500. I have this "xHCI xhci_drop_endpoint called with disabled ep" problems with different USB 3.0 devices since I bought laptop at the end of year 2013. What increase possibility of this failures is long USB 3.0 cable or using of USB 3.0 extension. But even I keep my system/kernel updated this problem have never been solved with new kernel. I have tried different USB 3.0 hubs, cables, enclosures and what is sad if you plug "noncompatible device" it put into hell other devices like webcamera internally connected into usb 3.0 hub.

Revision history for this message

technik007_cz (technik007-cz) wrote on 2014-12-15:

#17

I gonna try 3.17.4 kernel or up Karl...

Karl-Philipp Richter (krichter722) on 2014-12-29

Changed in linux (Ubuntu):
status:	Confirmed → Fix Committed

Revision history for this message

Scott (e2e8e2) wrote on 2015-01-26:

#18

The only kernel version that works for me is 3.13.0.24.28 . I just tried upgrading Ubuntu to 3.13.0.44.51 and the problem came back with a vengeance (destroyed my software raid configuration of 2 usb 3 drives).

Revision history for this message

Alan Pope 🍺🐧🐱 🦄 (popey) wrote on 2015-03-16:

#19

This is marked "Fix committed" but I'm still seeing it on 3.19.0-9-generic on 15.04.

dding to a USB3 key attached to a USB3 hub causes my syslog to be spammed with a bunch of:-

Mar 16 11:54:24 deep-thought kernel: [ 5515.310662] usb 2-1.1: reset SuperSpeed USB device number 7 using xhci_hcd
Mar 16 11:54:24 deep-thought kernel: [ 5515.327096] xhci_hcd 0000:0e:00.0: xHCI xhci_drop_endpoint called with disabled ep ffff880032dc5cc0

As well as some:-

Mar 16 11:55:26 deep-thought kernel: [ 5576.916045] hub 2-1:1.0: hub_port_status failed (err = -71)

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2015-03-16:

#20

The v4.0 -rc4 kernel is now available. Can folks affected by this bug test the latest kernel? It can be downloaded from:

http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc4-vivid/

Changed in linux (Ubuntu Vivid):
status:	Fix Committed → Confirmed

Joseph Salisbury (jsalisbury) on 2015-03-16

Changed in linux (Ubuntu Trusty):
status:	New → Confirmed
Changed in linux (Ubuntu Utopic):
status:	New → Confirmed
importance:	Undecided → High
Changed in linux (Ubuntu Trusty):
importance:	Undecided → High
tags:	added: vivid

Revision history for this message

Alan Pope 🍺🐧🐱 🦄 (popey) wrote on 2015-03-16:

#21

Tried 4.0-rc4 on vivid and saw none of the xhci error messages I saw on 3.19. I performed exactly the same operation using the same hardware, namely ddrescuing an ~7.5GB image to an 8GB stick.

alan@deep-thought:/data/usb⟫ dmesg -T | grep xhci_hcd
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: xHCI Host Controller
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: new USB bus registered, assigned bus number 1
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: hcc params 0x014042cb hci version 0x96 quirks 0x00000004
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: xHCI Host Controller
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: new USB bus registered, assigned bus number 2
[Mon Mar 16 19:49:49 2015] usb 2-1: new SuperSpeed USB device number 2 using xhci_hcd
[Mon Mar 16 19:49:50 2015] usb 1-1: new high-speed USB device number 2 using xhci_hcd
[Mon Mar 16 19:49:59 2015] usb 2-1.3: new SuperSpeed USB device number 3 using xhci_hcd
[Mon Mar 16 19:54:59 2015] usb 2-1.3: new SuperSpeed USB device number 4 using xhci_hcd

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2015-03-16:

#22

I'd like to perform a bisect to figure out what commit caused this regression. We need to identify the earliest kernel where the issue started happening as well as the latest kernel that did not have this issue. Reading the comments, it sounds like this first started in the 3.13.0-25 Ubuntu kernel, which is based on upstream 3.13.10.

Can folks affected by this bug test the following kernels and post back

3.13.9: http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.13.9-trusty/
3.13.10: http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.13.10-trusty/

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2015-03-16:

#23

I re-read comment #21, and you 'Saw none' of the errors with 4.0, which means this bug might be fixed there. It might be better of to perform a 'Reverse' bisect to identify the commit that fixes this bug and requested it in the prior stable kernels.

Can you test the following kernels and report back? We are looking for the last kernel version that has the bug and the first that does not:

v4.0-rc1: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc1-vivid/
v4.0-rc2: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc2-vivid/
v4.0-rc3: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc3-vivid/

tags:

added: performing-bisect

Revision history for this message

Brad Figg (brad-figg) wrote on 2015-03-24: Test with newer development kernel (3.19.0-9.9)

#24

Thank you for taking the time to file a bug report on this issue.

However, given the number of bugs that the Kernel Team receives during any development cycle it is impossible for us to review them all. Therefore, we occasionally resort to using automated bots to request further testing. This is such a request.

We have noted that there is a newer version of the development kernel than the one you last tested when this issue was found. Please test again with the newer kernel and indicate in the bug if this issue still exists or not.

You can update to the latest development kernel by simply running the following commands in a terminal window:

sudo apt-get update
sudo apt-get dist-upgrade

If the bug still exists, change the bug status from Incomplete to Confirmed. If the bug no longer exists, change the bug status from Incomplete to Fix Released.

If you want this bot to quit automatically requesting kernel tests, add a tag named: bot-stop-nagging.

Thank you for your help, we really do appreciate it.

Changed in linux (Ubuntu):
status:	Confirmed → Incomplete
tags:	added: kernel-request-3.19.0-9.9

tehownt (tehownt) on 2015-04-25

Changed in linux (Ubuntu Vivid):
status:	Incomplete → Confirmed

Revision history for this message

Stefan (steffel) wrote on 2015-04-27:

#25

Download full text (4.2 KiB)

Experiencing a problem on xhci module with an Opticon Barcode Scanner NLV-1001 (ttyUSB device). After a few barcode-scan-triggers, it doesn't return from opening the device.

Tried:

- 3.13.0-35-generic
- 3.16.0-34-generic
- 3.17.0-031700-generic
- 3.19.1-031901-generic

Without USB3 (when disabled in BIOS) - ehci module is used, then, and no problems show up.

Could anyone tell me whether my problem with following trace is related to this issue?
Thank you for your help!

(kernel.log)
Apr 27 10:32:10 myhost123 kernel: [ 3721.041230] INFO: task myapp:2920 blocked for more than 120 seconds.
Apr 27 10:32:10 myhost123 kernel: [ 3721.041235] Tainted: G OE 3.19.1-031901-generic #201504091335
Apr 27 10:32:10 myhost123 kernel: [ 3721.041236] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Apr 27 10:32:10 myhost123 kernel: [ 3721.041237] myapp D e4995b74 0 2920 2477 0x00000004
Apr 27 10:32:10 myhost123 kernel: [ 3721.041240] e4995be0 00200086 00000000 e4995b74 e85ba000 0899fc06 000002fb 00000001
Apr 27 10:32:10 myhost123 kernel: [ 3721.041244] 00000001 e4995fec c15511bf c1b88f40 e51b0e00 e926cf40 e48cee40 e5730620
Apr 27 10:32:10 myhost123 kernel: [ 3721.041247] 00000000 00000000 e4cfb940 00000001 00000000 e863f070 e5376610 00000c01
Apr 27 10:32:10 myhost123 kernel: [ 3721.041250] Call Trace:
Apr 27 10:32:10 myhost123 kernel: [ 3721.041258] [<c15511bf>] ? xhci_queue_ctrl_tx+0x1ef/0x260
Apr 27 10:32:10 myhost123 kernel: [ 3721.041261] [<c1548d5d>] ? xhci_urb_enqueue+0x16d/0x420
Apr 27 10:32:10 myhost123 kernel: [ 3721.041263] [<c16eac03>] schedule+0x23/0x60
Apr 27 10:32:10 myhost123 kernel: [ 3721.041266] [<c16ed135>] schedule_timeout+0x165/0x1c0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041270] [<c1509180>] ? usb_hcd_submit_urb+0x80/0x180
Apr 27 10:32:10 myhost123 kernel: [ 3721.041272] [<c150a480>] ? usb_submit_urb.part.9+0x1e0/0x520
Apr 27 10:32:10 myhost123 kernel: [ 3721.041275] [<c11808f4>] ? vmap_pmd_range+0x94/0xe0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041277] [<c16ebbb5>] wait_for_completion_timeout+0x85/0x140
Apr 27 10:32:10 myhost123 kernel: [ 3721.041280] [<c1089770>] ? try_to_wake_up+0x210/0x210
Apr 27 10:32:10 myhost123 kernel: [ 3721.041282] [<c150b541>] usb_start_wait_urb+0x71/0x150
Apr 27 10:32:10 myhost123 kernel: [ 3721.041284] [<c1194feb>] ? __kmalloc+0xab/0x230
Apr 27 10:32:10 myhost123 kernel: [ 3721.041286] [<c1509ff9>] ? usb_alloc_urb+0x19/0x40
Apr 27 10:32:10 myhost123 kernel: [ 3721.041288] [<c150b83b>] usb_control_msg+0xbb/0xe0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041295] [<f058b6fa>] send_control_msg.isra.4+0x7a/0xa0 [opticon]
Apr 27 10:32:10 myhost123 kernel: [ 3721.041297] [<f058b830>] opticon_open+0x40/0x84 [opticon]
Apr 27 10:32:10 myhost123 kernel: [ 3721.041300] [<f05d50d1>] serial_port_activate+0x61/0x90 [usbserial]
Apr 27 10:32:10 myhost123 kernel: [ 3721.041303] [<c1416791>] tty_port_open+0x71/0xf0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041307] [<f05d5b7c>] serial_open+0x2c/0x70 [usbserial]
Apr 27 10:32:10 myhost123 kernel: [ 3721.041309] [<c140f22e>] tty_open+0x3e/0x3f0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041312] [<c11ae...

I am still experiencing this issue as of now:

marcos@S4X8-MANNY:~$ sudo apt-get update && sudo apt-get upgrade -y && uname -a
Ign http://es.archive.ubuntu.com vivid InRelease
Ign http://es.archive.ubuntu.com vivid-updates InRelease
Ign http://es.archive.ubuntu.com vivid-backports InRelease
Obj http://es.archive.ubuntu.com vivid Release.gpg
Des:1 http://es.archive.ubuntu.com vivid-updates Release.gpg [933 B]
Obj http://es.archive.ubuntu.com vivid-backports Release.gpg                          
Obj http://es.archive.ubuntu.com vivid Release             
Des:2 http://es.archive.ubuntu.com vivid-updates Release [63,5 kB]            
Ign http://security.ubuntu.com vivid-security InRelease                          
Obj http://security.ubuntu.com vivid-security Release.gpg                        
Obj http://es.archive.ubuntu.com vivid-backports Release                         
Obj http://es.archive.ubuntu.com vivid/main Sources                            
Obj http://es.archive.ubuntu.com vivid/restricted Sources
Obj http://security.ubuntu.com vivid-security Release
Obj http://es.archive.ubuntu.com vivid/universe Sources            
Obj http://es.archive.ubuntu.com vivid/multiverse Sources          
Obj http://es.archive.ubuntu.com vivid/main amd64 Packages
Obj http://es.archive.ubuntu.com vivid/restricted amd64 Packages           
Obj http://security.ubuntu.com vivid-security/main Sources                 
Obj http://es.archive.ubuntu.com vivid/universe amd64 Packages             
Obj http://es.archive.ubuntu.com vivid/multiverse amd64 Packages           
Obj http://es.archive.ubuntu.com vivid/main i386 Packages                  
Obj http://security.ubuntu.com vivid-security/restricted Sources           
Obj http://es.archive.ubuntu.com vivid/restricted i386 Packages
Obj http://security.ubuntu.com vivid-security/universe Sources             
Obj http://es.archive.ubuntu.com vivid/universe i386 Packages              
Obj http://es.archive.ubuntu.com vivid/multiverse i386 Packages       
Obj http://security.ubuntu.com vivid-security/multiverse Sources      
Obj http://es.archive.ubuntu.com vivid/main Translation-es            
Obj http://es.archive.ubuntu.com vivid/main Translation-en            
Obj http://es.archive.ubuntu.com vivid/multiverse Translation-es      
Obj http://security.ubuntu.com vivid-security/main amd64 Packages     
Obj http://es.archive.ubuntu.com vivid/multiverse Translation-en          
Obj http://es.archive.ubuntu.com vivid/restricted Translation-es          
Obj http://security.ubuntu.com vivid-security/restricted amd64 Packages   
Obj http://es.archive.ubuntu.com vivid/restricted Translation-en          
Obj http://es.archive.ubuntu.com vivid/universe Translation-es            
Obj http://es.archive.ubuntu.com vivid/universe Translation-en            
Obj http://security.ubuntu.com vivid-security/universe amd64 Packages     
Des:3 http://es.archive.ubuntu.com vivid-updates/main Sources [9.756 B]   
Des:4 http://es.archive.ubuntu.com vivid-updates/restricted Sources [28 B] 
Obj http://security.ubuntu.com vivid-security/multiverse amd64 Packages                         
Des:5 http://es.archive.ubuntu.com vivid-updates/universe Sources [3.044 B]                     
Des:6 http://es.archive.ubuntu.com vivid-updates/multiverse Sources [28 B]                      
Des:7 http://es.archive.ubuntu.com vivid-updates/main amd64 Packages [27,8 kB]
Obj http://security.ubuntu.com vivid-security/main i386 Packages                
Des:8 http://es.archive.ubuntu.com vivid-updates/restricted amd64 Packages [28 B]
Des:9 http://es.archive.ubuntu.com vivid-updates/universe amd64 Packages [10,3 kB]                 
Obj http://security.ubuntu.com vivid-security/restricted i386 Packages                                   
Des:10 http://es.archive.ubuntu.com vivid-updates/multiverse amd64 Packages [28 B]               
Obj http://security.ubuntu.com vivid-security/universe i386 Packages                              
Des:11 http://es.archive.ubuntu.com vivid-updates/main i386 Packages [27,8 kB]
Des:12 http://es.archive.ubuntu.com vivid-updates/restricted i386 Packages [28 B]
Obj http://security.ubuntu.com vivid-security/multiverse i386 Packages    
Des:13 http://es.archive.ubuntu.com vivid-updates/universe i386 Packages [10,4 kB]
Des:14 http://es.archive.ubuntu.com vivid-updates/multiverse i386 Packages [28 B]                 
Obj http://security.ubuntu.com vivid-security/main Translation-en                                 
Obj http://es.archive.ubuntu.com vivid-updates/main Translation-en        
Obj http://es.archive.ubuntu.com vivid-updates/multiverse Translation-en  
Obj http://es.archive.ubuntu.com vivid-updates/restricted Translation-en  
Obj http://security.ubuntu.com vivid-security/multiverse Translation-en   
Obj http://es.archive.ubuntu.com vivid-updates/universe Translation-en    
Obj http://es.archive.ubuntu.com vivid-backports/main Sources             
Obj http://security.ubuntu.com vivid-security/restricted Translation-en   
Obj http://es.archive.ubuntu.com vivid-backports/restricted Sources       
Obj http://es.archive.ubuntu.com vivid-backports/universe Sources         
Obj http://es.archive.ubuntu.com vivid-backports/multiverse Sources       
Obj http://es.archive.ubuntu.com vivid-backports/main amd64 Packages      
Obj http://es.archive.ubuntu.com vivid-backports/restricted amd64 Packages
Obj http://es.archive.ubuntu.com vivid-backports/universe amd64 Packages  
Obj http://security.ubuntu.com vivid-security/universe Translation-en     
Obj http://es.archive.ubuntu.com vivid-backports/multiverse amd64 Packages
Obj http://es.archive.ubuntu.com vivid-backports/main i386 Packages
Obj http://es.archive.ubuntu.com vivid-backports/restricted i386 Packages
Obj http://es.archive.ubuntu.com vivid-backports/universe i386 Packages
Obj http://es.archive.ubuntu.com vivid-backports/multiverse i386 Packages
Obj http://es.archive.ubuntu.com vivid-backports/main Translation-en
Obj http://es.archive.ubuntu.com vivid-backports/multiverse Translation-en
Obj http://es.archive.ubuntu.com vivid-backports/restricted Translation-en
Obj http://es.archive.ubuntu.com vivid-backports/universe Translation-en
Descargados 154 kB en 15s (9.634 B/s)                                                                                                                                                          
Leyendo lista de paquetes... Hecho
Leyendo lista de paquetes... Hecho
Creando árbol de dependencias       
Leyendo la información de estado... Hecho
Calculando la actualización... Listo
0 actualizados, 0 nuevos se instalarán, 0 para eliminar y 0 no actualizados.
Linux S4X8-MANNY 3.19.0-15-generic #15-Ubuntu SMP Thu Apr 16 23:32:37 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux
marcos@S4X8-MANNY:~$ date
lun may  4 15:51:47 CEST 2015
marcos@S4X8-MANNY:~$

Dmesg:
[  382.651086] usb 2-1: new SuperSpeed USB device number 2 using xhci_hcd
[  382.670760] usb 2-1: New USB device found, idVendor=0781, idProduct=5580
[  382.670778] usb 2-1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[  382.670787] usb 2-1: Product: Extreme
[  382.670795] usb 2-1: Manufacturer: SanDisk
[  382.670802] usb 2-1: SerialNumber: AA010122141450271936
[  382.758883] usb-storage 2-1:1.0: USB Mass Storage device detected
[  382.769656] scsi host1: usb-storage 2-1:1.0
[  382.770632] usbcore: registered new interface driver usb-storage
[  382.789612] usbcore: registered new interface driver uas
[  383.768238] scsi 1:0:0:0: Direct-Access     SanDisk  Extreme          0001 PQ: 0 ANSI: 6
[  383.769037] sd 1:0:0:0: Attached scsi generic sg1 type 0
[  383.774253] sd 1:0:0:0: [sdb] 30651688 512-byte logical blocks: (15.6 GB/14.6 GiB)
[  383.775283] sd 1:0:0:0: [sdb] Write Protect is off
[  383.775299] sd 1:0:0:0: [sdb] Mode Sense: 33 00 00 08
[  383.775818] sd 1:0:0:0: [sdb] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA
[  383.782895]  sdb: sdb1
[  383.784832] sd 1:0:0:0: [sdb] Attached SCSI disk
[  953.311706] usb 2-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  953.328516] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e40
[  953.328535] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e88
[  953.449884] usb 2-1: reset SuperSpeed USB device number 2 using xhci_hcd
[  953.700878] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e40
[  953.700898] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e88
[ 1440.567515] usb 2-1: device not accepting address 2, error -22
[ 1441.087950] usb 2-1: USB disconnect, device number 2
[ 1441.093257] sd 1:0:0:0: [sdb] FAILED Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK
[ 1441.093274] sd 1:0:0:0: [sdb] CDB: 
[ 1441.093282] Read(10): 28 00 00 fa 04 c0 00 00 02 00
[ 1441.093309] blk_update_request: I/O error, dev sdb, sector 16385216
[ 1441.120102] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e40
[ 1441.120119] xhci_hcd 0000:00:10.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800446d2e88

Revision history for this message

Scott (e2e8e2) wrote on 2015-05-09:

#29

I know people are looking at this, but it' been a long time, and since this bug can and does cause data corruption (it happened to me) I'd hope that this one would be near the top of the heap.

Revision history for this message

Joseph Salisbury (jsalisbury) wrote on 2015-05-11:

#30

Can folks affected by this bug test the following kernels and report back? We are looking for the last kernel version that has the bug and the first that does not:

v4.0-rc1: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc1-vivid/
v4.0-rc2: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc2-vivid/
v4.0-rc3: http://kernel.ubuntu.com/~kernel-ppa/mainline/v4.0-rc3-vivid/

Revision history for this message

NightShade (tim-night-shade) wrote on 2015-05-12:

#31

tim@desktop-tim:~$ uname -a
Linux desktop-tim 4.0.1-040001-generic #201504290935 SMP Wed Apr 29 09:36:55 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

No bug, I'll pull the other kernels down and test them over the next few days

Joseph Salisbury (jsalisbury) on 2015-06-09

Changed in linux (Ubuntu Utopic):
status:	Confirmed → Incomplete
Changed in linux (Ubuntu Trusty):
status:	Confirmed → Incomplete

Revision history for this message

Manuel Iglesias Alonso (glesialo) wrote on 2015-07-13:

#32

I had to plug my USB3 external drive into a (much slower) USB2 socket to avoid data being corrupted by this bug.

Could you please reactivate this report and start looking for a solution?

Thanks.

Revision history for this message

gazhay (gazhay) wrote on 2015-07-16:

#33

Recently bought a new machine and put 15.04 on it fresh.

I can confirm that my scanner which I had got working thanks to a usb2 card, no longer works as the motherboard of my new machine forces you to use xhci for all ports regardless.

There still seems to be some sort of bug in vivid.

Revision history for this message

Mikhail (mikhail-kuzmin) wrote on 2015-08-07:

#34

I'm having the same problem on 3.13.0-57-lowlatency on Ubuntu 14.04.

Revision history for this message

Scott (e2e8e2) wrote on 2015-08-26:

#35

If anyone has the time to test this (the system it's happening to me on is off site and I won't be back to it for a while), see if the problem still occurs if you connect the device through a USB 3 hub rather than directly to the USB port. It just occurred to me that somewhere in the process of trying to circumvent this problem I added a USB 3 hub to the mix, and now I'm wondering if it's the hub that's circumventing the problem or if backing down to 3.13.0.24.28 did the trick.

As it doesn't look like this bug is going to receive attention anytime soon I thought it might be worth someone doing a test. If there aren't any results by the time I get back to my system that was having trouble in a couple of months I'll try it myself.

Revision history for this message

Yuriy Vidineev (adeptg) wrote on 2015-08-26:

#36

Dell XPS 13 9333, Ubuntu 14.04, 3.19.0-26-generic, USB 3.0 HDD enclosure (ASMedia AS2105). Directly connected to USB3 port - not detected at all. Connected to USB2.0 hub - detected and perfectly worked. Connected to USB 3.0 hub (ASIX AX88179) - determined ~50% times (~50% no any line in syslog after plug in). When it successfully determined in USB3 hub - works after it without any problem (however my test was short - 15 min rsync with a lot of files (my home folder))

Revision history for this message

Scott (e2e8e2) wrote on 2015-08-28:

#37

So there was a difference between being plugged into a hub and not. Mine is working with 3.13.0.24.28, but I also have the hard drives connected through a USB 3 hub. I'm not sure if this information will help find the bug, but it would be nice if we could determine that using a hub can circumvent the problem.

I also am quite perplexed as to why this bug hasn't been addressed as it's about a year old now and it can cause data corruption. One would think that bugs which have the potential to cause data corruption would be among the highest priority to be fixed.

Revision history for this message

daniel lopez (dlopez7892) wrote on 2015-09-11:

#38

I'm using a ASUS Rampage IV Black edition with Ubuntu currently and am also experiencing this issue. I've tested several kernel versions and distros, and this is not an issue unique to Ubuntu. It will happen on pretty much any kernel =>3.15 with certain ASMedia USB 3.0 controllers. I'm not sure if it is isolated to ASMedia ICs, but it does impact at least two ASmedia products. The upstream kernel devs are aware of the issue and don't seem very optimistic about being able to fix it. Apparently there is a poor design for a PCI to USB bridge used in these controllers, and the possibility of fixing it without ASMedia or the motherboard manufacturers contributing code is fairly slim. For me, I had to disable USB 3.0 entirely to even get Linux =>3.15 to boot on this board without having a kernel panic. While I'd love to see this get fixed, I'm not sure that chasing this down is worth the Ubuntu kernel team's time.

Revision history for this message

Scott (e2e8e2) wrote on 2015-09-11:

#39

It's not exclusive to that controller. It's happening to me with a NEC uPD720200 USB 3.0 Host Controller, which is a very common USB 3 controller chip.

Revision history for this message

Malachi de AElfweald (malachid) wrote on 2015-10-14:

#40

I see the same problem with a hub plugged in and nothing attached to it. This happens repeatedly. I've reported it to them as well, but it seems to be more of a problem with the xhci_hcd. This is with 3.19.0-30.

[ 3323.263466] usb 2-1.1: USB disconnect, device number 103
[ 3323.284305] usb 2-1.4: USB disconnect, device number 104
[ 3323.398767] usb 2-1: reset SuperSpeed USB device number 2 using xhci_hcd
[ 3323.416724] xhci_hcd 0000:04:00.0: xHCI xhci_drop_endpoint called with disabled ep ffff8804288644e0
[ 3323.697914] usb 2-1.1: new SuperSpeed USB device number 105 using xhci_hcd
[ 3323.713630] usb 2-1.1: New USB device found, idVendor=05e3, idProduct=0612
[ 3323.713633] usb 2-1.1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 3323.713635] usb 2-1.1: Product: USB3.0 Hub
[ 3323.713636] usb 2-1.1: Manufacturer: SKIVA TECHNOLOGIES INC
[ 3323.713638] usb 2-1.1: SerialNumber: t2
[ 3323.715331] hub 2-1.1:1.0: USB hub found
[ 3323.715596] hub 2-1.1:1.0: 4 ports detected
[ 3323.789786] usb 2-1.4: new SuperSpeed USB device number 106 using xhci_hcd
[ 3323.805724] usb 2-1.4: New USB device found, idVendor=05e3, idProduct=0612
[ 3323.805727] usb 2-1.4: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 3323.805729] usb 2-1.4: Product: USB3.0 Hub
[ 3323.805731] usb 2-1.4: Manufacturer: SKIVA TECHNOLOGIES INC
[ 3323.805732] usb 2-1.4: SerialNumber: t3
[ 3323.807841] hub 2-1.4:1.0: USB hub found
[ 3323.808171] hub 2-1.4:1.0: 4 ports detected

Revision history for this message

Malachi de AElfweald (malachid) wrote on 2015-10-23:

#41

Bug did not go away with 15.10 / 4.2.0-16

Revision history for this message

Malachi de AElfweald (malachid) wrote on 2015-10-29:

#42

Regarding comment #37

If I plug an Android phone (HTC One m7 GPe) to the same port instead of the USB3 Hub, it stays connected.
If I connect the USB hub instead, I get this problem.
If I connect the phone through the USB hub, I loose access to the phone rather quickly during one of the drop cycles.

Revision history for this message

Malachi de AElfweald (malachid) wrote on 2015-10-29:

#43

Could be related to https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1012291

Revision history for this message

Malachi de AElfweald (malachid) wrote on 2015-10-29:

#44

I believe I have fixed mine. Perhaps someone else can test the fix on theirs?
From this http://unix.stackexchange.com/questions/91027/how-to-disable-usb-autosuspend-on-kernel-3-7-10-or-above

Edit the /etc/default/grub file and append to the GRUB_CMDLINE_LINUX_DEFAULT line:
usbcore.autosuspend=-1

sudo update-grub
reboot

I don't appear to be getting the disconnects anymore.

Revision history for this message

Robert Oswald (robert-oswald) wrote on 2015-11-06:

#45

Tried your workaround but problem still exists.

Log:
Nov 6 21:35:38 rodomp01 kernel: [ 289.927150] sd 4:0:0:0: [sdb] uas_eh_abort_handler ffff88021311cd80 tag 0, inflight: CMD IN
Nov 6 21:35:41 rodomp01 kernel: [ 292.927539] scsi host4: uas_eh_task_mgmt: ABORT TASK timed out
Nov 6 21:35:41 rodomp01 kernel: [ 292.927588] sd 4:0:0:0: uas_eh_device_reset_handler
Nov 6 21:35:41 rodomp01 kernel: [ 292.927597] scsi host4: uas_eh_task_mgmt: LOGICAL UNIT RESET: error already running a task
Nov 6 21:35:41 rodomp01 kernel: [ 292.927603] scsi host4: uas_eh_bus_reset_handler start
Nov 6 21:35:41 rodomp01 kernel: [ 292.927669] usb 3-1: stat urb: killed, stream 2
Nov 6 21:35:41 rodomp01 kernel: [ 292.927763] sd 4:0:0:0: [sdb] uas_data_cmplt ffff88021311cd80 tag 0, inflight: CMD abort
Nov 6 21:35:41 rodomp01 kernel: [ 292.927770] sd 4:0:0:0: [sdb] data cmplt err -2 stream 2
Nov 6 21:35:41 rodomp01 kernel: [ 292.927804] sd 4:0:0:0: [sdb] uas_zap_dead ffff88021311cd80 tag 0, inflight: CMD abort
Nov 6 21:35:41 rodomp01 kernel: [ 292.927822] sd 4:0:0:0: [sdb] abort completed
Nov 6 21:35:41 rodomp01 kernel: [ 293.039920] usb 3-1: reset SuperSpeed USB device number 2 using xhci_hcd
Nov 6 21:35:41 rodomp01 kernel: [ 293.056271] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f000
Nov 6 21:35:41 rodomp01 kernel: [ 293.056277] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f048
Nov 6 21:35:41 rodomp01 kernel: [ 293.056280] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f090
Nov 6 21:35:41 rodomp01 kernel: [ 293.056283] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f0d8
Nov 6 21:35:41 rodomp01 kernel: [ 293.057750] scsi host4: uas_eh_bus_reset_handler success

cat /proc/cmdline:
BOOT_IMAGE=/vmlinuz-3.16.0-51-generic root=UUID=cee44af9-c463-4cf1-ac47-36add5959072 ro quiet splash usbcore.autosuspend=-1 vt.handoff=7

Tried your workaround but problem still exists.

Log:
Nov  6 21:35:38 rodomp01 kernel: [  289.927150] sd 4:0:0:0: [sdb] uas_eh_abort_handler ffff88021311cd80 tag 0, inflight: CMD IN
Nov  6 21:35:41 rodomp01 kernel: [  292.927539] scsi host4: uas_eh_task_mgmt: ABORT TASK timed out
Nov  6 21:35:41 rodomp01 kernel: [  292.927588] sd 4:0:0:0: uas_eh_device_reset_handler
Nov  6 21:35:41 rodomp01 kernel: [  292.927597] scsi host4: uas_eh_task_mgmt: LOGICAL UNIT RESET: error already running a task
Nov  6 21:35:41 rodomp01 kernel: [  292.927603] scsi host4: uas_eh_bus_reset_handler start
Nov  6 21:35:41 rodomp01 kernel: [  292.927669] usb 3-1: stat urb: killed, stream 2
Nov  6 21:35:41 rodomp01 kernel: [  292.927763] sd 4:0:0:0: [sdb] uas_data_cmplt ffff88021311cd80 tag 0, inflight: CMD abort
Nov  6 21:35:41 rodomp01 kernel: [  292.927770] sd 4:0:0:0: [sdb] data cmplt err -2 stream 2
Nov  6 21:35:41 rodomp01 kernel: [  292.927804] sd 4:0:0:0: [sdb] uas_zap_dead ffff88021311cd80 tag 0, inflight: CMD abort
Nov  6 21:35:41 rodomp01 kernel: [  292.927822] sd 4:0:0:0: [sdb] abort completed
Nov  6 21:35:41 rodomp01 kernel: [  293.039920] usb 3-1: reset SuperSpeed USB device number 2 using xhci_hcd
Nov  6 21:35:41 rodomp01 kernel: [  293.056271] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f000
Nov  6 21:35:41 rodomp01 kernel: [  293.056277] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f048
Nov  6 21:35:41 rodomp01 kernel: [  293.056280] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f090
Nov  6 21:35:41 rodomp01 kernel: [  293.056283] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f0d8
Nov  6 21:35:41 rodomp01 kernel: [  293.057750] scsi host4: uas_eh_bus_reset_handler success

cat /proc/cmdline:
BOOT_IMAGE=/vmlinuz-3.16.0-51-generic root=UUID=cee44af9-c463-4cf1-ac47-36add5959072 ro quiet splash usbcore.autosuspend=-1 vt.handoff=7

Revision history for this message

user (user-3) wrote on 2015-11-26:

#46

hi there,

I had the same issue, and grub update helped me to resolve this issue

BOOT_IMAGE=/vmlinuz-3.19.0-33-generic root=UUID=...... ro quiet splash nox2apic usbcore.autosuspend=-1

Revision history for this message

Scott (e2e8e2) wrote on 2016-01-15:

#47

Since my original post a year ago I have had a couple of occurrences of this problem even on the older kernel version that I'm using. I tried setting nox2apic as a boot parameter, and it appears to work. The only time I was getting the error on that kernel version was when I was resynching a RAID 1 array on 2 2tb USB 3 drives on the same controller. I got the error today when doing that, so I aborted the resynch and rebooted with nox2apic. It resynched all 2tb with no occurrences of the error. Now I'm going to try upgrading the kernel to current and see if I can cause the error again with nox2apic set.

Revision history for this message

Scott (e2e8e2) wrote on 2016-01-17:

#48

I was able to upgrade to kernel version 3.13.0-74-generic (Ubuntu 14.04 LTS) from 3.13.0.24.28 and everything appears to work fine (i.e. no messages and no lost USB connections) with nox2apic set. I copied hundreds of gigabytes back and forth on USB 3 drives without any problems and a binary compare of the results showed the copies were exact. Unfortunately I can't upgrade any further than that as the next version, Ubuntu 14.10, is now obsolete and the next LTS version isn't out yet; so my installation won't upgrade at all right now. I'm stuck unless I want to do an install from scratch.

Revision history for this message

databill (julienjut) wrote on 2016-02-04:

#49

I was affected by usb3.0 for years. Once Kernel 3.12(test release, linux-headers-3.12.0-031200rc2-generic_3.12.0-031200rc2.201309231935_amd64.deb, etc) has resolved my issue and works for a year, but since I update ubuntu to 15.04, it appears again and again.

I have the phenomenon as link described, http://unix.stackexchange.com/questions/91027/how-to-disable-usb-autosuspend-on-kernel-3-7-10-or-above.
I'm testing #44 suggestion, and I hope it will help me to step out the issue. Thanks, Malachi de AElfweald (malachid)

I will give back the test result tomorrow.

Revision history for this message

databill (julienjut) wrote on 2016-02-05:

#50

Issue is resolved! It has been working over 24 Hours.

#uptime
11:02:54 up 1 day, 1:37, 2 users, load average: 0.28, 0.25, 0.24

Thanks again!

solutions refer to #44,

Edit the /etc/default/grub file and append to the GRUB_CMDLINE_LINUX_DEFAULT line:
usbcore.autosuspend=-1

sudo update-grub
sudo reboot

Revision history for this message

Scott (e2e8e2) wrote on 2016-03-13:

#51

I have found another setting that in my case was the real cause of the problem, namely ASPM (Active State Power Management). I had noticed that I was getting the disabled endpoint error message at odd times, like in the middle of the night, when there was little or no activity to the USB 3 disks attached to the controller. It always struck me as odd that the problem would occur randomly like that and not be clustered around the periods of high activity.

It turns out that ASPM was active by default in the BIOS, buried in a power management setting for PCI devices (i.e. it didn't say ASPM in the setting title). Once I turned that setting off all the errors stopped, both when the devices were active and when they were idle. I'm sort of assuming that the issues happens when ASPM tries to turn off or go to a lower power state on the USB controller. I don't know if this is a bug in the BIOS, firmware of the USB controller, or the linux kernel device driver, or a combination of those. Whichever it is though, turning this off in the BIOS solved everything in every kernel version that I've tested. I also installed Arch Linux so that I could try a recent kernel version and it works find there too. So if you are having this problem and it the other solutions haven't worked for you, look for this setting in your BIOS. It might be in the power management section, but also might be in the peripherals, devices, or advanced PCI settings area.

Joseph Salisbury (jsalisbury) on 2016-03-14

tags:

removed: performing-bisect

Revision history for this message

Rolf Leggewie (r0lf) wrote on 2016-04-24:

#52

utopic has seen the end of its life and is no longer receiving any updates. Marking the utopic task for this ticket as "Won't Fix".

Changed in linux (Ubuntu Utopic):
status:	Incomplete → Won't Fix

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-06-24:

#53

[Expired for linux (Ubuntu Vivid) because there has been no activity for 60 days.]

Changed in linux (Ubuntu Vivid):
status:	Incomplete → Expired

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-06-24:

#54

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status:	Incomplete → Expired

Revision history for this message

Launchpad Janitor (janitor) wrote on 2016-06-24:

#55

[Expired for linux (Ubuntu Trusty) because there has been no activity for 60 days.]

Changed in linux (Ubuntu Trusty):
status:	Incomplete → Expired

Revision history for this message

sibulini (sibulini) wrote on 2017-04-25:

#56

Hi everyone, from your discussions, I can't find the root cause about the problem. I meet this issue once in 3.14.55 with android OS. I hope get the root cause. Thanks a lot!

Ubuntu
linux package

USB 3.0 connection is unreliable + xHCI xhci_drop_endpoint called with disabled ep

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
linux (Ubuntu)	Expired	High	Unassigned
Trusty	Expired	High	Unassigned
Utopic	Won't Fix	High	Unassigned
Vivid	Expired	High	Unassigned

Ubuntulinux package

USB 3.0 connection is unreliable + xHCI xhci_drop_endpoint called with disabled ep

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

Ubuntu
linux package