USB 3.0 connection is unreliable + xHCI xhci_drop_endpoint called with disabled ep
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
| linux (Ubuntu) |
High
|
Unassigned | ||
| Trusty |
High
|
Unassigned | ||
| Utopic |
High
|
Unassigned | ||
| Vivid |
High
|
Unassigned |
Bug Description
based on https:/
Also confirmed on Lenovo IdeaPad-Z500 after BIOS update to 71CN51WW(V1.21) (changelog didn't indicate any USB issues anyway). Also confirmed with 3.16.3 and 3.16.0-14 on Ubuntu 14.10-beta1 after updates.
The issue seems to cause failure of ASIX AX179 gigabit ethernet chip as well, but is independent from the usage of the ethernet adapter.
I'm without any clue and stuck with an unreliable USB and ethernet connection which basically means no I/O out of the machine!!
ProblemType: Bug
DistroRelease: Ubuntu 14.04
Package: linux-image-
ProcVersionSign
Uname: Linux 3.13.0-35-generic x86_64
ApportVersion: 2.14.1-0ubuntu3.4
Architecture: amd64
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path', '/dev/snd/
CurrentDesktop: Unity
Date: Thu Sep 18 19:58:24 2014
EcryptfsInUse: Yes
InstallationDate: Installed on 2014-09-10 (8 days ago)
InstallationMedia: Ubuntu 14.04 LTS "Trusty Tahr" - Release amd64 (20140417)
MachineType: LENOVO 20221
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
XDG_RUNTIME_
LANG=de_DE.UTF-8
SHELL=/bin/bash
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=
RelatedPackageV
linux-
linux-
linux-firmware 1.127.5
SourcePackage: linux
StagingDrivers: rts5139
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 07/12/2013
dmi.bios.vendor: LENOVO
dmi.bios.version: 71CN51WW(V1.21)
dmi.board.
dmi.board.name: INVALID
dmi.board.vendor: LENOVO
dmi.board.version: 31900003WIN8 STD MLT
dmi.chassis.
dmi.chassis.type: 10
dmi.chassis.vendor: LENOVO
dmi.chassis.
dmi.modalias: dmi:bvnLENOVO:
dmi.product.name: 20221
dmi.product.
dmi.sys.vendor: LENOVO
Karl-Philipp Richter (krichter722) wrote : | #1 |
Changed in linux (Ubuntu): | |
status: | New → Confirmed |
description: | updated |
Joseph Salisbury (jsalisbury) wrote : | #3 |
Would it be possible for you to test the latest upstream kernel? Refer to https:/
If this bug is fixed in the mainline kernel, please add the following tag 'kernel-
If the mainline kernel does not fix this bug, please add the tag: 'kernel-
If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".
Thanks in advance.
[0] http://
Changed in linux (Ubuntu): | |
importance: | Undecided → Medium |
tags: | added: kernel-da-key |
Changed in linux (Ubuntu): | |
importance: | Medium → High |
Karl-Philipp Richter (krichter722) wrote : | #4 |
Tested in 3.17-rc5 and experienced the issue. Due to the high number of connection failures one partition used for reproduction has vanished, but this shouldn't matter because data rescue in gparted has been a main reproduction scenario. I had to remove apt packages `multipath-*` in order to make all devices on USB ports being recognized.
tags: | added: latest-bios-1.21 |
tags: | added: kernel-bug-exists-upstream-3.15-rc7 utopic |
Joseph Salisbury (jsalisbury) wrote : | #5 |
This issue appears to be an upstream bug, since you tested the latest upstream kernel. Would it be possible for you to open an upstream bug report[0]? That will allow the upstream Developers to examine the issue, and may provide a quicker resolution to the bug.
Please follow the instructions on the wiki page[0]. The first step is to email the appropriate mailing list. If no response is received, then a bug may be opened on bugzilla.
Once this bug is reported upstream, please add the tag: 'kernel-
Joseph Salisbury (jsalisbury) wrote : | #6 |
Also, was there a prior kernel version that did not exhibit this bug? If there is, we can perform a bisect to identify the commit that introduced this.
Karl-Philipp Richter (krichter722) wrote : | #7 |
After obtaining an Orico A3H7 USB 3.0 hub with (sufficient) power supply the reproducability of the bug got limited to the broken HDD (reproducability with it is 100 % (disk is recognized by gparted and doesn't have a /dev/sdxY file)). Failure of of ethernet adapter no longer occurs which makes me think the original issue was related to insufficient power on USB connections and that the error `xHCI xhci_drop_endpoint called with disabled ep` is related rather to a consequence of the power failure rather than being the error of the power failure.
In my point of view it'd make sense to get the HDD working again (there's probably an issue with the partition table) and testing whether it works in 3.13.0-36 and then other versions (including mainline).
Sorry for the delay of my response.
Karl-Philipp Richter (krichter722) wrote : | #8 |
At the same time of upgrade from 13.10 to 14.04 I changed to btrfs which causes incredible trouble (it can't be said often enough that it's irresponsible to offer it as root filesystem without(!) a warning), so there're so many possible reasons for the bug (most of them are bugs for themselves) that for a non-developer it's impossible to provide accurate reports.
Karl-Philipp Richter (krichter722) wrote : | #9 |
Found a working kernel: on Ubuntu 14.04 live system with 3.13.0-24-generic I can run ddrescue which found > 50 errors on the HDD so far, but the error doesn't occur and ddrescue proceed - slowly, but at least it does! Here's the thing: running the 3.13.0-24-generic kernel on Ubuntu installation doesn't work, so that I assume it's not just a kernel issue or not a kernel issue at all, but related to an upgraded apt package. How to proceed? How is bisecting done?
legolas558 (legolas558) wrote : | #10 |
Running kernel 3.13.0-35-generic here.
I was affected by the same issue. Using "echo -1 >/sys/module/
The best way to trigger the issue was for example to run an rsync reading from one partition on the external disk and writing to another partition on the same disk.
Plugging the disk with an USB 2.0 cable would show no problems at all with same tests.
After plugging the external disk to a an USB port on the back, I can confirm it works perfectly (and still in high speed mode, "new SuperSpeed USB device number 2 using xhci_hcd").
Upon inspection, I found that the USB 3.0 cable connecting the front USB ports to the motherboard was a bit loose at connecting the USB 3.0 part of it. USB 2.0 traffic would always work perfectly fine but problems would arise when estabilishing 3.0 links.
So, long story short, please check other USB 3.0 ports and that those you are using are well connected. Although I was suspecting the PSU of the external disk (12V 1.5A) and a bug in the kernel (as I read here and elsewhere), in the end it was just a cable problem.
Hope it helps.
Thanks @legolas558, I checked all cables and exchanged most of my USB equipment (hubs and cables). I tested in combinations which would reveal a faulty USB part. Inspired by your comment I opened the machine and checked the internal cable connections as well. They're all fine.
I `ddrescue` test case causes a kernel panic in 3.17-rc6, now, photo attached[1]. This occurs ~90 % of the time. I attach the log file of `ddrescue` 1.7.0 as well[2], maybe it can serve to produce a test file for you. It definitely works in the non-persistent live system based on 3.13.0-24-generic.
---
[1] I'm currently stuck with `linux-crashtools`, please help me http://
[2] Although the wiki states no compressed attachements, there's no way of figuring out how to attach a second file...
Panic confirmed on 3.17-rc7.
In the meantime I tested with 3.2.64, 3.4.104, 3.10.60 and 3.12.32 and had no problem, i.e. GNU `ddrescue ` 1.17 processes the device without kernel panic (1.5 TB with > 7000 errors recognized and copied on a damaged device).
In 3.14.24 I don't get a kernel panic, but `ddrescue` get stuck at reading a damaged block while `dmesg` shows
[ 2074.174135] sdh: sdh1 sdh9
[ 2462.587818] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
[ 2462.603164] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
[ 2462.603168] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
[ 2502.601314] usb 4-1.3.1.3: reset SuperSpeed USB device number 9 using xhci_hcd
[ 2502.616594] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586d80
[ 2502.616602] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff88041d586dc0
[ 2647.252571] INFO: task usb-storage:604 blocked for more than 120 seconds.
[ 2647.252580] Tainted: PF W O 3.14.24-
[ 2647.252591] "echo 0 > /proc/sys/
[ 2647.252593] usb-storage D ffffffff81811ae0 0 604 2 0x00000000
[ 2647.252596] ffff88041cb55af8 0000000000000046 ffff88041cb55af8 ffff88041cb55fd8
[ 2647.252599] 0000000000014540 0000000000014540 ffffffff81c144a0 ffff88041cb0a7c0
[ 2647.252601] 0000000100000000 ffff880422b9d808 7fffffffffffffff 7fffffffffffffff
[ 2647.252603] Call Trace:
[ 2647.252609] [<ffffffff8177a
[ 2647.252612] [<ffffffff81779
[ 2647.252616] [<ffffffff8156b
[ 2647.252618] [<ffffffff8177b
[ 2647.252620] [<ffffffff8156c
[ 2647.252624] [<ffffffff810a4
[ 2647.252626] [<ffffffff8156f
[ 2647.252646] [<ffffffffa019f
[ 2647.252651] [<ffffffffa019f
[ 2647.252655] [<ffffffffa019f
[ 2647.252659] [<ffffffffa019f
[ 2647.252662] [<ffffffff81779
[ 2647.252666] [<ffffffffa01a0
[ 2647.252668] [<ffffffff8177b
[ 2647.252672] [<ffffffffa019e
[ 2647.252676] [<ffffffffa01a1
[ 2647.252681] [<ffffffffa01a1
[ 2647.252683] [<ffffffff81093
[ 2647.252685] [<ffffffff81092
[ 2647.252687] [<ffffffff81787
[ 2647....
gazhay (gazhay) wrote : | #14 |
I'm finding xhci unreliable across the board since 14.x, worse in 14.10
Scanners (usb2) that previously worked in usb3 ports now do not work, causing seg faults in applications and all kinds of errors in dmesg. (segfault at 0 ip 00007f1f79b9dd3f sp 00007f1fa283ca10 error 4 in libsane-
There are also unexpected length errors, and device not found errors.
I've found this fixed in 3.12.32 and in 3.17.4 (no issues in 3.17.5 and 3.17.6 as well) in all kernels in between issues occured. I reported more information on different kernels and different error messages occuring when reproducing the issue on them on the same issue reported by another person (Ubuntu guidelines discourage this sort of helpful cross posting), but launchpad linking and search facilities are so bad, that I don't have energy to compensate them now. @gazhay try > 3.17.4 or 3.12.x with x >= 32. Maybe 3.18.0 contains a regression again so that it stopped working...
technik007_cz (technik007-cz) wrote : | #16 |
I'm running on kernel 3.13.0-
technik007_cz (technik007-cz) wrote : | #17 |
I gonna try 3.17.4 kernel or up Karl...
Changed in linux (Ubuntu): | |
status: | Confirmed → Fix Committed |
Scott (e2e8e2) wrote : | #18 |
The only kernel version that works for me is 3.13.0.24.28 . I just tried upgrading Ubuntu to 3.13.0.44.51 and the problem came back with a vengeance (destroyed my software raid configuration of 2 usb 3 drives).
Alan Pope 🍺🐧🐱 🦄 (popey) wrote : | #19 |
This is marked "Fix committed" but I'm still seeing it on 3.19.0-9-generic on 15.04.
dding to a USB3 key attached to a USB3 hub causes my syslog to be spammed with a bunch of:-
Mar 16 11:54:24 deep-thought kernel: [ 5515.310662] usb 2-1.1: reset SuperSpeed USB device number 7 using xhci_hcd
Mar 16 11:54:24 deep-thought kernel: [ 5515.327096] xhci_hcd 0000:0e:00.0: xHCI xhci_drop_endpoint called with disabled ep ffff880032dc5cc0
As well as some:-
Mar 16 11:55:26 deep-thought kernel: [ 5576.916045] hub 2-1:1.0: hub_port_status failed (err = -71)
Joseph Salisbury (jsalisbury) wrote : | #20 |
The v4.0 -rc4 kernel is now available. Can folks affected by this bug test the latest kernel? It can be downloaded from:
http://
Changed in linux (Ubuntu Vivid): | |
status: | Fix Committed → Confirmed |
Changed in linux (Ubuntu Trusty): | |
status: | New → Confirmed |
Changed in linux (Ubuntu Utopic): | |
status: | New → Confirmed |
importance: | Undecided → High |
Changed in linux (Ubuntu Trusty): | |
importance: | Undecided → High |
tags: | added: vivid |
Alan Pope 🍺🐧🐱 🦄 (popey) wrote : | #21 |
Tried 4.0-rc4 on vivid and saw none of the xhci error messages I saw on 3.19. I performed exactly the same operation using the same hardware, namely ddrescuing an ~7.5GB image to an 8GB stick.
alan@deep-
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: xHCI Host Controller
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: new USB bus registered, assigned bus number 1
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: hcc params 0x014042cb hci version 0x96 quirks 0x00000004
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: xHCI Host Controller
[Mon Mar 16 19:48:40 2015] xhci_hcd 0000:0e:00.0: new USB bus registered, assigned bus number 2
[Mon Mar 16 19:49:49 2015] usb 2-1: new SuperSpeed USB device number 2 using xhci_hcd
[Mon Mar 16 19:49:50 2015] usb 1-1: new high-speed USB device number 2 using xhci_hcd
[Mon Mar 16 19:49:59 2015] usb 2-1.3: new SuperSpeed USB device number 3 using xhci_hcd
[Mon Mar 16 19:54:59 2015] usb 2-1.3: new SuperSpeed USB device number 4 using xhci_hcd
Joseph Salisbury (jsalisbury) wrote : | #22 |
I'd like to perform a bisect to figure out what commit caused this regression. We need to identify the earliest kernel where the issue started happening as well as the latest kernel that did not have this issue. Reading the comments, it sounds like this first started in the 3.13.0-25 Ubuntu kernel, which is based on upstream 3.13.10.
Can folks affected by this bug test the following kernels and post back
3.13.9: http://
3.13.10: http://
Joseph Salisbury (jsalisbury) wrote : | #23 |
I re-read comment #21, and you 'Saw none' of the errors with 4.0, which means this bug might be fixed there. It might be better of to perform a 'Reverse' bisect to identify the commit that fixes this bug and requested it in the prior stable kernels.
Can you test the following kernels and report back? We are looking for the last kernel version that has the bug and the first that does not:
v4.0-rc1: http://
v4.0-rc2: http://
v4.0-rc3: http://
tags: | added: performing-bisect |
Thank you for taking the time to file a bug report on this issue.
However, given the number of bugs that the Kernel Team receives during any development cycle it is impossible for us to review them all. Therefore, we occasionally resort to using automated bots to request further testing. This is such a request.
We have noted that there is a newer version of the development kernel than the one you last tested when this issue was found. Please test again with the newer kernel and indicate in the bug if this issue still exists or not.
You can update to the latest development kernel by simply running the following commands in a terminal window:
sudo apt-get update
sudo apt-get dist-upgrade
If the bug still exists, change the bug status from Incomplete to Confirmed. If the bug no longer exists, change the bug status from Incomplete to Fix Released.
If you want this bot to quit automatically requesting kernel tests, add a tag named: bot-stop-nagging.
Thank you for your help, we really do appreciate it.
Changed in linux (Ubuntu): | |
status: | Confirmed → Incomplete |
tags: | added: kernel-request-3.19.0-9.9 |
Changed in linux (Ubuntu Vivid): | |
status: | Incomplete → Confirmed |
Stefan (steffel) wrote : | #25 |
Experiencing a problem on xhci module with an Opticon Barcode Scanner NLV-1001 (ttyUSB device). After a few barcode-
Tried:
- 3.13.0-35-generic
- 3.16.0-34-generic
- 3.17.0-
- 3.19.1-
Without USB3 (when disabled in BIOS) - ehci module is used, then, and no problems show up.
Could anyone tell me whether my problem with following trace is related to this issue?
Thank you for your help!
(kernel.log)
Apr 27 10:32:10 myhost123 kernel: [ 3721.041230] INFO: task myapp:2920 blocked for more than 120 seconds.
Apr 27 10:32:10 myhost123 kernel: [ 3721.041235] Tainted: G OE 3.19.1-
Apr 27 10:32:10 myhost123 kernel: [ 3721.041236] "echo 0 > /proc/sys/
Apr 27 10:32:10 myhost123 kernel: [ 3721.041237] myapp D e4995b74 0 2920 2477 0x00000004
Apr 27 10:32:10 myhost123 kernel: [ 3721.041240] e4995be0 00200086 00000000 e4995b74 e85ba000 0899fc06 000002fb 00000001
Apr 27 10:32:10 myhost123 kernel: [ 3721.041244] 00000001 e4995fec c15511bf c1b88f40 e51b0e00 e926cf40 e48cee40 e5730620
Apr 27 10:32:10 myhost123 kernel: [ 3721.041247] 00000000 00000000 e4cfb940 00000001 00000000 e863f070 e5376610 00000c01
Apr 27 10:32:10 myhost123 kernel: [ 3721.041250] Call Trace:
Apr 27 10:32:10 myhost123 kernel: [ 3721.041258] [<c15511bf>] ? xhci_queue_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041261] [<c1548d5d>] ? xhci_urb_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041263] [<c16eac03>] schedule+0x23/0x60
Apr 27 10:32:10 myhost123 kernel: [ 3721.041266] [<c16ed135>] schedule_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041270] [<c1509180>] ? usb_hcd_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041272] [<c150a480>] ? usb_submit_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041275] [<c11808f4>] ? vmap_pmd_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041277] [<c16ebbb5>] wait_for_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041280] [<c1089770>] ? try_to_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041282] [<c150b541>] usb_start_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041284] [<c1194feb>] ? __kmalloc+
Apr 27 10:32:10 myhost123 kernel: [ 3721.041286] [<c1509ff9>] ? usb_alloc_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041288] [<c150b83b>] usb_control_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041295] [<f058b6fa>] send_control_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041297] [<f058b830>] opticon_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041300] [<f05d50d1>] serial_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041303] [<c1416791>] tty_port_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041307] [<f05d5b7c>] serial_
Apr 27 10:32:10 myhost123 kernel: [ 3721.041309] [<c140f22e>] tty_open+0x3e/0x3f0
Apr 27 10:32:10 myhost123 kernel: [ 3721.041312] [<c11ae...
Thank you for taking the time to file a bug report on this issue.
However, given the number of bugs that the Kernel Team receives during any development cycle it is impossible for us to review them all. Therefore, we occasionally resort to using automated bots to request further testing. This is such a request.
We have noted that there is a newer version of the development kernel than the one you last tested when this issue was found. Please test again with the newer kernel and indicate in the bug if this issue still exists or not.
You can update to the latest development kernel by simply running the following commands in a terminal window:
sudo apt-get update
sudo apt-get dist-upgrade
If the bug still exists, change the bug status from Incomplete to Confirmed. If the bug no longer exists, change the bug status from Incomplete to Fix Released.
If you want this bot to quit automatically requesting kernel tests, add a tag named: bot-stop-nagging.
Thank you for your help, we really do appreciate it.
Changed in linux (Ubuntu): | |
status: | Confirmed → Incomplete |
tags: | added: kernel-request-3.19.0-15.15 |
NightShade (tim-night-shade) wrote : | #27 |
I was having the same problem with a number of USB3 harddisks, upgrading to 4.0.1 from the mainline PPA has fixed it for me.
I think it might be triggered by my Yubikey security key. I get "WARN Event TRB for slot 8 ep 4 with no TDs queued?" when gpg accesses the key.
Marcos Vives Del Sol (socram8888) wrote : | #28 |
I am still experiencing this issue as of now:
marcos@
Ign http://
Ign http://
Ign http://
Obj http://
Des:1 http://
Obj http://
Obj http://
Des:2 http://
Ign http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Obj http://
Des:3 http://
Des:4 http://
Obj http://
Des:5 http://
Scott (e2e8e2) wrote : | #29 |
I know people are looking at this, but it' been a long time, and since this bug can and does cause data corruption (it happened to me) I'd hope that this one would be near the top of the heap.
Joseph Salisbury (jsalisbury) wrote : | #30 |
Can folks affected by this bug test the following kernels and report back? We are looking for the last kernel version that has the bug and the first that does not:
v4.0-rc1: http://
v4.0-rc2: http://
v4.0-rc3: http://
NightShade (tim-night-shade) wrote : | #31 |
tim@desktop-tim:~$ uname -a
Linux desktop-tim 4.0.1-040001-
No bug, I'll pull the other kernels down and test them over the next few days
Changed in linux (Ubuntu Utopic): | |
status: | Confirmed → Incomplete |
Changed in linux (Ubuntu Trusty): | |
status: | Confirmed → Incomplete |
Manuel Iglesias Alonso (glesialo) wrote : | #32 |
I had to plug my USB3 external drive into a (much slower) USB2 socket to avoid data being corrupted by this bug.
Could you please reactivate this report and start looking for a solution?
Thanks.
gazhay (gazhay) wrote : | #33 |
Recently bought a new machine and put 15.04 on it fresh.
I can confirm that my scanner which I had got working thanks to a usb2 card, no longer works as the motherboard of my new machine forces you to use xhci for all ports regardless.
There still seems to be some sort of bug in vivid.
Mikhail (mikhail-kuzmin) wrote : | #34 |
I'm having the same problem on 3.13.0-
Scott (e2e8e2) wrote : | #35 |
If anyone has the time to test this (the system it's happening to me on is off site and I won't be back to it for a while), see if the problem still occurs if you connect the device through a USB 3 hub rather than directly to the USB port. It just occurred to me that somewhere in the process of trying to circumvent this problem I added a USB 3 hub to the mix, and now I'm wondering if it's the hub that's circumventing the problem or if backing down to 3.13.0.24.28 did the trick.
As it doesn't look like this bug is going to receive attention anytime soon I thought it might be worth someone doing a test. If there aren't any results by the time I get back to my system that was having trouble in a couple of months I'll try it myself.
Yuriy Vidineev (adeptg) wrote : | #36 |
Dell XPS 13 9333, Ubuntu 14.04, 3.19.0-26-generic, USB 3.0 HDD enclosure (ASMedia AS2105). Directly connected to USB3 port - not detected at all. Connected to USB2.0 hub - detected and perfectly worked. Connected to USB 3.0 hub (ASIX AX88179) - determined ~50% times (~50% no any line in syslog after plug in). When it successfully determined in USB3 hub - works after it without any problem (however my test was short - 15 min rsync with a lot of files (my home folder))
Scott (e2e8e2) wrote : | #37 |
So there was a difference between being plugged into a hub and not. Mine is working with 3.13.0.24.28, but I also have the hard drives connected through a USB 3 hub. I'm not sure if this information will help find the bug, but it would be nice if we could determine that using a hub can circumvent the problem.
I also am quite perplexed as to why this bug hasn't been addressed as it's about a year old now and it can cause data corruption. One would think that bugs which have the potential to cause data corruption would be among the highest priority to be fixed.
daniel lopez (dlopez7892) wrote : | #38 |
I'm using a ASUS Rampage IV Black edition with Ubuntu currently and am also experiencing this issue. I've tested several kernel versions and distros, and this is not an issue unique to Ubuntu. It will happen on pretty much any kernel =>3.15 with certain ASMedia USB 3.0 controllers. I'm not sure if it is isolated to ASMedia ICs, but it does impact at least two ASmedia products. The upstream kernel devs are aware of the issue and don't seem very optimistic about being able to fix it. Apparently there is a poor design for a PCI to USB bridge used in these controllers, and the possibility of fixing it without ASMedia or the motherboard manufacturers contributing code is fairly slim. For me, I had to disable USB 3.0 entirely to even get Linux =>3.15 to boot on this board without having a kernel panic. While I'd love to see this get fixed, I'm not sure that chasing this down is worth the Ubuntu kernel team's time.
Scott (e2e8e2) wrote : | #39 |
It's not exclusive to that controller. It's happening to me with a NEC uPD720200 USB 3.0 Host Controller, which is a very common USB 3 controller chip.
Malachi de AElfweald (malachid) wrote : | #40 |
I see the same problem with a hub plugged in and nothing attached to it. This happens repeatedly. I've reported it to them as well, but it seems to be more of a problem with the xhci_hcd. This is with 3.19.0-30.
[ 3323.263466] usb 2-1.1: USB disconnect, device number 103
[ 3323.284305] usb 2-1.4: USB disconnect, device number 104
[ 3323.398767] usb 2-1: reset SuperSpeed USB device number 2 using xhci_hcd
[ 3323.416724] xhci_hcd 0000:04:00.0: xHCI xhci_drop_endpoint called with disabled ep ffff8804288644e0
[ 3323.697914] usb 2-1.1: new SuperSpeed USB device number 105 using xhci_hcd
[ 3323.713630] usb 2-1.1: New USB device found, idVendor=05e3, idProduct=0612
[ 3323.713633] usb 2-1.1: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 3323.713635] usb 2-1.1: Product: USB3.0 Hub
[ 3323.713636] usb 2-1.1: Manufacturer: SKIVA TECHNOLOGIES INC
[ 3323.713638] usb 2-1.1: SerialNumber: t2
[ 3323.715331] hub 2-1.1:1.0: USB hub found
[ 3323.715596] hub 2-1.1:1.0: 4 ports detected
[ 3323.789786] usb 2-1.4: new SuperSpeed USB device number 106 using xhci_hcd
[ 3323.805724] usb 2-1.4: New USB device found, idVendor=05e3, idProduct=0612
[ 3323.805727] usb 2-1.4: New USB device strings: Mfr=1, Product=2, SerialNumber=3
[ 3323.805729] usb 2-1.4: Product: USB3.0 Hub
[ 3323.805731] usb 2-1.4: Manufacturer: SKIVA TECHNOLOGIES INC
[ 3323.805732] usb 2-1.4: SerialNumber: t3
[ 3323.807841] hub 2-1.4:1.0: USB hub found
[ 3323.808171] hub 2-1.4:1.0: 4 ports detected
Malachi de AElfweald (malachid) wrote : | #41 |
Bug did not go away with 15.10 / 4.2.0-16
Malachi de AElfweald (malachid) wrote : | #42 |
Regarding comment #37
If I plug an Android phone (HTC One m7 GPe) to the same port instead of the USB3 Hub, it stays connected.
If I connect the USB hub instead, I get this problem.
If I connect the phone through the USB hub, I loose access to the phone rather quickly during one of the drop cycles.
Malachi de AElfweald (malachid) wrote : | #43 |
Could be related to https:/
Malachi de AElfweald (malachid) wrote : | #44 |
I believe I have fixed mine. Perhaps someone else can test the fix on theirs?
From this http://
Edit the /etc/default/grub file and append to the GRUB_CMDLINE_
usbcore.
sudo update-grub
reboot
I don't appear to be getting the disconnects anymore.
Robert Oswald (robert-oswald) wrote : | #45 |
Tried your workaround but problem still exists.
Log:
Nov 6 21:35:38 rodomp01 kernel: [ 289.927150] sd 4:0:0:0: [sdb] uas_eh_
Nov 6 21:35:41 rodomp01 kernel: [ 292.927539] scsi host4: uas_eh_task_mgmt: ABORT TASK timed out
Nov 6 21:35:41 rodomp01 kernel: [ 292.927588] sd 4:0:0:0: uas_eh_
Nov 6 21:35:41 rodomp01 kernel: [ 292.927597] scsi host4: uas_eh_task_mgmt: LOGICAL UNIT RESET: error already running a task
Nov 6 21:35:41 rodomp01 kernel: [ 292.927603] scsi host4: uas_eh_
Nov 6 21:35:41 rodomp01 kernel: [ 292.927669] usb 3-1: stat urb: killed, stream 2
Nov 6 21:35:41 rodomp01 kernel: [ 292.927763] sd 4:0:0:0: [sdb] uas_data_cmplt ffff88021311cd80 tag 0, inflight: CMD abort
Nov 6 21:35:41 rodomp01 kernel: [ 292.927770] sd 4:0:0:0: [sdb] data cmplt err -2 stream 2
Nov 6 21:35:41 rodomp01 kernel: [ 292.927804] sd 4:0:0:0: [sdb] uas_zap_dead ffff88021311cd80 tag 0, inflight: CMD abort
Nov 6 21:35:41 rodomp01 kernel: [ 292.927822] sd 4:0:0:0: [sdb] abort completed
Nov 6 21:35:41 rodomp01 kernel: [ 293.039920] usb 3-1: reset SuperSpeed USB device number 2 using xhci_hcd
Nov 6 21:35:41 rodomp01 kernel: [ 293.056271] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f000
Nov 6 21:35:41 rodomp01 kernel: [ 293.056277] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f048
Nov 6 21:35:41 rodomp01 kernel: [ 293.056280] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f090
Nov 6 21:35:41 rodomp01 kernel: [ 293.056283] xhci_hcd 0000:00:14.0: xHCI xhci_drop_endpoint called with disabled ep ffff8800d620f0d8
Nov 6 21:35:41 rodomp01 kernel: [ 293.057750] scsi host4: uas_eh_
cat /proc/cmdline:
BOOT_IMAGE=
user (user-3) wrote : | #46 |
hi there,
I had the same issue, and grub update helped me to resolve this issue
BOOT_IMAGE=
Scott (e2e8e2) wrote : | #47 |
Since my original post a year ago I have had a couple of occurrences of this problem even on the older kernel version that I'm using. I tried setting nox2apic as a boot parameter, and it appears to work. The only time I was getting the error on that kernel version was when I was resynching a RAID 1 array on 2 2tb USB 3 drives on the same controller. I got the error today when doing that, so I aborted the resynch and rebooted with nox2apic. It resynched all 2tb with no occurrences of the error. Now I'm going to try upgrading the kernel to current and see if I can cause the error again with nox2apic set.
Scott (e2e8e2) wrote : | #48 |
I was able to upgrade to kernel version 3.13.0-74-generic (Ubuntu 14.04 LTS) from 3.13.0.24.28 and everything appears to work fine (i.e. no messages and no lost USB connections) with nox2apic set. I copied hundreds of gigabytes back and forth on USB 3 drives without any problems and a binary compare of the results showed the copies were exact. Unfortunately I can't upgrade any further than that as the next version, Ubuntu 14.10, is now obsolete and the next LTS version isn't out yet; so my installation won't upgrade at all right now. I'm stuck unless I want to do an install from scratch.
databill (julienjut) wrote : | #49 |
I was affected by usb3.0 for years. Once Kernel 3.12(test release, linux-headers-
I have the phenomenon as link described, http://
I'm testing #44 suggestion, and I hope it will help me to step out the issue. Thanks, Malachi de AElfweald (malachid)
I will give back the test result tomorrow.
databill (julienjut) wrote : | #50 |
Issue is resolved! It has been working over 24 Hours.
#uptime
11:02:54 up 1 day, 1:37, 2 users, load average: 0.28, 0.25, 0.24
Thanks again!
solutions refer to #44,
Edit the /etc/default/grub file and append to the GRUB_CMDLINE_
usbcore.
sudo update-grub
sudo reboot
Scott (e2e8e2) wrote : | #51 |
I have found another setting that in my case was the real cause of the problem, namely ASPM (Active State Power Management). I had noticed that I was getting the disabled endpoint error message at odd times, like in the middle of the night, when there was little or no activity to the USB 3 disks attached to the controller. It always struck me as odd that the problem would occur randomly like that and not be clustered around the periods of high activity.
It turns out that ASPM was active by default in the BIOS, buried in a power management setting for PCI devices (i.e. it didn't say ASPM in the setting title). Once I turned that setting off all the errors stopped, both when the devices were active and when they were idle. I'm sort of assuming that the issues happens when ASPM tries to turn off or go to a lower power state on the USB controller. I don't know if this is a bug in the BIOS, firmware of the USB controller, or the linux kernel device driver, or a combination of those. Whichever it is though, turning this off in the BIOS solved everything in every kernel version that I've tested. I also installed Arch Linux so that I could try a recent kernel version and it works find there too. So if you are having this problem and it the other solutions haven't worked for you, look for this setting in your BIOS. It might be in the power management section, but also might be in the peripherals, devices, or advanced PCI settings area.
tags: | removed: performing-bisect |
Rolf Leggewie (r0lf) wrote : | #52 |
utopic has seen the end of its life and is no longer receiving any updates. Marking the utopic task for this ticket as "Won't Fix".
Changed in linux (Ubuntu Utopic): | |
status: | Incomplete → Won't Fix |
Launchpad Janitor (janitor) wrote : | #53 |
[Expired for linux (Ubuntu Vivid) because there has been no activity for 60 days.]
Changed in linux (Ubuntu Vivid): | |
status: | Incomplete → Expired |
Launchpad Janitor (janitor) wrote : | #54 |
[Expired for linux (Ubuntu) because there has been no activity for 60 days.]
Changed in linux (Ubuntu): | |
status: | Incomplete → Expired |
Launchpad Janitor (janitor) wrote : | #55 |
[Expired for linux (Ubuntu Trusty) because there has been no activity for 60 days.]
Changed in linux (Ubuntu Trusty): | |
status: | Incomplete → Expired |
sibulini (sibulini) wrote : | #56 |
Hi everyone, from your discussions, I can't find the root cause about the problem. I meet this issue once in 3.14.55 with android OS. I hope get the root cause. Thanks a lot!
This change was made by a bot.