sil5744 over eSATA regression - repeated kernel failures & device initialisation times out

Bug #910999 reported by racitup
This bug affects 2 people
Affects: linux (Ubuntu)
Status: Incomplete
Importance: Medium
Assigned to: Unassigned

Bug Description

I have recently purchased an external eSATA/USB 2.0 hard drive enclosure (Startech S252U2ERR) which uses the Silicon Image sil5744 RAID SATA controller. The enclosure can take up to two 2.5" HDDs.
There are 4 modes: JBOD, BIG, RAID0 and RAID1.

On Kubuntu 11.10 only the JBOD mode works: both installed disks are detected through the eSATA port multiplier.

In any of the other modes, specifically RAID1, the device never initialises properly and the kernel retries eventually time out.
The following is repeated in the System Log after running KDE Partition Editor:

...
01/02/12 09:42:03 PM ubuntu kernel [ 1194.864697] ata6.00: hard resetting link
01/02/12 09:42:03 PM ubuntu udevd[14719] timeout: killing 'udisks-part-id /dev/sdb' [16632]
01/02/12 09:42:03 PM ubuntu kernel [ 1195.356424] ata6.00: SATA link up 3.0 Gbps (SStatus 123 SControl 310)
01/02/12 09:42:03 PM ubuntu kernel [ 1195.356478] ata6.01: hard resetting link
01/02/12 09:42:03 PM ubuntu kernel [ 1195.676511] ata6.01: SATA link down (SStatus 0 SControl 310)
01/02/12 09:42:03 PM ubuntu kernel [ 1195.676889] ata6.00: configured for UDMA/133
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724152] ata6: EH complete
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724240] ata6.00: failed to read SCR 1 (Emask=0x40)
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724247] ata6.01: failed to read SCR 1 (Emask=0x40)
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724257] ata6.15: exception Emask 0x10 SAct 0x0 SErr 0x800000 action 0x6 frozen
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724274] ata6.15: irq_stat 0x08000000, interface fatal error
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724277] ata6.15: SError: { LinkSeq }
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724281] ata6.00: exception Emask 0x100 SAct 0x1 SErr 0x0 action 0x6 frozen
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724285] ata6.00: failed command: READ FPDMA QUEUED
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724290] ata6.00: cmd 60/08:00:00:00:00/00:00:00:00:00/40 tag 0 ncq 4096 in
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724292] res 50/00:00:00:00:00/00:00:00:00:00/00 Emask 0x100 (unknown error)
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724295] ata6.00: status: { DRDY }
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724298] ata6.01: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
01/02/12 09:42:04 PM ubuntu kernel [ 1195.724303] ata6.15: hard resetting link
01/02/12 09:42:04 PM ubuntu kernel [ 1196.216056] ata6.15: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
01/02/12 09:42:04 PM ubuntu kernel [ 1196.216685] ata6.01: limiting SATA link speed to 1.5 Gbps
01/02/12 09:42:04 PM ubuntu kernel [ 1196.216693] ata6.00: hard resetting link
01/02/12 09:42:04 PM ubuntu udevd[14719] timeout: killing 'udisks-part-id /dev/sdb' [16632]
01/02/12 09:42:05 PM ubuntu kernel [ 1196.708343] ata6.00: SATA link up 3.0 Gbps (SStatus 123 SControl 310)
01/02/12 09:42:05 PM ubuntu kernel [ 1196.708393] ata6.01: hard resetting link
01/02/12 09:42:05 PM ubuntu kernel [ 1197.028570] ata6.01: SATA link down (SStatus 0 SControl 310)
01/02/12 09:42:05 PM ubuntu kernel [ 1197.028946] ata6.00: configured for UDMA/133
01/02/12 09:42:05 PM ubuntu kernel [ 1197.076169] ata6: EH complete
...

Finally ending in:
...
01/02/12 09:42:06 PM ubuntu kernel [ 1198.430570] ata6.00: status: { DRDY }
01/02/12 09:42:06 PM ubuntu kernel [ 1198.430573] ata6.01: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
01/02/12 09:42:06 PM ubuntu kernel [ 1198.430581] ata6.15: hard resetting link
01/02/12 09:42:07 PM ubuntu kernel [ 1198.920059] ata6.15: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
01/02/12 09:42:07 PM ubuntu kernel [ 1198.920589] ata6.01: limiting SATA link speed to 1.5 Gbps
01/02/12 09:42:07 PM ubuntu kernel [ 1198.920594] ata6.00: hard resetting link
01/02/12 09:42:07 PM ubuntu kernel [ 1199.412218] ata6.00: SATA link up 3.0 Gbps (SStatus 123 SControl 310)
01/02/12 09:42:07 PM ubuntu kernel [ 1199.412250] ata6.01: hard resetting link
01/02/12 09:42:07 PM ubuntu udevd[14719] timeout: killing 'udisks-probe-ata-smart /dev/sdb' [16633]
01/02/12 09:42:08 PM ubuntu kernel [ 1199.732455] ata6.01: SATA link down (SStatus 0 SControl 310)
01/02/12 09:42:08 PM ubuntu kernel [ 1199.732935] ata6.00: configured for UDMA/133
01/02/12 09:42:08 PM ubuntu udevd[14719] 'udisks-probe-ata-smart /dev/sdb' [16633] terminated by signal 9 (Killed)
01/02/12 09:42:08 PM ubuntu kernel [ 1199.780454] ata6: EH complete
...

Every second the LEDs blink on the controller as the attempts to initialise the device are retried.
Eventually the initialisation times out and the partition editor opens but does not report the attached disk.
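
The retry pattern described above can be quantified from a saved log. The following is a minimal Python sketch, not part of the original report: it counts `hard resetting link` and `EH complete` messages per ATA link. The function name `count_recovery_events` is hypothetical, the sample lines are shortened copies of the excerpt above, and the regular expression assumes the libata message wording shown in this report.

```python
import re
from collections import Counter

# Matches e.g. "ata6.00: hard resetting link" or "ata6: EH complete".
# The message wording is taken from the log excerpts in this report.
EVENT_RE = re.compile(r"\b(ata\d+(?:\.\d+)?): (hard resetting link|EH complete)")


def count_recovery_events(lines):
    """Return a Counter keyed by (link, message) for libata recovery messages."""
    counts = Counter()
    for line in lines:
        match = EVENT_RE.search(line)
        if match:
            counts[(match.group(1), match.group(2))] += 1
    return counts


if __name__ == "__main__":
    sample = [
        "kernel [ 1194.864697] ata6.00: hard resetting link",
        "kernel [ 1195.356478] ata6.01: hard resetting link",
        "kernel [ 1195.724152] ata6: EH complete",
        "kernel [ 1195.724303] ata6.15: hard resetting link",
        "kernel [ 1196.216693] ata6.00: hard resetting link",
        "kernel [ 1197.076169] ata6: EH complete",
    ]
    for (link, message), n in sorted(count_recovery_events(sample).items()):
        print(f"{link}: {message} x{n}")
```

Comparing the counts from a working and a failing kernel over the same time window gives a rough measure of how much worse the recovery loop is.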

The exact same hardware works fine under Ubuntu 10.04, hence the "regression" heading.
The system log on 10.04 reports the following:
...
Jan 2 20:58:22 adventure kernel: [ 293.958683] ata6: hard resetting link
Jan 2 20:58:23 adventure kernel: [ 294.880095] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Jan 2 20:58:23 adventure kernel: [ 294.880822] ata6.15: Port Multiplier 1.1, 0x1095:0x5744 r33, 2 ports, feat 0x1/0x9
Jan 2 20:58:23 adventure kernel: [ 294.881131] ata6.00: hard resetting link
Jan 2 20:58:24 adventure kernel: [ 295.410555] ata6.00: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
Jan 2 20:58:24 adventure kernel: [ 295.410617] ata6.01: hard resetting link
Jan 2 20:58:24 adventure kernel: [ 295.760728] ata6.01: SATA link down (SStatus 0 SControl 320)
Jan 2 20:58:24 adventure kernel: [ 295.761029] ata6.00: ATA-7: External Disk 0, 1.1597, max UDMA/133
Jan 2 20:58:24 adventure kernel: [ 295.761038] ata6.00: 156301488 sectors, multi 1: LBA48 NCQ (depth 31/32)
Jan 2 20:58:24 adventure kernel: [ 295.761278] ata6.00: configured for UDMA/133
Jan 2 20:58:24 adventure kernel: [ 295.761389] ata6: EH complete
Jan 2 20:58:24 adventure kernel: [ 295.761677] scsi 5:0:0:0: Direct-Access ATA External Disk 0 1.15 PQ: 0 ANSI: 5
Jan 2 20:58:24 adventure kernel: [ 295.762115] sd 5:0:0:0: Attached scsi generic sg2 type 0
Jan 2 20:58:24 adventure kernel: [ 295.768799] sd 5:0:0:0: [sdb] 156301488 512-byte logical blocks: (80.0 GB/74.5 GiB)
Jan 2 20:58:24 adventure kernel: [ 295.768919] sd 5:0:0:0: [sdb] Write Protect is off
Jan 2 20:58:24 adventure kernel: [ 295.768989] sd 5:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
Jan 2 20:58:24 adventure kernel: [ 295.769946] sdb: unknown partition table
Jan 2 20:58:24 adventure kernel: [ 295.773805] sd 5:0:0:0: [sdb] Attached SCSI disk
Jan 2 20:58:28 adventure kernel: [ 300.040123] Machine check events logged
...
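
The `SStatus` values in the logs above are printed in hexadecimal and pack three fields of the SATA SStatus register: DET (bits 3:0), SPD (bits 7:4) and IPM (bits 11:8). A minimal sketch of the decoding follows; the function name `decode_sstatus` is hypothetical, and only the speed values that appear in this report are given names.

```python
def decode_sstatus(hex_text):
    """Split a SATA SStatus value, as printed in hex by libata, into its fields."""
    value = int(hex_text, 16)
    return {
        "det": value & 0xF,          # device detection, bits 3:0
        "spd": (value >> 4) & 0xF,   # negotiated speed, bits 7:4
        "ipm": (value >> 8) & 0xF,   # interface power management, bits 11:8
    }


# Names only for the SPD values seen in this report.
SPEED_NAMES = {0: "no negotiated speed", 1: "1.5 Gbps", 2: "3.0 Gbps"}

if __name__ == "__main__":
    for text in ("123", "113", "0"):
        fields = decode_sstatus(text)
        print(text, fields, SPEED_NAMES.get(fields["spd"], "unknown"))
```

Under this decoding `123` has SPD 2, matching the reported 3.0 Gbps links, and `113` has SPD 1, matching the 1.5 Gbps reported for `ata6.15` after a hard reset in the failing case.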

ProblemType: Bug
DistroRelease: Ubuntu 11.10
Package: linux-image-3.0.0-12-generic 3.0.0-12.20
ProcVersionSignature: Ubuntu 3.0.0-12.20-generic 3.0.4
Uname: Linux 3.0.0-12-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24.
ApportVersion: 1.23-0ubuntu3
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: STAC92xx Analog [STAC92xx Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: ubuntu 15082 F.... pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xd9300000 irq 47'
   Mixer name : 'IDT 92HD71B7X'
   Components : 'HDA:111d76b2,103c1505,00100302 HDA:11c11040,103c137e,00100200'
   Controls : 20
   Simple ctrls : 12
CasperVersion: 1.287
Date: Mon Jan 2 21:33:30 2012
HotplugNewDevices:

HotplugNewMounts:

LiveMediaBuild: Kubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012)
MachineType: Hewlett-Packard HP Pavilion dv3500 Notebook PC
ProcEnviron:
 LANGUAGE=
 LANG=en_US.UTF-8
 SHELL=/bin/bash
ProcKernelCmdLine: BOOT_IMAGE=/casper/vmlinuz noprompt cdrom-detect/try-usb=true persistent file=/cdrom/preseed/khostname.seed boot=casper maybe-ubiquity initrd=/casper/initrd.lz quiet splash -- keyboard-configuration/layoutcode=gb
RelatedPackageVersions:
 linux-restricted-modules-3.0.0-12-generic N/A
 linux-backports-modules-3.0.0-12-generic N/A
 linux-firmware 1.60
SourcePackage: linux
Symptom: storage
UdevMonitorLog:
 monitor will print the received events for:
 UDEV - the event which udev sends out after rule processing
UdisksMonitorLog: Monitoring activity from the disks daemon. Press Ctrl+C to cancel.
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 08/25/2009
dmi.bios.vendor: Hewlett-Packard
dmi.bios.version: F.18
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: 1505
dmi.board.vendor: Inventec
dmi.board.version: KBC Version 13.16
dmi.chassis.asset.tag: CNU8404BRJ
dmi.chassis.type: 10
dmi.chassis.vendor: Inventec
dmi.chassis.version: N/A
dmi.modalias: dmi:bvnHewlett-Packard:bvrF.18:bd08/25/2009:svnHewlett-Packard:pnHPPaviliondv3500NotebookPC:pvrF.18:rvnInventec:rn1505:rvrKBCVersion13.16:cvnInventec:ct10:cvrN/A:
dmi.product.name: HP Pavilion dv3500 Notebook PC
dmi.product.version: F.18
dmi.sys.vendor: Hewlett-Packard

racitup (racitup) wrote :

I have just updated to the latest everything on 11.10 and still have the issue:

uname -a
Linux zbase 3.0.0-14-generic #23-Ubuntu SMP Mon Nov 21 20:28:43 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-meta (Ubuntu):
status: New → Confirmed
Michael Behrns-Miller (q8q-m-ks2) wrote :

This issue is affecting me as well. 2.6 kernels worked with the sil5744 with no issues during heavy usage. 3.0 and 3.1 kernels have consistently corrupted the RAID shortly after access. I am using the ext3 file system. Here is the kernel output from startup. I am new to this, so let me know if I need to post additional info.

Nov 9 21:28:52 dune kernel: [ 491.868554] ata4.00: hard resetting link
Nov 9 21:28:52 dune kernel: [ 492.328216] ata4.00: SATA link up 3.0 Gbps (SStatus 123 SControl 310)
Nov 9 21:28:52 dune kernel: [ 492.328243] ata4.01: hard resetting link
Nov 9 21:28:53 dune kernel: [ 492.634322] ata4.01: SATA link down (SStatus 0 SControl 310)
Nov 9 21:28:53 dune kernel: [ 492.634621] ata4.00: configured for UDMA/133
Nov 9 21:28:53 dune kernel: [ 492.667079] sd 3:0:0:0: [sdc] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE
Nov 9 21:28:53 dune kernel: [ 492.667084] sd 3:0:0:0: [sdc] Sense Key : Aborted Command [current] [descriptor]
Nov 9 21:28:53 dune kernel: [ 492.667089] Descriptor sense data with sense descriptors (in hex):
Nov 9 21:28:53 dune kernel: [ 492.667091] 72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00
Nov 9 21:28:53 dune kernel: [ 492.667097] 00 00 00 00
Nov 9 21:28:53 dune kernel: [ 492.667099] sd 3:0:0:0: [sdc] Add. Sense: No additional sense information
Nov 9 21:28:53 dune kernel: [ 492.667102] sd 3:0:0:0: [sdc] CDB: Read(10): 28 00 92 06 37 57 00 00 08 00
Nov 9 21:28:53 dune kernel: [ 492.667108] end_request: I/O error, dev sdc, sector 2449880919
Nov 9 21:28:53 dune kernel: [ 492.667128] ata4: EH complete
Nov 9 21:28:53 dune kernel: [ 492.667250] ata4.00: failed to read SCR 1 (Emask=0x40)
Nov 9 21:28:53 dune kernel: [ 492.667258] ata4.01: failed to read SCR 1 (Emask=0x40)
Nov 9 21:28:53 dune kernel: [ 492.667267] ata4.15: exception Emask 0x50 SAct 0x0 SErr 0x800 action 0x6 frozen
Nov 9 21:28:53 dune kernel: [ 492.667270] ata4.15: irq_stat 0x08000000, interface fatal error
Nov 9 21:28:53 dune kernel: [ 492.667273] ata4.15: SError: { HostInt }
Nov 9 21:28:53 dune kernel: [ 492.667276] ata4.00: exception Emask 0x100 SAct 0x1 SErr 0x0 action 0x6 frozen
Nov 9 21:28:53 dune kernel: [ 492.667280] ata4.00: failed command: READ FPDMA QUEUED
Nov 9 21:28:53 dune kernel: [ 492.667285] ata4.00: cmd 60/08:00:bf:57:06/00:00:92:00:00/40 tag 0 ncq 4096 in
Nov 9 21:28:53 dune kernel: [ 492.667286] res 50/00:00:00:00:00/00:00:00:00:00/00 Emask 0x100 (unknown error)
Nov 9 21:28:53 dune kernel: [ 492.667288] ata4.00: status: { DRDY }
Nov 9 21:28:53 dune kernel: [ 492.667290] ata4.01: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
Nov 9 21:28:53 dune kernel: [ 492.667297] ata4.15: hard resetting link
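
The `cmd` line in the excerpt above can be decoded to see which sectors the failed read targeted. This is a minimal sketch, assuming the field order libata uses when printing a taskfile (command/feature:nsect:lbal:lbam:lbah/hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device) and the NCQ convention that the sector count travels in the feature fields and the tag in the upper bits of nsect. The function name `decode_ncq_cmd` is hypothetical.

```python
def decode_ncq_cmd(cmd_text):
    """Decode the 'cmd' field of a libata error report for an NCQ command.

    Assumed field order:
    command/feature:nsect:lbal:lbam:lbah/
    hob_feature:hob_nsect:hob_lbal:hob_lbam:hob_lbah/device
    """
    groups = [[int(b, 16) for b in part.split(":")] for part in cmd_text.split("/")]
    command = groups[0][0]
    feature, nsect, lbal, lbam, lbah = groups[1]
    hob_feature, _hob_nsect, hob_lbal, hob_lbam, hob_lbah = groups[2]
    device = groups[3][0]

    # 48-bit LBA assembled from the low and high-order-byte (hob) registers.
    lba = (lbal | (lbam << 8) | (lbah << 16)
           | (hob_lbal << 24) | (hob_lbam << 32) | (hob_lbah << 40))
    return {
        "command": command,
        "lba": lba,
        "sectors": feature | (hob_feature << 8),  # NCQ: count is in the feature fields
        "tag": nsect >> 3,                        # NCQ: tag is in bits 7:3 of nsect
        "device": device,
    }


if __name__ == "__main__":
    decoded = decode_ncq_cmd("60/08:00:bf:57:06/00:00:92:00:00/40")
    print(hex(decoded["command"]), hex(decoded["lba"]),
          decoded["sectors"] * 512, "bytes, tag", decoded["tag"])
```

For the line above this gives command 0x60 (READ FPDMA QUEUED), 8 sectors (4096 bytes, consistent with `ncq 4096 in`) and an LBA of 0x920657bf, close to the sector named in the preceding I/O error.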

Michael Behrns-Miller (q8q-m-ks2) wrote :

I am using a MobileSTOR MS2UT+B external eSATA enclosure in RAID 1 mode.
http://www.sansdigital.com/mobilestor/ms2utplusb.html

racitup (racitup) wrote :

Hi Michael, please could you post the latest kernel version you found this bug in; you mention 3.1?

Michael Behrns-Miller (q8q-m-ks2) wrote :

Sorry, I've been dealing with this for months, let me try to gather up more info: 2.6.34 worked perfectly. The Nov 9 log was using kernel 3.0.6 I believe. 3.1.0 had the same problem. Right now, I'm using this:

Linux dune 3.1.6-gentoo #1 SMP Mon Jan 2 13:51:06 EST 2012 x86_64 AMD Phenom(tm) 9850 Quad-Core Processor AuthenticAMD GNU/Linux

The problem seems to have dissipated some with this kernel. I can mount the raid and ls the contents. But after a while it became unavailable. I'll try to gather more information. Let me know if there's something specific I can provide. Here is dmesg output, the Sans Digital MobileSTOR MS2UT+B raid1 is ata4:

ata4: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata3: softreset failed (device not ready)
ata3: applying PMP SRST workaround and retrying
ata1: softreset failed (device not ready)
ata1: applying PMP SRST workaround and retrying
ata4.15: Port Multiplier 1.1, 0x1095:0x5744 r33, 2 ports, feat 0x1/0x9
ata4.00: hard resetting link
ata3: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
ata3.00: ATA-8: ST31500341AS, SD1A, max UDMA/133
ata3.00: 2930277168 sectors, multi 16: LBA48 NCQ (depth 31/32)
ata3.00: configured for UDMA/133
usb 4-1: new low speed USB device number 2 using ohci_hcd
ata1.00: ATA-7: ST3400620AS, 3.AAC, max UDMA/133
ata1.00: 781422768 sectors, multi 16: LBA48 NCQ (depth 31/32)
ata1.00: configured for UDMA/133
scsi 0:0:0:0: Direct-Access ATA ST3400620AS 3.AA PQ: 0 ANSI: 5
sd 0:0:0:0: [sda] 781422768 512-byte logical blocks: (400 GB/372 GiB)
sd 0:0:0:0: Attached scsi generic sg0 type 0
scsi 2:0:0:0: Direct-Access ATA ST31500341AS SD1A PQ: 0 ANSI: 5
sd 0:0:0:0: [sda] Write Protect is off
sd 0:0:0:0: [sda] Mode Sense: 00 3a 00 00
sd 0:0:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
sd 2:0:0:0: [sdb] 2930277168 512-byte logical blocks: (1.50 TB/1.36 TiB)
sd 2:0:0:0: Attached scsi generic sg1 type 0
sd 2:0:0:0: [sdb] Write Protect is off
sd 2:0:0:0: [sdb] Mode Sense: 00 3a 00 00
sd 2:0:0:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
 sdb: sdb1
sd 2:0:0:0: [sdb] Attached SCSI disk
 sda: sda1 sda2 < sda5 sda6 > sda3 sda4
sd 0:0:0:0: [sda] Attached SCSI disk
input: Logitech USB Receiver as /devices/pci0000:00/0000:00:12.1/usb4/4-1/4-1:1.0/input/input2
logitech 0003:046D:C517.0001: input: USB HID v1.10 Keyboard [Logitech USB Receiver] on usb-0000:00:12.1-1/input0
logitech 0003:046D:C517.0002: fixing up Logitech keyboard report descriptor
input: Logitech USB Receiver as /devices/pci0000:00/0000:00:12.1/usb4/4-1/4-1:1.1/input/input3
logitech 0003:046D:C517.0002: input: USB HID v1.10 Mouse [Logitech USB Receiver] on usb-0000:00:12.1-1/input1
ata4.00: SATA link up 3.0 Gbps (SStatus 123 SControl 320)
ata4.01: hard resetting link
usb 4-2: new low speed USB device number 3 using ohci_hcd
ata4.01: SATA link down (SStatus 0 SControl 320)
ata4.00: ATA-7: External Disk 0, 1.1583, max UDMA/133
ata4.00: 2930277168 sectors, multi 1: LBA48 NCQ (depth 31/32)
ata4.00: configured for UDMA/133
ata4: EH complete
scsi 3:0:0:0: Direct-Access ATA Externa...


Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . If possible, please test the latest v3.2-rcN kernel (not a kernel in the daily directory). Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag (only that one tag; please leave the other tags). This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed by the mainline kernel, please add the following tag: 'kernel-fixed-upstream-KERNEL-VERSION'. For example, if kernel version 3.2-rc1 fixed the issue, the tag would be: 'kernel-fixed-upstream-v3.2-rc1'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'. If you believe this bug does not require upstream testing, please add the tag: 'kernel-upstream-testing-not-needed'.

Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

affects: linux-meta (Ubuntu) → linux (Ubuntu)
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
tags: added: needs-upstream-testing
racitup (racitup) wrote :

Just a bit more information:
This appears to have been broken somewhere in the 2.6 series: I have just tried a fresh install (USB boot) of natty (11.04) with kernel 2.6.38-8 and it too is broken, though not quite as badly as with the 3.0.0-12 kernel. The failures take a long time to time out, but when they do the drive is left accessible (you can see it in the partition editor).

I will try the latest mainline kernel on oneiric when I get a chance.

racitup (racitup) wrote :

I have tested with the upstream kernel and the bug still exists:
rich@zbase:~$ uname -a
Linux zbase 3.2.0-030200rc7-generic #201112240135 SMP Sat Dec 24 06:35:57 UTC 2011 x86_64 x86_64 x86_64 GNU/Linux

syslog excerpt:
...
06/01/2012 02:45:02 zbase kernel [ 482.280704] ata3: EH complete
06/01/2012 02:45:02 zbase kernel [ 482.281026] ata3.00: failed to read SCR 1 (Emask=0x40)
06/01/2012 02:45:02 zbase kernel [ 482.281039] ata3.01: failed to read SCR 1 (Emask=0x40)
06/01/2012 02:45:02 zbase kernel [ 482.281060] ata3.15: exception Emask 0x10 SAct 0x0 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:02 zbase kernel [ 482.281069] ata3.15: irq_stat 0x08000000, interface fatal error
06/01/2012 02:45:02 zbase kernel [ 482.281082] ata3.00: exception Emask 0x100 SAct 0x1 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:02 zbase kernel [ 482.281094] ata3.00: failed command: READ FPDMA QUEUED
06/01/2012 02:45:02 zbase kernel [ 482.281114] ata3.00: cmd 60/08:00:08:00:00/00:00:00:00:00/40 tag 0 ncq 4096 in
06/01/2012 02:45:02 zbase kernel [ 482.281118] res 50/00:00:00:00:00/00:00:00:00:00/00 Emask 0x100 (unknown error)
06/01/2012 02:45:02 zbase kernel [ 482.281128] ata3.00: status: { DRDY }
06/01/2012 02:45:02 zbase kernel [ 482.281139] ata3.01: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:02 zbase kernel [ 482.281156] ata3.15: hard resetting link
06/01/2012 02:45:03 zbase kernel [ 482.772185] ata3.15: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
06/01/2012 02:45:03 zbase kernel [ 482.773082] ata3.01: limiting SATA link speed to 1.5 Gbps
06/01/2012 02:45:03 zbase kernel [ 482.773102] ata3.00: hard resetting link
06/01/2012 02:45:03 zbase kernel [ 483.264483] ata3.00: SATA link up 3.0 Gbps (SStatus 123 SControl 310)
06/01/2012 02:45:03 zbase kernel [ 483.264564] ata3.01: hard resetting link
06/01/2012 02:45:03 zbase kernel [ 483.584516] ata3.01: SATA link down (SStatus 0 SControl 310)
06/01/2012 02:45:03 zbase kernel [ 483.584921] ata3.00: configured for UDMA/133
06/01/2012 02:45:03 zbase kernel [ 483.632178] ata3: EH complete
06/01/2012 02:45:03 zbase kernel [ 483.632289] ata3.00: failed to read SCR 1 (Emask=0x40)
06/01/2012 02:45:03 zbase kernel [ 483.632301] ata3.01: failed to read SCR 1 (Emask=0x40)
06/01/2012 02:45:03 zbase kernel [ 483.632322] ata3.15: exception Emask 0x10 SAct 0x0 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:03 zbase kernel [ 483.632333] ata3.15: irq_stat 0x08000000, interface fatal error
06/01/2012 02:45:03 zbase kernel [ 483.632346] ata3.00: exception Emask 0x100 SAct 0x1 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:03 zbase kernel [ 483.632358] ata3.00: failed command: READ FPDMA QUEUED
06/01/2012 02:45:03 zbase kernel [ 483.632379] ata3.00: cmd 60/08:00:08:00:00/00:00:00:00:00/40 tag 0 ncq 4096 in
06/01/2012 02:45:03 zbase kernel [ 483.632383] res 50/00:00:00:00:00/00:00:00:00:00/00 Emask 0x100 (unknown error)
06/01/2012 02:45:03 zbase kernel [ 483.632393] ata3.00: status: { DRDY }
06/01/2012 02:45:03 zbase kernel [ 483.632403] ata3.01: exception Emask 0x100 SAct 0x0 SErr 0x0 action 0x6 frozen
06/01/2012 02:45:03 zbase kernel [ 483.632420] ata3.15: hard resett...


tags: added: kernel-bug-exists-upstream
removed: needs-upstream-testing
Changed in linux (Ubuntu):
status: Incomplete → Confirmed
racitup (racitup) wrote :

Okay I have more news having spent most of the day trying to at least figure out where this bug was introduced!

It appears to be introduced in linux kernel version 2.6.37:
 - Everything before this (version 2.6.36.4 and backwards) works on Ubuntu 11.04, 10.10 and 10.04
 - Everything after this (version 2.6.37-rc1 and onwards, including 3.x) does not work on Ubuntu 10.10, 11.04, 11.10 and 12.04
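
A version boundary like the one above can also be located by bisection instead of testing each kernel in turn. This is a minimal sketch under assumptions that are not from the report: the candidate list is ordered oldest to newest, its first entry is known good and its last known bad, and `is_good` stands in for the manual step of booting a kernel and checking whether the enclosure initialises in RAID1 mode. The name `first_bad_version` and the candidate list are hypothetical.

```python
def first_bad_version(versions, is_good):
    """Return the earliest version for which is_good() is False.

    Assumes versions[0] is good, versions[-1] is bad, and that once a
    version is bad every later one is bad too.
    """
    low, high = 0, len(versions) - 1  # invariant: versions[low] good, versions[high] bad
    while high - low > 1:
        mid = (low + high) // 2
        if is_good(versions[mid]):
            low = mid
        else:
            high = mid
    return versions[high]


if __name__ == "__main__":
    # Illustrative candidate list; the good/bad split follows the boundary
    # stated in this comment (2.6.36.4 works, 2.6.37-rc1 does not).
    candidates = ["2.6.32", "2.6.35", "2.6.36.4", "2.6.37-rc1", "2.6.38", "3.0"]
    known_bad = {"2.6.37-rc1", "2.6.38", "3.0"}
    print(first_bad_version(candidates, lambda v: v not in known_bad))
```

Each step halves the remaining range, so a list of N kernels needs about log2(N) boots. The same idea applies to individual commits between 2.6.36 and 2.6.37-rc1 if a kernel source tree is available.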

As far as I can tell, the same kernel module, ahci, is responsible for this behaviour in all versions (taken from lspci -v).

Hope to have more information when I've found the relevant kernel changelog.
May also post the bug to the kernel bug tracker...

Cheers,
Richard

racitup (racitup) wrote :

This bug also coincides with the "Used" count given by lsmod against the ahci kernel module being either 5 or 6, instead of 2 when it works on older kernels.
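
The "Used" count can be read programmatically for comparison across kernels. This is a minimal sketch, assuming the usual column layout shared by `lsmod` output and `/proc/modules` (module name, size, use count, then users); the function name `module_use_count` is hypothetical. It returns `None` when the module is not listed, for example when the driver is built into the kernel.

```python
def module_use_count(modules_text, name):
    """Return the use count of a module from /proc/modules or lsmod text, or None."""
    for line in modules_text.splitlines():
        fields = line.split()
        # Columns: module name, size, use count, then users/state.
        if len(fields) >= 3 and fields[0] == name and fields[2].isdigit():
            return int(fields[2])
    return None


if __name__ == "__main__":
    try:
        with open("/proc/modules") as handle:
            print("ahci use count:", module_use_count(handle.read(), "ahci"))
    except OSError as error:
        print("could not read /proc/modules:", error)
```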

I have also tried forcing the ahci module to be loaded before others (by placing 'ahci' in /etc/initramfs-tools/modules, running 'sudo update-initramfs -u' and rebooting). It makes no difference.

penalvch (penalvch) wrote :

racitup, this bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? If so, could you please test for this with the latest development release of Ubuntu? ISO images are available from http://cdimage.ubuntu.com/daily-live/current/ .

If it remains an issue, could you please run the following command in the development release from a Terminal (Applications->Accessories->Terminal), as it will automatically gather and attach updated debug information to this report:

apport-collect -p linux <replace-with-bug-number>

Also, could you please test the latest upstream kernel available following https://wiki.ubuntu.com/KernelMainlineBuilds ? It will allow additional upstream developers to examine the issue. Please do not test the daily folder, but the one all the way at the bottom. Once you've tested the upstream kernel, please comment on which kernel version specifically you tested. If this bug is fixed in the mainline kernel, please add the following tags:
kernel-fixed-upstream
kernel-fixed-upstream-VERSION-NUMBER

where VERSION-NUMBER is the version number of the kernel you tested. For example:
kernel-fixed-upstream-v3.11-rc5

This can be done by clicking on the yellow circle with a black pencil icon next to the word Tags located at the bottom of the bug description. As well, please remove the tag:
needs-upstream-testing

If the mainline kernel does not fix this bug, please add the following tags:
kernel-bug-exists-upstream
kernel-bug-exists-upstream-VERSION-NUMBER

As well, please remove the tag:
needs-upstream-testing

Once testing of the upstream kernel is complete, please mark this bug's Status as Confirmed. Please let us know your results. Thank you for your understanding.

tags: added: needs-full-computer-model needs-upstream-testing
tags: added: regression-release
Changed in linux (Ubuntu):
status: Confirmed → Incomplete