udisks-probe-ata-smart causes HSM violations
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Linux |
Invalid
|
Undecided
|
Unassigned | ||
linux (Ubuntu) |
Invalid
|
High
|
Unassigned | ||
Lucid |
Invalid
|
High
|
Unassigned |
Bug Description
This is related to bug 445852, it causes the same effects, but under different circumstances.
During boot on an SSD system and when logging in and starting something rather IO intense, like firefox, the system freezes for 30 seconds, and afterwards dmesg shows an error like
ata2: lost interrupt (Status 0x58)
ata2: drained 16384 bytes to clear DRQ.
ata2.00: exception Emask 0x0 SAct 0x0 SErr 0x0 action 0x6 frozen
ata2.00: BMDMA stat 0x4
ata2.00: cmd c8/00:40:
res 58/00:40:
ata2.00: status: { DRDY DRQ }
ata2: soft resetting link
ata2.00: configured for UDMA/66
ata2: EH complete
The main cause of bug 445828 was fixed now, but there are still some users who get those HSM violations/30 second hangs during boot.
Martin Pitt got ssh access to Jarige's machine which is still affected (he's willing to provide access to other people for debugging).
A lot of different commands were tried to reproduce this at runtime, like
# for i in `seq 50`; do skdump --can-smart /dev/sda; hdparm -B254 /dev/sda; sleep 0.2; done
# udevadm trigger --action=change --sysname-match=sda # (also in a loop)
# (/lib/udev/
and so on, but it seems impossible to reproduce at runtime unfortunately. I also tried those commands while a "grep -r . /usr" was running in the background to induce I/O and disk reading activity.
The interesting thing is that the bug goes away if you either disable /lib/udev/
ProblemType: Bug
DistroRelease: Ubuntu 10.04
Package: linux-image-
Regression: No
Reproducible: Yes
ProcVersionSign
Uname: Linux 2.6.32-21-generic i686
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
AplayDevices:
**** List of PLAYBACK Hardware Devices ****
card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
Subdevices: 1/1
Subdevice #0: subdevice #0
Architecture: i386
ArecordDevices:
**** List of CAPTURE Hardware Devices ****
card 0: Intel [HDA Intel], device 0: ALC268 Analog [ALC268 Analog]
Subdevices: 1/1
Subdevice #0: subdevice #0
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
Card0.Amixer.info:
Card hw:0 'Intel'/'HDA Intel at 0x78540000 irq 16'
Mixer name : 'Realtek ALC268'
Components : 'HDA:10ec0268,
Controls : 8
Simple ctrls : 5
Date: Mon May 3 15:21:38 2010
InstallationMedia: Ubuntu-
MachineType: Acer AOA110
ProcCmdLine: BOOT_IMAGE=
ProcEnviron:
LANG=en_US.utf8
SHELL=/bin/bash
RelatedPackageV
RfKill:
0: phy0: Wireless LAN
Soft blocked: no
Hard blocked: no
SourcePackage: linux
dmi.bios.date: 05/09/2008
dmi.bios.vendor: INSYDE
dmi.bios.version: v0.3109
dmi.board.
dmi.board.name: Base Board Product Name
dmi.board.vendor: Intel Corp.
dmi.board.version: Base Board Version
dmi.chassis.type: 1
dmi.chassis.vendor: Chassis Manufacturer
dmi.chassis.
dmi.modalias: dmi:bvnINSYDE:
dmi.product.name: AOA110
dmi.product.
dmi.sys.vendor: Acer
description: | updated |
Changed in linux (Ubuntu Lucid): | |
assignee: | nobody → Tim Gardner (timg-tpi) |
status: | New → In Progress |
tags: | added: kernel-core kernel-reviewed |
Changed in hdparm (Ubuntu): | |
assignee: | Martin Pitt (pitti) → nobody |
Changed in linux (Ubuntu Lucid): | |
importance: | Undecided → High |
Changed in linux (Ubuntu Lucid): | |
assignee: | Tim Gardner (timg-tpi) → nobody |
Changed in linux (Ubuntu): | |
assignee: | Canonical Kernel Team (canonical-kernel-team) → nobody |
At this point I'd appreciate some input from the kernel team what this message actually means, and what the likely cause could be. It does not really seem specific to either hdparm nor libatasmart, all they do is things like https:/ /bugs.launchpad .net/ubuntu/ karmic/ +source/ libatasmart/ +bug/445852/ comments/ 202 (i. e. using the normal SCSI ioctls).
One possible workaround might be to not run hdparm on SSD devices. This only might fix the symptom, but it might be an appropriate SRU for lucid.