SATA link power management causes disk errors and corruption
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Linux |
Expired
|
Medium
|
|||
linux (Ubuntu) |
Invalid
|
High
|
Unassigned | ||
Lucid |
Won't Fix
|
Low
|
Unassigned | ||
Maverick |
Invalid
|
Medium
|
Unassigned | ||
Natty |
Invalid
|
High
|
Unassigned | ||
pm-utils (Ubuntu) |
Fix Released
|
High
|
Unassigned | ||
Lucid |
Invalid
|
Undecided
|
Unassigned | ||
Maverick |
Invalid
|
Undecided
|
Unassigned | ||
Natty |
Fix Released
|
High
|
Unassigned | ||
pm-utils-powersave-policy (Ubuntu) |
Invalid
|
High
|
Unassigned | ||
Lucid |
Fix Released
|
Undecided
|
Unassigned | ||
Maverick |
Invalid
|
Undecided
|
Unassigned | ||
Natty |
Invalid
|
High
|
Unassigned |
Bug Description
SRU Justification for pm-utils-
Impact: On certain hardware, enabling power saving for the SATA link can cause data corruption.
How Addressed: The proposed branch removes the sata link power policy script. This will cause the link to be maintained at the normal power usage instead of dropping when the power is removed from the machine.
Reproduction: On an affected machine, unplug and plug in the power a few times. Data corruption will result.
Regression Potential: Removing the script will cause the SATA link to stay fully powered at all times. This may cause an increase in the battery usage for some machines. There should be no functionality regressions or bugs introduced by this change.
=====
Using Lucid on my laptop, I see errors like this in dmesg quite frequently (every few hours):
Mar 14 23:00:09 chris-laptop kernel: [42987.460608] ata1.00: exception Emask 0x10 SAct 0x1 SErr 0x50000 action 0xe frozen
Mar 14 23:00:09 chris-laptop kernel: [42987.460618] ata1.00: irq_stat 0x00400000, PHY RDY changed
Mar 14 23:00:09 chris-laptop kernel: [42987.460627] ata1: SError: { PHYRdyChg CommWake }
Mar 14 23:00:09 chris-laptop kernel: [42987.460635] ata1.00: failed command: READ FPDMA QUEUED
Mar 14 23:00:09 chris-laptop kernel: [42987.460649] ata1.00: cmd 60/08:00:
Mar 14 23:00:09 chris-laptop kernel: [42987.460652] res 40/00:04:
Mar 14 23:00:09 chris-laptop kernel: [42987.460669] ata1.00: status: { DRDY }
Mar 14 23:00:09 chris-laptop kernel: [42987.460681] ata1: hard resetting link
Mar 14 23:00:09 chris-laptop kernel: [42987.523336] ata2: exception Emask 0x10 SAct 0x0 SErr 0x50000 action 0xe frozen
Mar 14 23:00:09 chris-laptop kernel: [42987.523346] ata2: irq_stat 0x00400000, PHY RDY changed
Mar 14 23:00:09 chris-laptop kernel: [42987.523355] ata2: SError: { PHYRdyChg CommWake }
Mar 14 23:00:09 chris-laptop kernel: [42987.523368] ata2: hard resetting link
Mar 14 23:00:09 chris-laptop kernel: [42988.202586] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Mar 14 23:00:09 chris-laptop kernel: [42988.205443] ata1.00: configured for UDMA/133
Mar 14 23:00:09 chris-laptop kernel: [42988.205459] ata1: EH complete
Mar 14 23:00:09 chris-laptop kernel: [42988.280089] ata2: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
Mar 14 23:00:09 chris-laptop kernel: [42988.285567] ata2.00: configured for UDMA/100
Mar 14 23:00:09 chris-laptop kernel: [42988.289370] ata2: EH complete
Every couple of days, this results in data corruption and my filesystem being remounted read-only:
[ 6148.305806] Aborting journal on device sda1-8.
[ 6148.325011] EXT4-fs error (device sda1): ext4_journal_
[ 6148.325018] EXT4-fs (sda1): Remounting filesystem read-only
[ 6148.326702] journal commit I/O error
[ 6148.330975] EXT4-fs error (device sda1) in ext4_reserve_
[ 6148.462572] __ratelimit: 15 callbacks suppressed
Those messages generally appear at the end of dmesg after the event, just after the "hard resetting link" message. I then have to boot a live CD and manually run fsck, as I can no longer boot the laptop.
This is happening every couple of days generally, although it happened 3 times in one day last Thursday.
I did contemplate it being a hardware issue, but I tried running the kernel from Karmic for a couple of days, and that worked ok without a single error message
ProblemType: Bug
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.21.
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
/dev/snd/
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
Card hw:0 'Intel'/'HDA Intel at 0xf6afc000 irq 21'
Mixer name : 'Intel G45 DEVCTG'
Components : 'HDA:111d76b2,
Controls : 22
Simple ctrls : 11
Card1.Amixer.info:
Card hw:1 'U0x46d0x9a4'/'USB Device 0x46d:0x9a4 at usb-0000:
Mixer name : 'USB Mixer'
Components : 'USB046d:09a4'
Controls : 2
Simple ctrls : 1
Card1.Amixer.
Simple mixer control 'Mic',0
Capabilities: cvolume cvolume-joined cswitch cswitch-joined penum
Capture channels: Mono
Limits: Capture 0 - 14
Mono: Capture 0 [0%] [23.75dB] [on]
Date: Tue Mar 16 10:07:41 2010
DistroRelease: Ubuntu 10.04
Frequency: Once a day.
HibernationDevice: RESUME=
InstallationMedia: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
LiveMediaBuild: Ubuntu 9.10 "Karmic Koala" - Release amd64 (20091027)
MachineType: Dell Inc. Latitude E5500
Package: linux-image-
PccardctlIdent:
Socket 0:
no product info available
PccardctlStatus:
Socket 0:
no card
ProcCmdLine: BOOT_IMAGE=
ProcEnviron:
LANG=en_GB.utf8
SHELL=/bin/bash
ProcVersionSign
Regression: Yes
RelatedPackageV
Reproducible: No
SourcePackage: linux
TestedUpstream: No
Uname: Linux 2.6.32-16-generic x86_64
dmi.bios.date: 11/05/2009
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A15
dmi.board.name: 0DW635
dmi.board.vendor: Dell Inc.
dmi.chassis.type: 8
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.
dmi.product.name: Latitude E5500
dmi.sys.vendor: Dell Inc.
Changed in linux (Ubuntu): | |
assignee: | nobody → Stefan Bader (stefan-bader-canonical) |
Changed in pm-utils-powersave-policy (Ubuntu): | |
status: | New → In Progress |
importance: | Undecided → High |
assignee: | nobody → Chase Douglas (chasedouglas) |
description: | updated |
Changed in pm-utils-powersave-policy (Ubuntu Lucid): | |
status: | In Progress → Fix Committed |
tags: |
added: verification-done removed: verification-needed |
Changed in pm-utils (Ubuntu Natty): | |
importance: | Undecided → High |
tags: |
added: regression-release removed: regression-potential |
Changed in linux (Ubuntu Natty): | |
importance: | Undecided → High |
status: | Fix Released → New |
Changed in linux: | |
status: | Unknown → Confirmed |
Changed in linux: | |
importance: | Unknown → Medium |
Changed in linux (Ubuntu Natty): | |
status: | Incomplete → Triaged |
Changed in pm-utils-powersave-policy (Ubuntu Lucid): | |
assignee: | Chase Douglas (chasedouglas) → nobody |
Changed in pm-utils-powersave-policy (Ubuntu Natty): | |
assignee: | Chase Douglas (chasedouglas) → nobody |
Changed in linux (Ubuntu Maverick): | |
status: | Incomplete → Triaged |
Changed in linux (Ubuntu Lucid): | |
importance: | Undecided → Low |
Changed in linux (Ubuntu Maverick): | |
importance: | Undecided → Medium |
Changed in linux (Ubuntu Natty): | |
milestone: | none → natty-updates |
Changed in linux: | |
status: | Confirmed → Expired |
Changed in linux (Ubuntu Natty): | |
assignee: | Stefan Bader (stefan-bader-canonical) → nobody |
milestone: | natty-updates → none |
Changed in linux (Ubuntu): | |
assignee: | Stefan Bader (stefan-bader-canonical) → nobody |
milestone: | natty-updates → none |
I'm also affected by this, running kernel 2.6.32-16-generic #25-Ubuntu SMP Tue Mar 9 16:33:52 UTC 2010 i686 GNU/Linux Unknown/ Install/ Remove/ Purge/Hold Not/Inst/ Cfg-files/ Unpacked/ Failed- cfg/Half- inst/trig- aWait/Trig- pend /Reinst- required (Status,Err: uppercase=bad) ======= ====-== ======= =====-= ======= ======= ======= ======= ======= ======= =
$ dpkg -l linux-image
Desired=
| Status=
|/ Err?=(none)
||/ Name Version Description
+++-===
ii linux-image 2.6.32.16.17 Generic Linux kernel image.