spurious completions during NCQ - Lenovo R61i - FUJITSU MHW2160BH PL

Bug #175834 reported by Wiktor Grebla
Affects          Status         Importance   Assigned to   Milestone
linux (Ubuntu)   Fix Released   Medium       Chuck Short

Bug Description

Hi, I'm getting quite a lot of these errors:
'''
[ 9932.004000] ata1.00: exception Emask 0x2 SAct 0x70007 SErr 0x0 action 0x2 frozen
[ 9932.004000] ata1.00: (spurious completions during NCQ issue=0x0 SAct=0x70007 FIS=004040a1:00008000)
[ 9932.004000] ata1.00: cmd 61/10:00:26:0d:12/00:00:0c:00:00/40 tag 0 cdb 0x0 data 8192 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.004000] ata1.00: cmd 61/18:08:06:5e:68/00:00:0c:00:00/40 tag 1 cdb 0x0 data 12288 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.004000] ata1.00: cmd 61/18:10:9e:68:68/00:00:0c:00:00/40 tag 2 cdb 0x0 data 12288 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.004000] ata1.00: cmd 61/08:80:36:21:68/00:00:0c:00:00/40 tag 16 cdb 0x0 data 4096 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.004000] ata1.00: cmd 61/08:88:16:30:68/00:00:0c:00:00/40 tag 17 cdb 0x0 data 4096 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.004000] ata1.00: cmd 61/08:90:76:5c:68/00:00:0c:00:00/40 tag 18 cdb 0x0 data 4096 out
[ 9932.004000] res 40/00:10:9e:68:68/00:00:0c:00:00/40 Emask 0x2 (HSM violation)
[ 9932.316000] ata1: soft resetting port
[ 9932.488000] ata1: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[ 9932.488000] ata1.00: configured for UDMA/100
[ 9932.488000] ata1: EH complete
'''
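
For readers unfamiliar with these messages: SAct is the port's SActive value, a bitmask with one bit per outstanding NCQ tag. The short Python sketch below (an illustration, not part of the original report; the helper name active_tags is made up) decodes the value from the log and shows that it lines up with the six tags listed in the cmd lines.

```python
def active_tags(sact):
    """Return the NCQ tag numbers whose bits are set in a 32-bit SActive value."""
    return [tag for tag in range(32) if sact & (1 << tag)]


# Value copied from the "exception" line in the log above.
sact = 0x70007
print(active_tags(sact))  # [0, 1, 2, 16, 17, 18]
```

The decoded tags (0, 1, 2, 16, 17, 18) are exactly the tags reported for the six failed commands.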

hdparm -I output is:

'''
/dev/sda:

ATA device, with non-removable media
        Model Number: FUJITSU MHW2160BH PL
        Serial Number: K10FT7A269UE
        Firmware Revision: 0084001E
        Transport: Serial, ATA8-AST, SATA 1.0a, SATA II Extensions, SATA Rev 2.5; Revision: ATA8-AST T13 Project D1697 Revision 0b
Standards:
        Supported: 8 7 6 5
        Likely used: 8
Configuration:
        Logical max current
        cylinders 16383 16383
        heads 16 16
        sectors/track 63 63
        --
        CHS current addressable sectors: 16514064
        LBA user addressable sectors: 268435455
        LBA48 user addressable sectors: 312581808
        device size with M = 1024*1024: 152627 MBytes
        device size with M = 1000*1000: 160041 MBytes (160 GB)
Capabilities:
        LBA, IORDY(can be disabled)
        Queue depth: 32
        Standby timer values: spec'd by Standard, no device specific minimum
        R/W multiple sector transfer: Max = 16 Current = 16
        Advanced power management level: 128 (0x80)
        Recommended acoustic management value: 254, current value: 254
        DMA: mdma0 mdma1 mdma2 udma0 udma1 udma2 udma3 udma4 *udma5
             Cycle time: min=120ns recommended=120ns
             Cycle time: min=120ns recommended=120ns
        PIO: pio0 pio1 pio2 pio3 pio4
             Cycle time: no flow control=120ns IORDY flow control=120ns
Commands/features:
        Enabled Supported:
           * SMART feature set
                Security Mode feature set
           * Power Management feature set
           * Write cache
           * Look-ahead
           * Host Protected Area feature set
           * WRITE_BUFFER command
           * READ_BUFFER command
           * DOWNLOAD_MICROCODE
           * Advanced Power Management feature set
                SET_MAX security extension
           * Automatic Acoustic Management feature set
           * 48-bit Address feature set
           * Device Configuration Overlay feature set
           * Mandatory FLUSH_CACHE
           * FLUSH_CACHE_EXT
           * SMART error logging
           * SMART self-test
           * General Purpose Logging feature set
           * WRITE_{DMA|MULTIPLE}_FUA_EXT
           * 64-bit World wide name
           * IDLE_IMMEDIATE with UNLOAD
           * Disable Data Transfer After Error Detection
           * WRITE_UNCORRECTABLE command
           * Segmented DOWNLOAD_MICROCODE
           * SATA-I signaling speed (1.5Gb/s)
           * Native Command Queueing (NCQ)
           * Host-initiated interface power management
           * Phy event counters
                DMA Setup Auto-Activate optimization
           * Device-initiated interface power management
           * Software settings preservation
           * SMART Command Transport (SCT) feature set
           * SCT LBA Segment Access (AC2)
           * SCT Error Recovery Control (AC3)
           * SCT Features Control (AC4)
           * SCT Data Tables (AC5)
Security:
        Master password revision code = 65534
                supported
        not enabled
        not locked
                frozen
        not expired: security count
        not supported: enhanced erase
        160min for SECURITY ERASE UNIT.
Checksum: correct
'''

I suppose this hard drive should be blacklisted (NCQ is disabled because of these errors anyway).
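
One way to check whether NCQ is actually in use is to read the queue depth that the kernel exposes in sysfs. The Python sketch below is only an illustration and rests on two assumptions: that the disk is sda, and that a depth of 1 means commands are no longer queued. The function name ncq_queue_depth is made up for this example.

```python
from pathlib import Path


def ncq_queue_depth(device="sda", sysfs_root="/sys/block"):
    """Return the device's queue depth as an int, or None if it cannot be read."""
    path = Path(sysfs_root) / device / "device" / "queue_depth"
    try:
        return int(path.read_text().strip())
    except (OSError, ValueError):
        # Attribute missing, unreadable, or not a number.
        return None


depth = ncq_queue_depth("sda")  # assumption: the affected disk is sda
if depth is None:
    print("queue_depth could not be read for this device")
elif depth == 1:
    print("queue depth is 1: NCQ is effectively off")
else:
    print(f"queue depth is {depth}: NCQ appears to be in use")
```

The hdparm output above reports a supported queue depth of 32, so a value of 1 here would suggest that the kernel has turned queuing off for this drive.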

I'm a bit worried. Nothing has happened so far (no data loss, and I've had Ubuntu installed on this laptop for two
weeks), but my first installation (Arch Linux) ended up with filesystem corruption, possibly
because of my attempts to get suspend-to-RAM working.

Chuck Short (zulcss) wrote :

I have added this to my git tree for hardy. Thanks for the bug report.

Changed in linux-meta:
status: New → In Progress
Tim Gardner (timg-tpi)
Changed in linux-meta:
assignee: nobody → zulcss
importance: Undecided → Medium
milestone: none → hardy-alpha-4
status: In Progress → Fix Committed
Launchpad Janitor (janitor) wrote :

This bug was fixed in the package linux - 2.6.24-5.8

---------------
linux (2.6.24-5.8) hardy; urgency=low

  [Alessio Igor Bogani]

  * rt: Update to 2.6.24-rc8-rt1
  * rt: Update configuration files

  [Amit Kucheria]

  * Asix: fix breakage caused in 2.6.24-rc7
  * Add CONFIG_CPUSETS to server-related flavours
    - LP: #182434

  [Chuck Short]

  * SAUCE: ata: blacklist FUJITSU MHW2160BH PL
    - LP: #175834

  [Kees Cook]

  * AppArmor: updated patch series to upstream SVN 1079.

  [Soren Hansen]

  * Updated configs to enable virtio stuff Ignore: yes

  [Stefan Bader]

  * Enabled CONFIG_BSD_PROCESS_ACCT=y for sparc.
    - LP: #176587
  * Enable CONFIG_AUDITSYSCALL=y.
    - LP: #140784
  * Added CONFIG_AUDIT_SYSCALL=y to custom lpia(compat)
  * Enabled CONFIG_HUGETLBFS=y for i386/server amd64/server and ia64.
  * Lower priority of pnpacpi resource messages to warning level.
    - LP: #159241
  * Fix the messed up message level of pnpacpi parser.

  [Tim Gardner]

  * Start new release, bump ABI to -5
  * Disabled iwlwifi preperatory to moving it to l-u-m.
  * Enabled CONFIG_USB_SERIAL_KEYSPAN
  * Disabled CONFIG_CGROUPS.
  * Virtio config settings for -rt.
  * Re-enable IWLWIFI in the kernel.
  * Fixed -rt saa7134-core.c FTBS

  [Upstream Kernel Changes]

  * Input: Handle EV_PWR type of input caps in input_set_capability.
  * Input: jornada680_kbd - fix default keymap
  * increase PNP_MAX_PORT to 40 from 24
  * sched: fix gcc warnings
  * leds: Fix leds_list_lock locking issues
  * leds: Fix locomo LED driver oops
  * x86: fix asm-x86/byteorder.h for userspace export
  * x86: fix asm-x86/msr.h for user-space export
  * ACPI: EC: Enable boot EC before bus_scan
  * ACPI: Make sysfs interface in ACPI power optional.
  * fix lguest rmmod "bad pgd"
  * slub: provide /proc/slabinfo
  * [POWERPC] Fix build failure on Cell when CONFIG_SPU_FS=y
  * slub: register slabinfo to procfs
  * [SCSI] scsi_sysfs: restore prep_fn when ULD is removed
  * Unify /proc/slabinfo configuration
  * scsi: revert "[SCSI] Get rid of scsi_cmnd->done"
  * restrict reading from /proc/<pid>/maps to those who share ->mm or can
    ptrace pid
  * Fix kernel/ptrace.c compile problem (missing "may_attach()")
  * hwmon: (w83627ehf) Be more careful when changing VID input level
  * NFS: Fix a possible Oops in fs/nfs/super.c
  * NFSv4: Fix circular locking dependency in nfs4_kill_renewd
  * NFS: add newline to kernel warning message in auth_gss code
  * NFSv4: nfs4_open_confirm must not set the open_owner as confirmed on
    error
  * NFSv4: Fix open_to_lock_owner sequenceid allocation...
  * gameport: don't export functions that are static inline
  * Input: spitzkbd - fix suspend key handling
  * Input: pass EV_PWR events to event handlers
  * [ARM] 4735/1: Unbreak pxa25x suspend/resume
  * IB/srp: Fix list corruption/oops on module reload
  * Console is utf-8 by default
  * [IA64] Update Altix BTE error return status patch
  * [IA64] Update Altix nofault code
  * [X25]: Add missing x25_neigh_put
  * [XFRM]: Do not define km_migrate() if !CONFIG_XFRM_MIGRATE
  * [CASSINI]: Fix endianness bug.
  * [CASSINI]: Revert 'dont touch page_count'.
  * [CASSINI]: Program parent Inte...

Changed in linux:
status: Fix Committed → Fix Released