Comment 6 for bug 599830

Revision history for this message
Lars (lars-taeuber) wrote :

Hi Jason,

thanks for your hints.
I did a FW update of the LSI SAS controller and reduced the fs content. Since the the update and the filesystems are less filled the error didn't occur again.

# cat /proc/scsi/mptsas/9
ioc1: LSISAS1068E B3, FwRev=011f0200h, Ports=1, MaxQ=483
# cat /proc/scsi/mptsas/10
ioc2: LSISAS1068E B3, FwRev=011f0200h, Ports=1, MaxQ=483

# ./sasflash -listall

 ****************************************************************************
    LSI Corporation SAS FLASH Utility.

    SASFlash Version 1.26.00.00 (2010.05.18)

    Copyright (c) 2006-2007 LSI Corporation. All rights reserved.
 ****************************************************************************

 Adapter Selected is a LSI SAS 1068E(B3):

 Num Ctlr FW Ver NVDATA x86-BIOS EFI-BSD PCI Addr
-----------------------------------------------------------------------

1 1068E(B3) 01.31.02.00 2d.03 06.32.00.00 No Image 00:08:00:00
2 1068E(B3) 01.31.02.00 2d.03 06.32.00.00 No Image 00:09:00:00

The fs look like this:
# LANG=C df -ht xfs
Filesystem Size Used Avail Use% Mounted on
/dev/md2 6.1T 2.2T 3.9T 36% /backup2
/dev/md3 6.1T 3.8T 2.3T 63% /backup1

Just for your interest:
# cat /sys/block/sd?/device/ioerr_cnt /sys/block/sd??/device/ioerr_cnt
0x358
0x358
0x53
0x48
0x47
0x46
0x59
0x55
0x55
0x60
0x63
0x62
0x60
0x5e
0x6c
0x62
0x60
0x67
0x68
0x6c
0x76
0x70
0x72
0x6e
0x6d
0x65
0xc3
0xbd
0xc5
0xca
0xf0
0x104
0x107
0x113
0x119
0x127
0x11b
0x127
0x126
0x12d
0x12f
0x13c
0x12e
0x142
0x17f
0x13a
0x141
0x144
0x13e
0x141

The first 2 drives are attached through SATA controller (AHCI). I don't know what numbers are normal but there is a server with drives that have an error count of more than 850 and work flawlessly.
I would disable NCQ only at very last step, because throughput is important. The server has to fill 2 LTO tapes with fast write speed.

Is it possible to reopen bug reports? If yes, I think you can close this one for now.
I'll report when problems occur again.

Thanks
Lars