Comment 5 for bug 599830

Revision history for this message
Jason Unrein (diabelek) wrote :

The mpt messages in your logs suggest that the firmware had an NCQ problem that required it to abort all the outstanding commands and have the OS retry them (see http://en.wikipedia.org/wiki/NCQ for what NCQ is). You can disable NCQ, at the cost of IO performance usually, to work around the issue (see https://ata.wiki.kernel.org/index.php/Libata_FAQ#Enabling.2C_disabling_and_checking_NCQ).

The problem would probably either be a bad drive or off change a bad cable or card. You might check each driver with smartctl to confirm their health. You might also what watch /sys/block/sdX/device/ioerr_cnt for each device to help clue in on any problems (never used the file before so I'd be curious if it helps).

Also, the xfs.log shows a panic from a null pointer. This is probably just a result of the problems on with the fw<->drive communication.