Comment 20 for bug 263160

Revision history for this message
Shawn Ostapuk (flagg) wrote :

I believe I also have this problem with Jaunty (9.04) running 2.6.28-11-server.

I am using 6 Seagate 1.5TB Disks (that do NOT have the notorious freezing firmware) and Promise SATA controllers, under heavy load i get ATA resets and 2 drives drop out of my raid.

Drive Info:

 Model Number: ST31500341AS
 Serial Number: 9VS1F7AR
 Firmware Revision: CC1H
 Transport: Serial

Controllers:

00:08.0 Mass storage controller: Promise Technology, Inc. PDC40718 (SATA 300 TX4) (rev 02)
00:09.0 Mass storage controller: Promise Technology, Inc. PDC40718 (SATA 300 TX4) (rev 02)

Jun 22 21:57:52 ralph -- MARK --
Jun 22 22:17:52 ralph -- MARK --
Jun 22 22:37:52 ralph -- MARK --
Jun 22 22:39:30 ralph kernel: [98628.073762] ata6: hard resetting link
Jun 22 22:39:31 ralph kernel: [98628.430232] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Jun 22 22:39:31 ralph kernel: [98628.566502] ata6.00: configured for UDMA/133
Jun 22 22:39:31 ralph kernel: [98628.566545] ata6: EH complete
Jun 22 22:40:01 ralph kernel: [98659.059722] ata6: hard resetting link
Jun 22 22:40:02 ralph kernel: [98659.400116] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Jun 22 22:40:02 ralph kernel: [98659.547905] ata6.00: configured for UDMA/133
Jun 22 22:40:02 ralph kernel: [98659.547951] ata6: EH complete
Jun 22 22:40:32 ralph kernel: [98690.065025] ata6: hard resetting link
Jun 22 22:40:33 ralph kernel: [98690.410095] ata6: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
Jun 22 22:40:33 ralph kernel: [98690.549672] ata6.00: configured for UDMA/133
Jun 22 22:40:33 ralph kernel: [98690.549705] ata6: EH complete
Jun 22 22:41:03 ralph kernel: [98721.000265] ata6: limiting SATA link speed to 1.5 Gbps
Jun 22 22:41:03 ralph kernel: [98721.067312] ata6: hard resetting link
Jun 22 22:41:04 ralph kernel: [98721.410108] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun 22 22:41:04 ralph kernel: [98721.537792] ata6.00: configured for UDMA/133
Jun 22 22:41:04 ralph kernel: [98721.537832] ata6: EH complete
Jun 22 22:41:34 ralph kernel: [98752.071181] ata6: hard resetting link
Jun 22 22:41:35 ralph kernel: [98752.430117] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun 22 22:41:35 ralph kernel: [98752.557827] ata6.00: configured for UDMA/133
Jun 22 22:41:35 ralph kernel: [98752.557867] ata6: EH complete
Jun 22 22:42:05 ralph kernel: [98783.078988] ata6: hard resetting link
Jun 22 22:42:06 ralph kernel: [98783.420202] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun 22 22:42:06 ralph kernel: [98783.547890] ata6.00: configured for UDMA/133
Jun 22 22:42:06 ralph kernel: [98783.547925] ata6: EH complete
Jun 22 22:42:36 ralph kernel: [98814.071513] ata6: hard resetting link
Jun 22 22:42:37 ralph kernel: [98814.440122] ata6: SATA link up 1.5 Gbps (SStatus 113 SControl 310)
Jun 22 22:42:37 ralph kernel: [98814.567858] ata6.00: configured for UDMA/133
Jun 22 22:42:37 ralph kernel: [98814.567913] sd 5:0:0:0: [sdf] Result: hostbyte=DID_OK driverbyte=DRIVER_SENSE,SUGGEST_OK
Jun 22 22:42:37 ralph kernel: [98814.567956] sd 5:0:0:0: [sdf] Sense Key : Aborted Command [current] [descriptor]
Jun 22 22:42:37 ralph kernel: [98814.567963] Descriptor sense data with sense descriptors (in hex):
Jun 22 22:42:37 ralph kernel: [98814.567965] 72 0b 00 00 00 00 00 0c 00 0a 80 00 00 00 00 00
Jun 22 22:42:37 ralph kernel: [98814.567975] 00 00 00 00
Jun 22 22:42:37 ralph kernel: [98814.567979] sd 5:0:0:0: [sdf] Add. Sense: No additional sense information
Jun 22 22:42:37 ralph kernel: [98814.580037] raid5:md0: read error not correctable (sector 183296640 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603411] raid5:md0: read error not correctable (sector 183296648 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603416] raid5:md0: read error not correctable (sector 183296656 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603420] raid5:md0: read error not correctable (sector 183296664 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603424] raid5:md0: read error not correctable (sector 183296672 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603427] raid5:md0: read error not correctable (sector 183296680 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603431] raid5:md0: read error not correctable (sector 183296688 on sdf1).
Jun 22 22:42:37 ralph kernel: [98814.603435] raid5:md0: read error not correctable (sector 183296696 on sdf1).

Rebooting and recreating the md0 restores the raid (albeit with a 3 day to run recovery).