VMware Server 2.0 on an Intrepid 8.10 host, kernel 2.6.27-9-server x86_64; the guest runs the same release.
I installed the linux-image-virtual and linux-virtual packages.
I have reproduced this bug many times under heavy disk load.
It seems we need an option in the mptscsih driver or in the filesystems that turns off the timeouts and verification.
Patching the kernel in the linux-image-virtual package would help too.
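As a possible stopgap for the timeout option suggested above, the per-device SCSI command timeout can be raised from inside the guest through sysfs. The sketch below assumes the standard `/sys/block/<dev>/device/timeout` attribute (value in seconds). The helper name `set_scsi_timeout` and the 180-second value are illustrative assumptions and have not been tested against this bug. Note that this only lengthens the timeout; it does not turn it off.

```python
from pathlib import Path


def set_scsi_timeout(device: str, seconds: int, sysfs_root: str = "/sys") -> int:
    """Set the SCSI command timeout for a block device and return the old value.

    `device` is a kernel name such as "sdb". `sysfs_root` is a parameter only
    so the function can be exercised against a fake directory tree.
    """
    if seconds <= 0:
        raise ValueError("timeout must be a positive number of seconds")
    attr = Path(sysfs_root) / "block" / device / "device" / "timeout"
    previous = int(attr.read_text().strip())  # current timeout in seconds
    attr.write_text(f"{seconds}\n")           # needs root on a real system
    return previous


# Hypothetical usage on the guest, run as root:
#   set_scsi_timeout("sdb", 180)
#   set_scsi_timeout("sdc", 180)
```

The setting does not survive a reboot, so it would have to be reapplied at boot (for example from a udev rule or an init script) if it turns out to help.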
Here is my syslog.
Sometimes all goes well and the driver resets successfully:
Jan 24 19:37:22 uadb kernel: [17989.370267] mptscsih: ioc0: attempting task abort! (sc=ffff88006ac123c0)
Jan 24 19:37:22 uadb kernel: [17989.370268] sd 2:0:2:0: [sdc] CDB: Write(10): 2a 00 01 43 d0 37 00 04 00 00
Jan 24 19:37:22 uadb kernel: [17989.370273] mptscsih: ioc0: task abort: SUCCESS (sc=ffff88006ac123c0)
Jan 24 19:37:22 uadb kernel: [17989.370292] mptscsih: ioc0: attempting task abort! (sc=ffff88006ac12640)
Jan 24 19:37:22 uadb kernel: [17989.370294] sd 2:0:2:0: [sdc] CDB: Write(10): 2a 00 01 43 d4 37 00 00 08 00
Jan 24 19:37:22 uadb kernel: [17989.370298] mptscsih: ioc0: task abort: SUCCESS (sc=ffff88006ac12640)
Jan 24 19:37:22 uadb kernel: [17989.370318] mptscsih: ioc0: attempting task abort! (sc=ffff88006ac12c80)
Jan 24 19:37:22 uadb kernel: [17989.370319] sd 2:0:2:0: [sdc] CDB: Write(10): 2a 00 01 43 d4 3f 00 00 90 00
Jan 24 19:37:22 uadb kernel: [17989.370324] mptscsih: ioc0: task abort: SUCCESS (sc=ffff88006ac12c80)
Jan 24 19:37:22 uadb kernel: [17989.370344] mptscsih: ioc0: attempting task abort! (sc=ffff88003788d140)
Jan 24 19:37:22 uadb kernel: [17989.370346] sd 2:0:1:0: [sdb] CDB: Read(10): 28 00 00 be b5 17 00 00 08 00
Jan 24 19:37:22 uadb kernel: [17991.772561] mptbase: ioc0: Initiating recovery
Jan 24 19:37:22 uadb kernel: [17993.342132] mptscsih: ioc0: Issue of TaskMgmt failed!
Jan 24 19:37:22 uadb kernel: [17993.342224] mptscsih: ioc0: task abort: FAILED (sc=ffff88003788d140)
Jan 24 19:37:22 uadb kernel: [17993.342226] mptscsih: ioc0: attempting task abort! (sc=ffff88003788da00)
Jan 24 19:37:22 uadb kernel: [17993.342229] sd 2:0:1:0: [sdb] CDB: Read(10): 28 00 00 be b5 2f 00 00 08 00
Jan 24 19:37:22 uadb kernel: [17993.342235] mptscsih: ioc0: task abort: SUCCESS (sc=ffff88003788da00)
Jan 24 19:37:22 uadb kernel: [17993.342362] mptscsih: ioc0: attempting target reset! (sc=ffff88003788d140)
Jan 24 19:37:22 uadb kernel: [17993.342364] sd 2:0:1:0: [sdb] CDB: Read(10): 28 00 00 be b5 17 00 00 08 00
Jan 24 19:37:22 uadb kernel: [17993.342828] scsi target2:0:0: Beginning Domain Validation
Jan 24 19:37:22 uadb kernel: [17993.620049] mptscsih: ioc0: target reset: SUCCESS (sc=ffff88003788d140)
But under really heavy load:
Jan 24 20:51:49 uadb kernel: [22458.461947] mptscsih: ioc0: bus reset: FAILED (sc=ffff880068b81280)
Jan 24 20:51:49 uadb kernel: [22458.461958] mptscsih: ioc0: attempting host reset! (sc=ffff880068b81280)
Jan 24 20:51:49 uadb kernel: [22458.461974] mptbase: ioc0: Initiating recovery
Jan 24 20:51:49 uadb kernel: [22459.601222] sd 2:0:1:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 1, sc=ffff880068b818c0, mf = ffff880174583e80, idx=3c
Jan 24 20:51:49 uadb kernel: [22459.601236] sd 2:0:1:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 1, sc=ffff880068b81dc0, mf = ffff880174584d20, idx=63
Jan 24 20:51:49 uadb kernel: [22459.601241] sd 2:0:1:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 1, sc=ffff880068b81a00, mf = ffff880174584e40, idx=66
Jan 24 20:51:49 uadb kernel: [22459.601253] sd 2:0:1:0: mptscsih: ioc0: completing cmds: fw_channel 0, fw_id 1, sc=ffff880068b81280, mf = ffff880174585620, idx=7b
Jan 24 20:51:49 uadb kernel: [22460.000277] mptscsih: ioc0: host reset: SUCCESS (sc=ffff880068b81280)
Jan 24 20:51:49 uadb kernel: [22460.000286] sd 2:0:1:0: Device offlined - not ready after error recovery
Jan 24 20:51:49 uadb kernel: [22460.000288] sd 2:0:1:0: Device offlined - not ready after error recovery
Jan 24 20:51:49 uadb kernel: [22460.000289] sd 2:0:1:0: Device offlined - not ready after error recovery
Jan 24 20:51:49 uadb kernel: [22460.000290] sd 2:0:1:0: Device offlined - not ready after error recovery
Jan 24 20:51:49 uadb kernel: [22460.000309] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb kernel: [22460.010086] Buffer I/O error on device sdb1, logical block 2844192
Jan 24 20:51:49 uadb kernel: [22460.010086] lost page write due to I/O error on sdb1
Jan 24 20:51:49 uadb kernel: [22460.010086] Buffer I/O error on device sdb1, logical block 2844193
Jan 24 20:51:49 uadb kernel: [22460.010086] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb last message repeated 107 times
Jan 24 20:51:49 uadb kernel: [22460.713791] sd 2:0:1:0: rejecting I/O to offline device
....
Jan 24 20:51:49 uadb kernel: [22460.762970] sd 2:0:1:0: [sdb] Result: hostbyte=DID_NO_CONNECT driverbyte=DRIVER_OK,SUGGEST_OK
Jan 24 20:51:49 uadb kernel: [22460.762982] end_request: I/O error, dev sdb, sector 18966103
Jan 24 20:51:49 uadb kernel: [22460.763012] Aborting journal on device sdb1.
Jan 24 20:51:49 uadb kernel: [22460.763018] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb kernel: [22460.763022] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb kernel: [22460.763724] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb kernel: [22460.764010] sd 2:0:1:0: rejecting I/O to offline device
Jan 24 20:51:49 uadb kernel: [22460.764034] ext3_abort called.
Jan 24 20:51:49 uadb kernel: [22460.764039] EXT3-fs error (device sdb1): ext3_journal_start_sb: Detected aborted journal
Jan 24 20:51:49 uadb kernel: [22460.764044] Remounting filesystem read-only
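
To compare runs, it may help to tally how far the mptscsih error recovery escalated (task abort, target reset, bus reset, host reset) in a syslog excerpt like the ones above. This is only a minimal sketch: the regular expression assumes the `mptscsih: iocN: <action>: SUCCESS|FAILED` message format seen in this report, and the function name `summarize_recovery` is made up for illustration.

```python
import re
from collections import Counter

# Matches outcome lines such as "mptscsih: ioc0: task abort: SUCCESS (sc=...)".
# "attempting ..." and "completing cmds" lines are deliberately not matched.
OUTCOME_RE = re.compile(
    r"mptscsih: (?P<ioc>ioc\d+): "
    r"(?P<action>task abort|target reset|bus reset|host reset): "
    r"(?P<result>SUCCESS|FAILED)"
)


def summarize_recovery(lines):
    """Count (action, result) pairs found in an iterable of syslog lines."""
    counts = Counter()
    for line in lines:
        match = OUTCOME_RE.search(line)
        if match:
            counts[(match.group("action"), match.group("result"))] += 1
    return counts


# Hypothetical usage:
#   with open("/var/log/syslog") as fh:
#       for (action, result), n in sorted(summarize_recovery(fh).items()):
#           print(f"{action:13s} {result:8s} {n}")
```

A failed bus reset followed by a host reset, as in the second excerpt, is the pattern that ended with the device being offlined here.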