Comment 16 for bug 149071

Revision history for this message
TJ (tj) wrote : aacraid fails on 2.6.24-15-generic with Dell PowerEdge PERC/2 RAID controller

I attached a serial console and captured 2.6.24-15-generic attempting to start. I've attached the log-file so we don't need to rely on screen photographs.

Key features are, I think:

[ 439.306852] Adaptec aacraid driver 1.1-5[2449]-ms
[ 439.758050] irq 10: nobody cared (try booting with the "irqpoll" option)
[ 439.764849] Pid: 1263, comm: udevd Not tainted 2.6.24-15-generic #1
[ 439.771202] [<c0165764>] __report_bad_irq+0x24/0x80
...
[ 439.839092] handlers:
[ 439.841428] [<f88859a0>] (ahc_linux_isr+0x0/0x250 [aic7xxx])
[ 439.847367] Disabling IRQ #10

[ 495.422177] BUG: soft lockup - CPU#3 stuck for 11s! [modprobe:1447]
[ 495.428524]
[ 495.430086] Pid: 1447, comm: modprobe Not tainted (2.6.24-15-generic #1)
[ 495.436855] EIP: 0060:[<c021662b>] EFLAGS: 00000293 CPU: 3
[ 495.442429] EIP is at delay_tsc+0x2b/0x50
[ 495.446510] EAX: 78583a4b EBX: 0000003f ECX: 00000000 EDX: 0000003f
[ 495.452847] ESI: 78583a27 EDI: f7d01a78 EBP: 78583171 ESP: df93bd4c
[ 495.459178] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
[ 495.464644] CR0: 8005003b CR2: 0812574c CR3: 1fa03000 CR4: 00000690
[ 495.470980] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
[ 495.477309] DR6: ffff0ff0 DR7: 00000400
[ 495.481224] [<c02165c6>] __delay+0x6/0x10
[ 495.485463] [<f89446aa>] aac_fib_send+0x21a/0x2d0 [aacraid]
[ 495.491306] [<c012363a>] enqueue_task_fair+0x1a/0x30
[ 495.496515] [<f8940a94>] aac_get_adapter_info+0x74/0x620 [aacraid]
[ 495.502942] [<f893df54>] aac_probe_one+0x224/0x450 [aacraid]
[ 495.508830] [<f8944b80>] aac_command_thread+0x0/0x6d0 [aacraid]
...

*** This is caused by the motherboard having a USB controller chipset with no USB hardware

[ 499.829084] uhci_hcd 0000:00:02.2: host controller process error, something bad happened!
[ 499.837347] uhci_hcd 0000:00:02.2: host controller halted, very bad!
[ 499.843790] uhci_hcd 0000:00:02.2: HC died; cleaning up
...
[ 708.005536] aacraid: aac_fib_send: first asynchronous command timed out.
[ 708.005542] Usually a result of a PCI interrupt routing problem;
[ 708.005548] update mother board BIOS or consider utilizing one of
[ 708.005553] the SAFE mode kernel options (acpi, apic etc)
...
[ 708.030099] scsi 4:0:0:0: Attempting to queue an ABORT message
[ 708.030110] CDB: 0x0 0x0 0x0 0x0 0x0 0x0
[ 708.030191] scsi 4:0:0:0: Command already completed
[ 708.030201] aic7xxx_abort returns 0x2002
...
[ 718.100047] scsi 3:0:0:0: Device offlined - not ready after error recovery
...
[ 935.879635] scsi4: At time of recovery, card was paused
[ 935.884941] >>>>>>>>>>>>>>>>>> Dump Card State Begins <<<<<<<<<<<<<<<<<
[ 935.884947] scsi4: Dumping Card State in Message-in phase, at SEQADDR 0x100
[ 935.898648] Card was paused
[ 935.901509] ACCUM = 0xc0, SINDEX = 0x71, DINDEX = 0x8c, ARG_2 = 0x0
[ 935.907839] HCNT = 0x0 SCBPTR = 0x0
[ 935.911393] SCSISIGI[0xe6]:(REQI|BSYI|MSGI|IOI|CDI)
[ 935.916886] ERROR[0x0] SCSIBUSL[0x0] LASTPHASE[0xe0]:(MSGI|IOI|CDI)
[ 935.923912] SCSISEQ[0x12]:(ENAUTOATNP|ENRSELI)
[ 935.928781] SBLKCTL[0x0] SCSIRATE[0x0] SEQCTL[0x10]:(FASTMODE)
[ 935.935235] SEQ_FLAGS[0x0] SSTAT0[0x7]:(DMADONE|SPIORDY|SDONE)
[ 935.941690] SSTAT1[0x3]:(REQINIT|PHASECHG) SSTAT2[0x0]
[ 935.947380] SSTAT3[0x0] SIMODE0[0x0] SIMODE1[0xac]:(ENSCSIPERR|ENBUSFREE|ENSCSIRST|ENSELTIMO)
[ 935.956736] SXFRCTL0[0x88]:(SPIOEN|DFON) DFCNTRL[0x0]
[ 935.962341] DFSTATUS[0x29]:(FIFOEMP|HDONE|FIFOQWDEMP)
[ 935.967877] STACK: 0x0 0x164 0x18c 0xff
[ 935.971865] SCB count = 4
[ 935.974797] Kernel NEXTQSCB = 3
[ 935.977161] scsi 2:0:6:0: Attempting to queue an ABORT message
[ 935.977170] CDB: 0x12 0x0 0x0 0x0 0x24 0x0
[ 935.977282] scsi 2:0:6:0: Command already completed
[ 935.977290] aic7xxx_abort returns 0x2002
[ 935.996970] Card NEXTQSCB = 2
[ 935.999999] QINFIFO entries: 2
[ 936.003336] Waiting Queue entries:
[ 936.006948] Disconnected Queue entries:
[ 936.010994] QOUTFIFO entries:
[ 936.014176] Sequencer Free SCB List: 0 1 2
[ 936.018698] Sequencer SCB Info:
[ 936.021805] 0 SCB_CONTROL[0xc0]:(DISCENB|TARGET_SCB)
[ 936.027672] SCB_SCSIID[0x57] SCB_LUN[0x0] SCB_TAG[0xff]
[ 936.033200] 1 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[ 936.041102] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[ 936.047148] 2 SCB_CONTROL[0x0] SCB_SCSIID[0xff]:(TWIN_CHNLB|OID|TWIN_TID)
[ 936.055050] SCB_LUN[0xff]:(SCB_XFERLEN_ODD|LID) SCB_TAG[0xff]
[ 936.061346] Pending list:
[ 936.063935] 2 SCB_CONTROL[0x0] SCB_SCSIID[0x57] SCB_LUN[0x0]
[ 936.070644] Kernel Free SCB list: 1 0
[ 936.074665] Untagged Q(5): 2
[ 936.077826]
[ 936.077828] <<<<<<<<<<<<<<<<< Dump Card State Ends >>>>>>>>>>>>>>>>>>
[ 936.085898] scsi4:0:5:0: Cmd aborted from QINFIFO