hardy server crash (nobody cared)

Bug #244933 reported by Alessandro Bono
Affects: linux (Ubuntu)
Status: Invalid
Importance: Undecided
Assigned to: Unassigned

Bug Description

Hi

The system is an old dapper server upgraded to hardy, and it is normally rock solid.
After this "nobody cared" message the system crashed in a horrible way (log attached).

Jul 2 11:21:47 grignolino kernel: [576441.506605] irq 17: nobody cared (try booting with the "irqpoll" option)
Jul 2 11:21:47 grignolino kernel: [576441.506668] Pid: 0, comm: swapper Tainted: P 2.6.24-19-server #1
Jul 2 11:21:47 grignolino kernel: [576441.506698] [<c016e3f4>] __report_bad_irq+0x24/0x80
Jul 2 11:21:47 grignolino kernel: [576441.506725] [<c016e6cb>] note_interrupt+0x27b/0x2c0
Jul 2 11:21:47 grignolino kernel: [576441.506758] [<c016d8f0>] handle_IRQ_event+0x30/0x60
Jul 2 11:21:47 grignolino kernel: [576441.506779] [<c016f096>] handle_fasteoi_irq+0x86/0xe0
Jul 2 11:21:47 grignolino kernel: [576441.506799] [<c010a93b>] do_IRQ+0x3b/0x70
Jul 2 11:21:47 grignolino kernel: [576441.506813] [<c01241d0>] pgd_dtor+0x0/0x60
Jul 2 11:21:47 grignolino kernel: [576441.506835] [<c0108dff>] common_interrupt+0x23/0x28
Jul 2 11:21:47 grignolino kernel: [576441.506868] [<c019007b>] free_huge_page+0x1b/0x90
Jul 2 11:21:47 grignolino kernel: [576441.506888] [<c01062e6>] mwait_idle_with_hints+0x46/0x60
Jul 2 11:21:47 grignolino kernel: [576441.506914] [<c01066c3>] cpu_idle+0x73/0xd0
Jul 2 11:21:47 grignolino kernel: [576441.506998] =======================
Jul 2 11:21:47 grignolino kernel: [576441.507001] handlers:
Jul 2 11:21:47 grignolino kernel: [576441.507042] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 2 11:21:47 grignolino kernel: [576441.507173] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 2 11:21:47 grignolino kernel: [576441.507300] [<f899eb80>] (usb_hcd_irq+0x0/0x60 [usbcore])
Jul 2 11:21:47 grignolino kernel: [576441.507430] [<f89ffe00>] (e1000_intr+0x0/0x160 [e1000])
Jul 2 11:21:47 grignolino kernel: [576441.507551] Disabling IRQ #17
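
The "handlers:" list in a report like this names every driver registered on the disabled IRQ line (here libata twice, usbcore and e1000 all share IRQ 17). A minimal Python sketch for pulling that out of a syslog excerpt, assuming the line format shown above; the function name parse_nobody_cared and the SAMPLE text are illustrative only and not part of any existing tool:

```python
import re

# Patterns assume the 2.6.24-era log format seen in this report.
IRQ_RE = re.compile(r"irq (\d+): nobody cared")
HANDLER_RE = re.compile(r"\((\w+)\+0x[0-9a-f]+/0x[0-9a-f]+ \[(\w+)\]\)")

# Excerpt copied from the log above (stack trace lines omitted).
SAMPLE = """\
Jul 2 11:21:47 grignolino kernel: [576441.506605] irq 17: nobody cared (try booting with the "irqpoll" option)
Jul 2 11:21:47 grignolino kernel: [576441.507001] handlers:
Jul 2 11:21:47 grignolino kernel: [576441.507042] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 2 11:21:47 grignolino kernel: [576441.507173] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 2 11:21:47 grignolino kernel: [576441.507300] [<f899eb80>] (usb_hcd_irq+0x0/0x60 [usbcore])
Jul 2 11:21:47 grignolino kernel: [576441.507430] [<f89ffe00>] (e1000_intr+0x0/0x160 [e1000])
Jul 2 11:21:47 grignolino kernel: [576441.507551] Disabling IRQ #17
"""


def parse_nobody_cared(lines):
    """Return (irq, [(handler, module), ...]) for the last report found.

    Returns (None, []) if no "nobody cared" line is present.
    """
    irq = None
    handlers = []
    in_handlers = False
    for line in lines:
        match = IRQ_RE.search(line)
        if match:
            # A new report starts: reset any handlers collected so far.
            irq = int(match.group(1))
            handlers = []
            in_handlers = False
            continue
        if line.rstrip().endswith("handlers:"):
            in_handlers = True
            continue
        if in_handlers:
            match = HANDLER_RE.search(line)
            if match:
                handlers.append((match.group(1), match.group(2)))
            else:
                # First non-handler line (e.g. "Disabling IRQ") ends the list.
                in_handlers = False
    return irq, handlers


if __name__ == "__main__":
    irq, handlers = parse_nobody_cared(SAMPLE.splitlines())
    print("IRQ", irq)
    for func, module in handlers:
        print(" ", module, func)
```

Running it over the full syslog instead of SAMPLE would report only the most recent occurrence, since each new "nobody cared" line resets the handler list.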

Alessandro Bono (a.bono) wrote :

This one happened when pushing the power button:

Jul 6 12:12:04 grignolino kernel: [334381.766308] irq 17: nobody cared (try booting with the "irqpoll" option)
Jul 6 12:12:04 grignolino kernel: [334381.766369] Pid: 0, comm: swapper Tainted: P 2.6.24-19-server #1
Jul 6 12:12:04 grignolino kernel: [334381.766398] [<c016e3f4>] __report_bad_irq+0x24/0x80
Jul 6 12:12:04 grignolino kernel: [334381.766424] [<c016e6cb>] note_interrupt+0x27b/0x2c0
Jul 6 12:12:04 grignolino kernel: [334381.766457] [<c016d8f0>] handle_IRQ_event+0x30/0x60
Jul 6 12:12:04 grignolino kernel: [334381.766478] [<c016f096>] handle_fasteoi_irq+0x86/0xe0
Jul 6 12:12:04 grignolino kernel: [334381.766499] [<c010a93b>] do_IRQ+0x3b/0x70
Jul 6 12:12:04 grignolino kernel: [334381.766531] [<c0108dff>] common_interrupt+0x23/0x28
Jul 6 12:12:04 grignolino kernel: [334381.766580] [<c01062e6>] mwait_idle_with_hints+0x46/0x60
Jul 6 12:12:04 grignolino kernel: [334381.766606] [<c01066c3>] cpu_idle+0x73/0xd0
Jul 6 12:12:04 grignolino kernel: [334381.766691] =======================
Jul 6 12:12:04 grignolino kernel: [334381.766693] handlers:
Jul 6 12:12:04 grignolino kernel: [334381.766734] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 6 12:12:04 grignolino kernel: [334381.766864] [<f88abe30>] (ata_interrupt+0x0/0x200 [libata])
Jul 6 12:12:04 grignolino kernel: [334381.766992] [<f899eb80>] (usb_hcd_irq+0x0/0x60 [usbcore])
Jul 6 12:12:04 grignolino kernel: [334381.767121] [<f89c1e00>] (e1000_intr+0x0/0x160 [e1000])
Jul 6 12:12:04 grignolino kernel: [334381.767242] Disabling IRQ #17

Philipp Dreimann (philipp-dreimann-deactivatedaccount) wrote :

Please try the following things:
- boot your kernel with the irqpoll option
- try to install and boot a gutsy kernel

and tell us if it solves the problem.
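
To confirm after a reboot that the irqpoll option actually took effect, and to see which devices share the affected line, one can look at /proc/cmdline and /proc/interrupts. A rough Python sketch under some assumptions: the helper names has_boot_option and devices_on_irq are made up for illustration, the SAMPLE_INTERRUPTS text (counts and device names) is invented rather than taken from this machine, and the parsing assumes device names contain no spaces:

```python
def has_boot_option(cmdline, option="irqpoll"):
    """True if `option` appears as a whole word on the kernel command line."""
    return option in cmdline.split()


def devices_on_irq(interrupts_text, irq):
    """Return the device names registered on `irq` in /proc/interrupts text."""
    lines = interrupts_text.splitlines()
    if not lines:
        return []
    # The header row has one column per CPU (CPU0, CPU1, ...).
    n_cpus = len(lines[0].split())
    for line in lines[1:]:
        fields = line.split()
        if not fields or fields[0] != f"{irq}:":
            continue
        # Skip the IRQ label and the per-CPU counters.
        rest = " ".join(fields[1 + n_cpus:])
        pieces = [p.strip() for p in rest.split(",")]
        if not pieces[0]:
            return []
        # The first piece still carries the interrupt controller name;
        # keep only its last word, which is the first device.
        pieces[0] = pieces[0].split()[-1]
        return pieces
    return []


# Hypothetical sample, NOT from the reporter's machine.
SAMPLE_INTERRUPTS = """\
           CPU0       CPU1
  0:        100          0   IO-APIC-edge      timer
 17:     123456     234567   IO-APIC-fasteoi   libata, uhci_hcd:usb3, eth0
"""

if __name__ == "__main__":
    print(has_boot_option("root=/dev/sda1 ro irqpoll"))
    print(devices_on_irq(SAMPLE_INTERRUPTS, 17))
```

On a real system the inputs would come from open("/proc/cmdline").read() and open("/proc/interrupts").read().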

I think it may also be an HDD-related hardware error... any comments?

Alessandro Bono (a.bono) wrote : Re: [Bug 244933] Re: hardy server crash (nobody cared)

On Monday 07 July 2008, Philipp Dreimann wrote:

Hi

> Please try the following things:
> - boot your kernel with the irqpoll option
> - try to install and boot a gutsy kernel
>
> and tell us if it solves the problem.
>
>
> I think that it may be also an hdd related hw error... any comments?

Possible, the HDDs are old, but the errors from the HDD come after the controller's IRQ
was disabled. Anyway, I'll attach the log from smartctl.

This machine was really rock solid for a couple of years with dapper.
After the upgrade to hardy the machine became less stable. It seems to me to be a problem
with the new kernel, but who knows? I'll try to reproduce the problem in a reliable
way. The second oops occurred exactly when I pushed the power button, so maybe it is a
problem with ACPI.

thanks

--
Cordiali saluti

Alessandro Bono

Leann Ogasawara (leannogasawara) wrote :

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreciate your help and feedback.

kernel-janitor (kernel-janitor) wrote :

Hi Alessandro,

This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux-image-`uname -r` 244933

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: needs-kernel-logs
tags: added: needs-upstream-testing
tags: added: kj-triage
Changed in linux (Ubuntu):
status: New → Incomplete
Jeremy Foshee (jeremyfoshee) wrote :

This bug report was marked as Incomplete and has not had any updated comments for quite some time. As a result this bug is being closed. Please reopen if this is still an issue in the current Ubuntu release http://www.ubuntu.com/getubuntu/download . Also, please be sure to provide any requested information that may have been missing. To reopen the bug, click on the current status under the Status column and change the status back to "New". Thanks.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: kj-expired
Changed in linux (Ubuntu):
status: Incomplete → Invalid