8.04 - kernel 2.6.24-18 up to 21 : domU crash with java

Bug #300031 reported by Jean-Luc Renard
This bug affects 1 person
Affects: linux (Ubuntu)
Status: Expired
Importance: Undecided
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

Binary package hint: linux-image-xen

Description: Ubuntu 8.04.1
Release: 8.04

Package: linux-image-2.6.24-18-xen (2.6.24-18.32) up to linux-image-2.6.24-21-xen (2.6.24-21.43)

I have a Zimbra server running in a domU on Ubuntu 8.04.
I have used kernels from 2.6.24-18.32 up to 2.6.24-21.43 for both dom0 and domU.
The newer the kernel, the more frequently the issue occurs.

After running for some time (between 18h and 36h), the following kernel error occurs:

Nov 15 21:50:03 server kernel: [264758.709116] invalid opcode: 0000 [#1] SMP
Nov 15 21:50:03 server kernel: [264758.709149] Modules linked in: af_packet drbd cn ipv6 evdev ext3 jbd mbcache dm_mirror dm_snapshot dm_mod fuse
Nov 15 21:50:03 server kernel: [264758.709185]
Nov 15 21:50:03 server kernel: [264758.709189] Pid: 32165, comm: java Not tainted (2.6.24-18-xen #1)
Nov 15 21:50:03 server kernel: [264758.709193] EIP: 0061:[<c1e0ce69>] EFLAGS: 00010212 CPU: 2
Nov 15 21:50:03 server kernel: [264758.709198] EIP is at 0xc1e0ce69
Nov 15 21:50:03 server kernel: [264758.709201] EAX: c1de2980 EBX: c1de5280 ECX: 00000004 EDX: 00000000
Nov 15 21:50:03 server kernel: [264758.709204] ESI: 00000002 EDI: 40040000 EBP: 00000000 ESP: cd4ddd90
Nov 15 21:50:03 server kernel: [264758.709207] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0069
Nov 15 21:50:03 server kernel: [264758.709212] Process java (pid: 32165, ti=cd4dc000 task=dea32710 task.ti=cd4dc000)
Nov 15 21:50:03 server kernel: [264758.709215] Stack: c01623a5 00000000 c03fdf80 cd4dde04 00000002 0000000e cd4dddcc c0162456
Nov 15 21:50:03 server kernel: [264758.709229] c1de5920 c15e240c c03fdf80 0000000d c01658c8 0000000e 00000000 0000000e
Nov 15 21:50:03 server kernel: [264758.709242] 00000000 c48ced20 c498dda0 c4ab3fa0 c4891be0 c4acb380 c44c9de0 c4716500
Nov 15 21:50:03 server kernel: [264758.709257] Call Trace:
Nov 15 21:50:03 server kernel: [264758.709261] [free_hot_cold_page+0x195/0x220] free_hot_cold_page+0x195/0x220
Nov 15 21:50:03 server kernel: [264758.709271] [__pagevec_free+0x26/0x30] __pagevec_free+0x26/0x30
Nov 15 21:50:03 server kernel: [264758.709278] [release_pages+0x68/0x160] release_pages+0x68/0x160
Nov 15 21:50:03 server kernel: [264758.709283] [free_pages_and_swap_cache+0x74/0xa0] free_pages_and_swap_cache+0x74/0xa0
Nov 15 21:50:03 server kernel: [264758.709290] [exit_mmap+0xe7/0x100] exit_mmap+0xe7/0x100
Nov 15 21:50:03 server kernel: [264758.709297] [mmput+0x23/0x80] mmput+0x23/0x80
Nov 15 21:50:03 server kernel: [264758.709303] [do_exit+0x165/0x8b0] do_exit+0x165/0x8b0
Nov 15 21:50:03 server kernel: [264758.709308] [recalc_sigpending+0xb/0x40] recalc_sigpending+0xb/0x40
Nov 15 21:50:03 server kernel: [264758.709315] [dequeue_signal+0x6b/0x150] dequeue_signal+0x6b/0x150
Nov 15 21:50:03 server kernel: [264758.709321] [do_group_exit+0x2a/0xa0] do_group_exit+0x2a/0xa0
Nov 15 21:50:03 server kernel: [264758.709327] [get_signal_to_deliver+0x2e9/0x540] get_signal_to_deliver+0x2e9/0x540
Nov 15 21:50:03 server kernel: [264758.709333] [do_notify_resume+0x93/0x760] do_notify_resume+0x93/0x760
Nov 15 21:50:03 server kernel: [264758.709339] [mprotect_fixup+0x6e7/0x800] mprotect_fixup+0x6e7/0x800
Nov 15 21:50:03 server kernel: [264758.709346] [sys_futex+0x97/0x120] sys_futex+0x97/0x120
Nov 15 21:50:03 server kernel: [264758.709352] [sys_mprotect+0x11f/0x230] sys_mprotect+0x11f/0x230
Nov 15 21:50:03 server kernel: [264758.709359] [work_notifysig+0x13/0x22] work_notifysig+0x13/0x22
Nov 15 21:50:03 server kernel: [264758.709365] =======================
Nov 15 21:50:03 server kernel: [264758.709368] Code: 20 00 00 00 00 40 01 00 00 00 ff ff ff ff d4 4c ee c1 00 00 00 00 e0 0d e1 c1 00 01 10 00 00 02 20 00 00 00 08 40 00 00 00 00 ff <ff> ff ff 00 00 00 00 00 00 00 00 a0 c9 e0 c1 b8 97 de c1 d8 be
Nov 15 21:50:03 server kernel: [264758.709437] EIP: [<c1e0ce69>] 0xc1e0ce69 SS:ESP 0069:cd4ddd90
Nov 15 21:50:03 server kernel: [264758.709450] ---[ end trace 7afb563bc1905199 ]---
Nov 15 21:50:03 server kernel: [264758.709453] Fixing recursive fault but reboot is needed!

The server continues to run, then suddenly goes into an infinite loop:

Nov 15 22:10:44 server kernel: [266003.290318] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]
Nov 15 22:10:44 server kernel: [266003.290332]
Nov 15 22:10:44 server kernel: [266003.290336] Pid: 23418, comm: java Tainted: G D (2.6.24-18-xen #1)
Nov 15 22:10:44 server kernel: [266003.290340] EIP: 0061:[ipv6:_spin_lock+0x5/0x10] EFLAGS: 00000286 CPU: 4
Nov 15 22:10:44 server kernel: [266003.290350] EIP is at _spin_lock+0x5/0x10
Nov 15 22:10:44 server kernel: [266003.290353] EAX: c1e0f8ac EBX: 00000000 ECX: c1e0f8a0 EDX: 00000838
Nov 15 22:10:44 server kernel: [266003.290356] ESI: ae5ba067 EDI: 00000001 EBP: c0477158 ESP: cc00dddc
Nov 15 22:10:44 server kernel: [266003.290359] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0069
Nov 15 22:10:44 server kernel: [266003.290365] CR0: 80050033 CR2: 91707388 CR3: 0d3ae000 CR4: 00002660
Nov 15 22:10:44 server kernel: [266003.290372] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Nov 15 22:10:44 server kernel: [266003.290376] DR6: ffff0ff0 DR7: 00000400
Nov 15 22:10:44 server kernel: [266003.290381] [__do_fault+0x3b8/0x6b0] __do_fault+0x3b8/0x6b0
Nov 15 22:10:44 server kernel: [266003.290394] [handle_mm_fault+0x249/0x1350] handle_mm_fault+0x249/0x1350
Nov 15 22:10:44 server kernel: [266003.290401] [timer_interrupt+0x3a0/0x770] timer_interrupt+0x3a0/0x770
Nov 15 22:10:44 server kernel: [266003.290411] [hrtimer_run_queues+0xda/0x1e0] hrtimer_run_queues+0xda/0x1e0
Nov 15 22:10:44 server kernel: [266003.290418] [local_clock+0x55/0xa0] local_clock+0x55/0xa0
Nov 15 22:10:44 server kernel: [266003.290425] [do_page_fault+0x366/0xe90] do_page_fault+0x366/0xe90
Nov 15 22:10:44 server kernel: [266003.290434] [tasklet_action+0x6c/0x110] tasklet_action+0x6c/0x110
Nov 15 22:10:44 server kernel: [266003.290441] [__do_softirq+0x92/0x130] __do_softirq+0x92/0x130
Nov 15 22:10:44 server kernel: [266003.290449] [evdev:do_gettimeofday+0x34/0x40860] do_gettimeofday+0x34/0xe0
Nov 15 22:10:44 server kernel: [266003.290457] [ipv6:copy_to_user+0x30/0xa60] copy_to_user+0x30/0x60
Nov 15 22:10:44 server kernel: [266003.290464] [sys_gettimeofday+0x28/0x80] sys_gettimeofday+0x28/0x80
Nov 15 22:10:44 server kernel: [266003.290471] [do_page_fault+0x0/0xe90] do_page_fault+0x0/0xe90
Nov 15 22:10:44 server kernel: [266003.290477] [error_code+0x35/0x40] error_code+0x35/0x40
Nov 15 22:10:44 server kernel: [266003.290484] [vcc_def_wakeup+0x30/0x60] vcc_def_wakeup+0x30/0x60
Nov 15 22:10:44 server kernel: [266003.290492] =======================
Nov 15 22:10:56 server kernel: [266015.094479] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]
Nov 15 22:10:56 server kernel: [266015.094495]
Nov 15 22:10:56 server kernel: [266015.094499] Pid: 23418, comm: java Tainted: G D (2.6.24-18-xen #1)
Nov 15 22:10:56 server kernel: [266015.094503] EIP: 0061:[ipv6:_spin_lock+0xa/0x10] EFLAGS: 00000286 CPU: 4
Nov 15 22:10:56 server kernel: [266015.094513] EIP is at _spin_lock+0xa/0x10
Nov 15 22:10:56 server kernel: [266015.094516] EAX: c1e0f8ac EBX: 00000000 ECX: c1e0f8a0 EDX: 00000838
Nov 15 22:10:56 server kernel: [266015.094520] ESI: ae5ba067 EDI: 00000001 EBP: c0477158 ESP: cc00dddc
Nov 15 22:10:56 server kernel: [266015.094523] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0069
Nov 15 22:10:56 server kernel: [266015.094531] CR0: 80050033 CR2: 91707388 CR3: 0d3ae000 CR4: 00002660
Nov 15 22:10:56 server kernel: [266015.094536] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Nov 15 22:10:56 server kernel: [266015.094541] DR6: ffff0ff0 DR7: 00000400
Nov 15 22:10:56 server kernel: [266015.094546] [__do_fault+0x3b8/0x6b0] __do_fault+0x3b8/0x6b0
Nov 15 22:10:56 server kernel: [266015.094559] [handle_mm_fault+0x249/0x1350] handle_mm_fault+0x249/0x1350
Nov 15 22:10:56 server kernel: [266015.094568] [timer_interrupt+0x3a0/0x770] timer_interrupt+0x3a0/0x770
Nov 15 22:10:56 server kernel: [266015.094577] [hrtimer_run_queues+0xda/0x1e0] hrtimer_run_queues+0xda/0x1e0
Nov 15 22:10:56 server kernel: [266015.094585] [local_clock+0x55/0xa0] local_clock+0x55/0xa0
Nov 15 22:10:56 server kernel: [266015.094592] [do_page_fault+0x366/0xe90] do_page_fault+0x366/0xe90
Nov 15 22:10:56 server kernel: [266015.094600] [tasklet_action+0x6c/0x110] tasklet_action+0x6c/0x110
Nov 15 22:10:56 server kernel: [266015.094608] [__do_softirq+0x92/0x130] __do_softirq+0x92/0x130
Nov 15 22:10:56 server kernel: [266015.094617] [evdev:do_gettimeofday+0x34/0x40860] do_gettimeofday+0x34/0xe0
Nov 15 22:10:56 server kernel: [266015.094625] [ipv6:copy_to_user+0x30/0xa60] copy_to_user+0x30/0x60
Nov 15 22:10:56 server kernel: [266015.094633] [sys_gettimeofday+0x28/0x80] sys_gettimeofday+0x28/0x80
Nov 15 22:10:56 server kernel: [266015.094640] [do_page_fault+0x0/0xe90] do_page_fault+0x0/0xe90
Nov 15 22:10:56 server kernel: [266015.094648] [error_code+0x35/0x40] error_code+0x35/0x40
Nov 15 22:10:56 server kernel: [266015.094655] [vcc_def_wakeup+0x30/0x60] vcc_def_wakeup+0x30/0x60
Nov 15 22:10:56 server kernel: [266015.094662] =======================
Nov 15 22:11:08 server kernel: [266026.907977] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]
Nov 15 22:11:08 server kernel: [266026.907993]
Nov 15 22:11:08 server kernel: [266026.907997] Pid: 23418, comm: java Tainted: G D (2.6.24-18-xen #1)
Nov 15 22:11:08 server kernel: [266026.908001] EIP: 0061:[ipv6:_spin_lock+0x7/0x10] EFLAGS: 00000286 CPU: 4
Nov 15 22:11:08 server kernel: [266026.908010] EIP is at _spin_lock+0x7/0x10
Nov 15 22:11:08 server kernel: [266026.908013] EAX: c1e0f8ac EBX: 00000000 ECX: c1e0f8a0 EDX: 00000838
Nov 15 22:11:08 server kernel: [266026.908016] ESI: ae5ba067 EDI: 00000001 EBP: c0477158 ESP: cc00dddc
Nov 15 22:11:08 server kernel: [266026.908020] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0069
Nov 15 22:11:08 server kernel: [266026.908027] CR0: 80050033 CR2: 91707388 CR3: 0d3ae000 CR4: 00002660
Nov 15 22:11:08 server kernel: [266026.908031] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Nov 15 22:11:08 server kernel: [266026.908036] DR6: ffff0ff0 DR7: 00000400
Nov 15 22:11:08 server kernel: [266026.908039] [__do_fault+0x3b8/0x6b0] __do_fault+0x3b8/0x6b0
Nov 15 22:11:08 server kernel: [266026.908050] [handle_mm_fault+0x249/0x1350] handle_mm_fault+0x249/0x1350
Nov 15 22:11:08 server kernel: [266026.908057] [timer_interrupt+0x3a0/0x770] timer_interrupt+0x3a0/0x770
Nov 15 22:11:08 server kernel: [266026.908065] [hrtimer_run_queues+0xda/0x1e0] hrtimer_run_queues+0xda/0x1e0
Nov 15 22:11:08 server kernel: [266026.908072] [local_clock+0x55/0xa0] local_clock+0x55/0xa0
Nov 15 22:11:08 server kernel: [266026.908078] [do_page_fault+0x366/0xe90] do_page_fault+0x366/0xe90
Nov 15 22:11:08 server kernel: [266026.908083] [tasklet_action+0x6c/0x110] tasklet_action+0x6c/0x110
Nov 15 22:11:08 server kernel: [266026.908090] [__do_softirq+0x92/0x130] __do_softirq+0x92/0x130
Nov 15 22:11:08 server kernel: [266026.908096] [evdev:do_gettimeofday+0x34/0x40860] do_gettimeofday+0x34/0xe0
Nov 15 22:11:08 server kernel: [266026.908102] [ipv6:copy_to_user+0x30/0xa60] copy_to_user+0x30/0x60
Nov 15 22:11:08 server kernel: [266026.908109] [sys_gettimeofday+0x28/0x80] sys_gettimeofday+0x28/0x80
Nov 15 22:11:08 server kernel: [266026.908114] [do_page_fault+0x0/0xe90] do_page_fault+0x0/0xe90
Nov 15 22:11:08 server kernel: [266026.908120] [error_code+0x35/0x40] error_code+0x35/0x40
Nov 15 22:11:08 server kernel: [266026.908125] [vcc_def_wakeup+0x30/0x60] vcc_def_wakeup+0x30/0x60
Nov 15 22:11:08 server kernel: [266026.908131] =======================
....
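
For anyone comparing their own logs against the ones above, here is a minimal sketch that pulls the two event types out of syslog-style lines: the initial "invalid opcode" oops and the repeated "BUG: soft lockup" reports. The regular expressions are modelled only on the lines quoted in this report, and the names (summarize, SAMPLE) are hypothetical, chosen for illustration. Other kernel versions may format these messages differently.

```python
import re
from collections import Counter

# Patterns modelled on the log lines quoted in this report (an assumption:
# other kernels may word or format these messages differently).
OOPS_RE = re.compile(r"\[\s*(\d+\.\d+)\] invalid opcode: \d+ \[#(\d+)\]")
LOCKUP_RE = re.compile(
    r"\[\s*(\d+\.\d+)\] BUG: soft lockup - CPU#(\d+) stuck for (\d+)s! "
    r"\[(\S+):(\d+)\]"
)


def summarize(lines):
    """Return (oops uptime stamps, Counter of soft lockups keyed by (cpu, comm, pid))."""
    oopses = []
    lockups = Counter()
    for line in lines:
        m = OOPS_RE.search(line)
        if m:
            oopses.append(float(m.group(1)))
            continue
        m = LOCKUP_RE.search(line)
        if m:
            cpu, comm, pid = int(m.group(2)), m.group(4), int(m.group(5))
            lockups[(cpu, comm, pid)] += 1
    return oopses, lockups


# Header lines copied from the traces above.
SAMPLE = [
    "Nov 15 21:50:03 server kernel: [264758.709116] invalid opcode: 0000 [#1] SMP",
    "Nov 15 22:10:44 server kernel: [266003.290318] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]",
    "Nov 15 22:10:56 server kernel: [266015.094479] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]",
    "Nov 15 22:11:08 server kernel: [266026.907977] BUG: soft lockup - CPU#4 stuck for 11s! [java:23418]",
]

oopses, lockups = summarize(SAMPLE)
print("oops events at uptime (s):", oopses)
for (cpu, comm, pid), count in lockups.items():
    print(f"soft lockup: CPU#{cpu} {comm}[{pid}] reported {count} time(s)")
```

To run it on a real log, pass an open file to summarize() instead of SAMPLE, for example open("/var/log/kern.log"); that path is an assumption and depends on the local syslog configuration.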

I have read on other lists that this has been identified as a kernel bug in Debian and should be fixed in 2.6.27-rc5.
Is it fixed, or scheduled to be fixed, in the 8.04 release? Or does somebody have a workaround?

Regards,

Bohuslav Blin (bohuslav) wrote :

Same problem. I run 8.04 on Dom0 and 6.06.2 with Zimbra on DomU.

Andy Whitcroft (apw) wrote :

This is not a bug in the linux-meta package, moving to the linux package.

affects: linux-meta (Ubuntu) → linux (Ubuntu)
tags: added: xen
Jeremy Foshee (jeremyfoshee) wrote :

Hi Jean-Luc,

This bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? Can you try with the latest development release of Ubuntu? ISO CD images are available from http://cdimage.ubuntu.com/releases/ .

If it remains an issue, could you run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux 300031

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

    [This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: needs-kernel-logs
tags: added: needs-upstream-testing
tags: added: kj-triage
Changed in linux (Ubuntu):
status: New → Incomplete
Jeremy Foshee (jeremyfoshee) wrote :

This bug report was marked as Incomplete and has not had any updated comments for quite some time. As a result this bug is being closed. Please reopen if this is still an issue in the current Ubuntu release http://www.ubuntu.com/getubuntu/download . Also, please be sure to provide any requested information that may have been missing. To reopen the bug, click on the current status under the Status column and change the status back to "New". Thanks.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags: added: kj-expired
Changed in linux (Ubuntu):
status: Incomplete → Expired