kernel BUG at /build/buildd/linux-2.6.24/mm/memory.c:2667 (2.6.24-28.75)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
New
|
Undecided
|
Unassigned |
Bug Description
Binary package hint: linux-image-
Since the upgrade to 2.6.24-28.75 a bunch of our cluster nodes are tripping on this kernel BUG when the batch system (pbs_mom from torque) starts up. This has never happened before. I.e. at 2.6.24-28.73 things where working ok.
The process (pbs_mom) is running as root so it might be the real cause, but as this has never happened before...
[ 164.803003] ------------[ cut here ]------------
[ 164.807704] kernel BUG at /build/
[ 164.814343] invalid opcode: 0000 [1] SMP
[ 164.818526] CPU 1
[ 164.820661] Modules linked in: openafs(P) autofs4 ext2 ext3 jbd mbcache ipmi_
devintf ipmi_si ipmi_msghandler ipv6 psmouse iTCO_wdt pcspkr evdev container iTC
O_vendor_support serio_raw shpchp button i5000_edac edac_core pci_hotplug xfs sg
sd_mod pata_acpi ata_piix ata_generic libata ehci_hcd uhci_hcd scsi_mod usbcore
e1000 dm_mirror dm_snapshot dm_mod thermal processor fan fbcon tileblit font bi
tblit softcursor
[ 164.859639] Pid: 5509, comm: pbs_mom Tainted: P 2.6.24-28-server #1
[ 164.866610] RIP: 0010:[<
resent+0xa5/0xc0
[ 164.875313] RSP: 0018:ffff810418
[ 164.880713] RAX: ffff810419d87dc0 RBX: 00007fffdba2e000 RCX: 0000000004040075
[ 164.887951] RDX: 00000000ffffffff RSI: 00007fffdba2e000 RDI: ffff810418df56c0
[ 164.895171] RBP: 00007fffdba2e000 R08: 00000007fffffffe R09: ffff81041e05b520
[ 164.902404] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000102173
[ 164.909633] R13: 0000000000000000 R14: 00007fffdba2e000 R15: 00007fffdba2e000
[ 164.916877] FS: 00007f9f4f9d56e
00000000000
[ 164.925086] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 164.930921] CR2: 00007f2898f8ed50 CR3: 0000000418d59000 CR4: 00000000000006e0
[ 164.938150] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 164.945384] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 164.952611] Process pbs_mom (pid: 5509, threadinfo ffff810418cfa000, task fff
f8104185ee7f0)
[ 164.961064] Stack: ffff810418cfbf48 0000000000102173 0000000000102173 ffff81
0419d878f0
[ 164.969448] 00000000ffffffff ffffffff80299613 0000000000000000 00000007fffff
ffe
[ 164.977168] 0000000000000000 ffff8104122b9680 ffff810418df56c0 0000000000000
001
[ 164.984657] Call Trace:
[ 164.987391] [<ffffffff80299
[ 164.992877] [<ffffffff80299
[ 164.998176] [<ffffffff80246
[ 165.003213] [<ffffffff80299
[ 165.008618] [<ffffffff8020c
[ 165.013943]
[ 165.015482]
[ 165.015482] Code: 0f 0b eb fe 0f 1f 80 00 00 00 00 0f 0b eb fe 66 66 66 2e 0f
[ 165.025507] RIP [<ffffffff80298
[ 165.031841] RSP <ffff810418cfbea8>
[ 165.035501] ---[ end trace a9e9eb01a097b525 ]---
Hello,
We are seeing this kernel bug as well:
17.452673] ------------[ cut here ]------------ buildd/ linux-2. 6.24/debian/ build/custom- source- xen/mm/ memory. c:2704! ffffffff80288b5 5>] [<ffffffff80288 b55>] make_pages_ present+ 0xa5/0xc0 f6fe98 EFLAGS: 00010246 0(0000) GS:ffffffff805c 9180(0000) knlGS:000000000 0000000 3a0>] mlock_fixup+ 0x100/0x180 58d>] do_mlock+0xcd/0x110 6ba>] sys_mlock+0xea/0xf0 688>] system_ call+0x68/ 0x6d 620>] system_ call+0x0/ 0x6d b55>] make_pages_ present+ 0xa5/0xc0
[ 17.452877] kernel BUG at /build/
[ 17.453087] invalid opcode: 0000 [1] SMP
[ 17.453605] CPU 3
[ 17.453952] Modules linked in: ppdev ac sbs video output ipv6 battery dock sbshc iptable_filter ip_tables x_tables parport_pc lp parport loop 8250_pnp serio_raw 8250 evdev serial_core psmouse pcspkr container iTCO_wdt iTCO_vendor_support button shpchp pci_hotplug e752x_edac edac_core dm_multipath ext3 jbd mbcache sg sr_mod cdrom ata_piix ata_generic pata_acpi uhci_hcd floppy ehci_hcd libata cciss usbcore scsi_mod tg3 raid10 raid456 async_xor async_memcpy async_tx xor raid1 raid0 multipath linear md_mod dm_mirror dm_snapshot dm_mod thermal processor fan fuse
[ 17.465650] Pid: 5270, comm: xm Not tainted 2.6.24-28-xen #1
[ 17.465807] RIP: e030:[<
[ 17.466119] RSP: e02b:ffff880268
[ 17.466273] RAX: ffff880267641f00 RBX: 00007fffda7c8000 RCX: 0000000000100173
[ 17.466432] RDX: 00000000ffffffff RSI: 00007fffda7c8000 RDI: ffff880267028500
[ 17.466590] RBP: 00007fffda7c8000 R08: ffff880269424c90 R09: 0000000000000000
[ 17.466750] R10: ffff880269424c90 R11: ffff880267676f30 R12: 0000000000102173
[ 17.466909] R13: 0000000000000000 R14: 00007fffda7c8000 R15: 00007fffda7c8000
[ 17.467069] FS: 00007f433580d6e
[ 17.467242] CS: e033 DS: 0000 ES: 0000
[ 17.467394] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 17.467553] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 17.467739] Process xm (pid: 5270, threadinfo ffff880268f6e000, task ffff880267713800)
[ 17.467916] Stack: ffff880268f6ff38 ffff880267676f00 ffff880268f6ff38 ffff880267676f00
[ 17.468764] 00000000ffffffff ffffffff802893a0 0000000000000000 00000007fffffff3
[ 17.469474] 0000000000000000 0000000000000007 ffff880267028500 00007fffda7c8000
[ 17.470033] Call Trace:
[ 17.470352] [<ffffffff80289
[ 17.470528] [<ffffffff80289
[ 17.470699] [<ffffffff80289
[ 17.470863] [<ffffffff8020c
[ 17.471029] [<ffffffff8020c
[ 17.471201]
[ 17.471349]
[ 17.471350] Code: 0f 0b eb fe 0f 1f 80 00 00 00 00 0f 0b eb fe 90 90 90 90 90
[ 17.474471] RIP [<ffffffff80288
[ 17.474764] RSP <ffff880268f6fe98>
[ 17.474927] ---[ end trace f04904656fc7840f ]---
[ 17.481450] ------------[ cut here ]-----------