kernel BUG at /build/buildd/linux-2.6.24/mm/memory.c:2667 (2.6.24-28.75)

Bug #622636 reported by ake sandgren
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
New
Undecided
Unassigned

Bug Description

Binary package hint: linux-image-2.6.24-28-server

Since the upgrade to 2.6.24-28.75 a bunch of our cluster nodes are tripping on this kernel BUG when the batch system (pbs_mom from torque) starts up. This has never happened before. I.e. at 2.6.24-28.73 things where working ok.

The process (pbs_mom) is running as root so it might be the real cause, but as this has never happened before...

[ 164.803003] ------------[ cut here ]------------
[ 164.807704] kernel BUG at /build/buildd/linux-2.6.24/mm/memory.c:2667!
[ 164.814343] invalid opcode: 0000 [1] SMP
[ 164.818526] CPU 1
[ 164.820661] Modules linked in: openafs(P) autofs4 ext2 ext3 jbd mbcache ipmi_
devintf ipmi_si ipmi_msghandler ipv6 psmouse iTCO_wdt pcspkr evdev container iTC
O_vendor_support serio_raw shpchp button i5000_edac edac_core pci_hotplug xfs sg
 sd_mod pata_acpi ata_piix ata_generic libata ehci_hcd uhci_hcd scsi_mod usbcore
 e1000 dm_mirror dm_snapshot dm_mod thermal processor fan fbcon tileblit font bi
tblit softcursor
[ 164.859639] Pid: 5509, comm: pbs_mom Tainted: P 2.6.24-28-server #1
[ 164.866610] RIP: 0010:[<ffffffff802989b5>] [<ffffffff802989b5>] make_pages_p
resent+0xa5/0xc0
[ 164.875313] RSP: 0018:ffff810418cfbea8 EFLAGS: 00010246
[ 164.880713] RAX: ffff810419d87dc0 RBX: 00007fffdba2e000 RCX: 0000000004040075
[ 164.887951] RDX: 00000000ffffffff RSI: 00007fffdba2e000 RDI: ffff810418df56c0
[ 164.895171] RBP: 00007fffdba2e000 R08: 00000007fffffffe R09: ffff81041e05b520
[ 164.902404] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000102173
[ 164.909633] R13: 0000000000000000 R14: 00007fffdba2e000 R15: 00007fffdba2e000
[ 164.916877] FS: 00007f9f4f9d56e0(0000) GS:ffff810421001800(0000) knlGS:00000
00000000000
[ 164.925086] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[ 164.930921] CR2: 00007f2898f8ed50 CR3: 0000000418d59000 CR4: 00000000000006e0
[ 164.938150] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 164.945384] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 164.952611] Process pbs_mom (pid: 5509, threadinfo ffff810418cfa000, task fff
f8104185ee7f0)
[ 164.961064] Stack: ffff810418cfbf48 0000000000102173 0000000000102173 ffff81
0419d878f0
[ 164.969448] 00000000ffffffff ffffffff80299613 0000000000000000 00000007fffff
ffe
[ 164.977168] 0000000000000000 ffff8104122b9680 ffff810418df56c0 0000000000000
001
[ 164.984657] Call Trace:
[ 164.987391] [<ffffffff80299613>] mlock_fixup+0x103/0x180
[ 164.992877] [<ffffffff80299719>] do_mlockall+0x89/0xa0
[ 164.998176] [<ffffffff80246649>] __capable+0x9/0x20
[ 165.003213] [<ffffffff80299a45>] sys_mlockall+0x95/0xd0
[ 165.008618] [<ffffffff8020c38e>] system_call+0x7e/0x83
[ 165.013943]
[ 165.015482]
[ 165.015482] Code: 0f 0b eb fe 0f 1f 80 00 00 00 00 0f 0b eb fe 66 66 66 2e 0f

[ 165.025507] RIP [<ffffffff802989b5>] make_pages_present+0xa5/0xc0
[ 165.031841] RSP <ffff810418cfbea8>
[ 165.035501] ---[ end trace a9e9eb01a097b525 ]---

Revision history for this message
Edward Z. Yang (ezyang) wrote :

Hello,

We are seeing this kernel bug as well:

   17.452673] ------------[ cut here ]------------
[ 17.452877] kernel BUG at /build/buildd/linux-2.6.24/debian/build/custom-source-xen/mm/memory.c:2704!
[ 17.453087] invalid opcode: 0000 [1] SMP
[ 17.453605] CPU 3
[ 17.453952] Modules linked in: ppdev ac sbs video output ipv6 battery dock sbshc iptable_filter ip_tables x_tables parport_pc lp parport loop 8250_pnp serio_raw 8250 evdev serial_core psmouse pcspkr container iTCO_wdt iTCO_vendor_support button shpchp pci_hotplug e752x_edac edac_core dm_multipath ext3 jbd mbcache sg sr_mod cdrom ata_piix ata_generic pata_acpi uhci_hcd floppy ehci_hcd libata cciss usbcore scsi_mod tg3 raid10 raid456 async_xor async_memcpy async_tx xor raid1 raid0 multipath linear md_mod dm_mirror dm_snapshot dm_mod thermal processor fan fuse
[ 17.465650] Pid: 5270, comm: xm Not tainted 2.6.24-28-xen #1
[ 17.465807] RIP: e030:[<ffffffff80288b55>] [<ffffffff80288b55>] make_pages_present+0xa5/0xc0
[ 17.466119] RSP: e02b:ffff880268f6fe98 EFLAGS: 00010246
[ 17.466273] RAX: ffff880267641f00 RBX: 00007fffda7c8000 RCX: 0000000000100173
[ 17.466432] RDX: 00000000ffffffff RSI: 00007fffda7c8000 RDI: ffff880267028500
[ 17.466590] RBP: 00007fffda7c8000 R08: ffff880269424c90 R09: 0000000000000000
[ 17.466750] R10: ffff880269424c90 R11: ffff880267676f30 R12: 0000000000102173
[ 17.466909] R13: 0000000000000000 R14: 00007fffda7c8000 R15: 00007fffda7c8000
[ 17.467069] FS: 00007f433580d6e0(0000) GS:ffffffff805c9180(0000) knlGS:0000000000000000
[ 17.467242] CS: e033 DS: 0000 ES: 0000
[ 17.467394] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 17.467553] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[ 17.467739] Process xm (pid: 5270, threadinfo ffff880268f6e000, task ffff880267713800)
[ 17.467916] Stack: ffff880268f6ff38 ffff880267676f00 ffff880268f6ff38 ffff880267676f00
[ 17.468764] 00000000ffffffff ffffffff802893a0 0000000000000000 00000007fffffff3
[ 17.469474] 0000000000000000 0000000000000007 ffff880267028500 00007fffda7c8000
[ 17.470033] Call Trace:
[ 17.470352] [<ffffffff802893a0>] mlock_fixup+0x100/0x180
[ 17.470528] [<ffffffff8028958d>] do_mlock+0xcd/0x110
[ 17.470699] [<ffffffff802896ba>] sys_mlock+0xea/0xf0
[ 17.470863] [<ffffffff8020c688>] system_call+0x68/0x6d
[ 17.471029] [<ffffffff8020c620>] system_call+0x0/0x6d
[ 17.471201]
[ 17.471349]
[ 17.471350] Code: 0f 0b eb fe 0f 1f 80 00 00 00 00 0f 0b eb fe 90 90 90 90 90
[ 17.474471] RIP [<ffffffff80288b55>] make_pages_present+0xa5/0xc0
[ 17.474764] RSP <ffff880268f6fe98>
[ 17.474927] ---[ end trace f04904656fc7840f ]---
[ 17.481450] ------------[ cut here ]-----------

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.