Bug #681083 “Ubuntu Crashes/Freeze on XenMotion” : Bugs : linux package : Ubuntu

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-11-24:

#1

Testing the PPA's kernel, before http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.32.16.5-lucid/ it works well, after this release 2.6.32.16.5 the problem appears.

Looking at the changes, this one seems to be related, changing the disk and network, both symptoms that I'm getting:

xen: avoid allocation causing potential swap activity on the resume path

I tried to rollback the patch and the Ubuntu VM Guest gets powered off in the end of the migration.

I'll keep looking.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-11-25:

#2

Download full text (4.2 KiB)

Updated to the new kernel release 2.6.32-26-generic

[1285568.771462] ------------[ cut here ]------------
[1285568.771473] kernel BUG at /build/buildd/linux-2.6.32/arch/x86/xen/spinlock.c:343!
[1285568.771486] invalid opcode: 0000 [#1] SMP
[1285568.771500] last sysfs file: /sys/power/pm_trace
[1285568.771508] CPU 0
[1285568.771517] Modules linked in: xenfs lp parport xen_netfront xen_blkfront
[1285568.771554] Pid: 41, comm: xenwatch Not tainted 2.6.32-26-generic #47-Ubuntu
[1285568.771567] RIP: e030:[<ffffffff8100fca4>] [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1285568.771590] RSP: e02b:ffff880003669e88 EFLAGS: 00010046
[1285568.771594] RAX: ffffffffff57b000 RBX: ffff88007fc1b060 RCX: 0000000000000000
[1285568.771600] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000001
[1285568.771605] RBP: ffff880003669e88 R08: 0000000000000000 R09: 0000000000000000
[1285568.771612] R10: ffff880003671028 R11: 0000000000012eb0 R12: 0000000000000000
[1285568.771617] R13: 0000000000000000 R14: 0000000000000001 R15: 0000000000000100
[1285568.771626] FS: 00007fae6666c700(0000) GS:ffff880003666000(0000) knlGS:0000000000000000
[1285568.771632] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[1285568.771637] CR2: 0000000000000000 CR3: 000000007d7f7000 CR4: 0000000000002660
[1285568.771645] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[1285568.771653] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[1285568.771663] Process xenwatch (pid: 41, threadinfo ffff88007d50c000, task ffff88007d4cadc0)
[1285568.771669] Stack:
[1285568.771672] ffff880003669ed8 ffffffff810c4550 0000000000000000 0000000000000000
[1285568.771681] <0> 0000000000000000 ffffffff817b1300 0000000000000001 0000000000000000
[1285568.771690] <0> 0000000000000001 0000000000000100 ffff880003669ef8 ffffffff810c6882
[1285568.771702] Call Trace:
[1285568.771705] <IRQ>
[1285568.771713] [<ffffffff810c4550>] handle_IRQ_event+0x60/0x170
[1285568.771723] [<ffffffff810c6882>] handle_percpu_irq+0x42/0x80
[1285568.771734] [<ffffffff81014d12>] handle_irq+0x22/0x30
[1285568.771743] [<ffffffff8131da99>] xen_evtchn_do_upcall+0x199/0x1c0
[1285568.771749] [<ffffffff8101333e>] xen_do_hypervisor_callback+0x1e/0x30
[1285568.771754] <EOI>
[1285568.771763] [<ffffffff8100922a>] ? hypercall_page+0x22a/0x1010
[1285568.771769] [<ffffffff8100922a>] ? hypercall_page+0x22a/0x1010
[1285568.771776] [<ffffffff8100eb6d>] ? xen_force_evtchn_callback+0xd/0x10
[1285568.771783] [<ffffffff8100f302>] ? check_events+0x12/0x20
[1285568.771791] [<ffffffff8100f2a9>] ? xen_irq_enable_direct_end+0x0/0x7
[1285568.771800] [<ffffffff810578b9>] ? finish_task_switch+0x59/0xe0
[1285568.771808] [<ffffffff815417a8>] ? thread_return+0x48/0x420
[1285568.771815] [<ffffffff8106240a>] ? __cond_resched+0x2a/0x40
[1285568.771823] [<ffffffff81543e5e>] ? _spin_unlock_irqrestore+0x1e/0x30
[1285568.771829] [<ffffffff81541c80>] ? _cond_resched+0x30/0x40
[1285568.771836] [<ffffffff81080726>] ? flush_workqueue+0x36/0x80
[1285568.771843] [<ffffffff810b5b34>] ? __stop_machine+0xf4/0x120
[1285568.771850] [<ffffffff8131eda0>] ? xen_suspend+0x0/0xf0
[1285568.771855] [<ffffffff810b5d8e>] ? s...

Updated to the new kernel release 2.6.32-26-generic

[1285568.771462] ------------[ cut here ]------------
[1285568.771473] kernel BUG at /build/buildd/linux-2.6.32/arch/x86/xen/spinlock.c:343!
[1285568.771486] invalid opcode: 0000 [#1] SMP 
[1285568.771500] last sysfs file: /sys/power/pm_trace
[1285568.771508] CPU 0 
[1285568.771517] Modules linked in: xenfs lp parport xen_netfront xen_blkfront
[1285568.771554] Pid: 41, comm: xenwatch Not tainted 2.6.32-26-generic #47-Ubuntu 
[1285568.771567] RIP: e030:[<ffffffff8100fca4>]  [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1285568.771590] RSP: e02b:ffff880003669e88  EFLAGS: 00010046
[1285568.771594] RAX: ffffffffff57b000 RBX: ffff88007fc1b060 RCX: 0000000000000000
[1285568.771600] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000001
[1285568.771605] RBP: ffff880003669e88 R08: 0000000000000000 R09: 0000000000000000
[1285568.771612] R10: ffff880003671028 R11: 0000000000012eb0 R12: 0000000000000000
[1285568.771617] R13: 0000000000000000 R14: 0000000000000001 R15: 0000000000000100
[1285568.771626] FS:  00007fae6666c700(0000) GS:ffff880003666000(0000) knlGS:0000000000000000
[1285568.771632] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[1285568.771637] CR2: 0000000000000000 CR3: 000000007d7f7000 CR4: 0000000000002660
[1285568.771645] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[1285568.771653] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[1285568.771663] Process xenwatch (pid: 41, threadinfo ffff88007d50c000, task ffff88007d4cadc0)
[1285568.771669] Stack:
[1285568.771672]  ffff880003669ed8 ffffffff810c4550 0000000000000000 0000000000000000
[1285568.771681] <0> 0000000000000000 ffffffff817b1300 0000000000000001 0000000000000000
[1285568.771690] <0> 0000000000000001 0000000000000100 ffff880003669ef8 ffffffff810c6882
[1285568.771702] Call Trace:
[1285568.771705]  <IRQ> 
[1285568.771713]  [<ffffffff810c4550>] handle_IRQ_event+0x60/0x170
[1285568.771723]  [<ffffffff810c6882>] handle_percpu_irq+0x42/0x80
[1285568.771734]  [<ffffffff81014d12>] handle_irq+0x22/0x30
[1285568.771743]  [<ffffffff8131da99>] xen_evtchn_do_upcall+0x199/0x1c0
[1285568.771749]  [<ffffffff8101333e>] xen_do_hypervisor_callback+0x1e/0x30
[1285568.771754]  <EOI> 
[1285568.771763]  [<ffffffff8100922a>] ? hypercall_page+0x22a/0x1010
[1285568.771769]  [<ffffffff8100922a>] ? hypercall_page+0x22a/0x1010
[1285568.771776]  [<ffffffff8100eb6d>] ? xen_force_evtchn_callback+0xd/0x10
[1285568.771783]  [<ffffffff8100f302>] ? check_events+0x12/0x20
[1285568.771791]  [<ffffffff8100f2a9>] ? xen_irq_enable_direct_end+0x0/0x7
[1285568.771800]  [<ffffffff810578b9>] ? finish_task_switch+0x59/0xe0
[1285568.771808]  [<ffffffff815417a8>] ? thread_return+0x48/0x420
[1285568.771815]  [<ffffffff8106240a>] ? __cond_resched+0x2a/0x40
[1285568.771823]  [<ffffffff81543e5e>] ? _spin_unlock_irqrestore+0x1e/0x30
[1285568.771829]  [<ffffffff81541c80>] ? _cond_resched+0x30/0x40
[1285568.771836]  [<ffffffff81080726>] ? flush_workqueue+0x36/0x80
[1285568.771843]  [<ffffffff810b5b34>] ? __stop_machine+0xf4/0x120
[1285568.771850]  [<ffffffff8131eda0>] ? xen_suspend+0x0/0xf0
[1285568.771855]  [<ffffffff810b5d8e>] ? stop_machine+0x3e/0x60
[1285568.771862]  [<ffffffff8131eb9a>] ? do_suspend+0x7a/0x120
[1285568.771868]  [<ffffffff8131ed49>] ? shutdown_handler+0x109/0x160
[1285568.771874]  [<ffffffff813200c2>] ? xenwatch_thread+0xc2/0x190
[1285568.771882]  [<ffffffff81084570>] ? autoremove_wake_function+0x0/0x40
[1285568.771888]  [<ffffffff81320000>] ? xenwatch_thread+0x0/0x190
[1285568.771894]  [<ffffffff810841f6>] ? kthread+0x96/0xa0
[1285568.771900]  [<ffffffff810131ea>] ? child_rip+0xa/0x20
[1285568.771906]  [<ffffffff810123d1>] ? int_ret_from_sys_call+0x7/0x1b
[1285568.771912]  [<ffffffff81012b5d>] ? retint_restore_args+0x5/0x6
[1285568.771918]  [<ffffffff810131e0>] ? child_rip+0x0/0x20
[1285568.771923] Code: 89 e5 c9 0f 95 c0 c3 55 b8 01 00 00 00 86 07 84 c0 48 89 e5 0f 94 c0 c9 0f b6 c0 c3 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 <0f> 0b eb fe 0f 1f 84 00 00 00 00 00 55 ba ff ff ff ff 48 89 e5 
[1285568.771994] RIP  [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1285568.772001]  RSP <ffff880003669e88>

I`m trying apport and all the kernel debug stuffs in Wiki, got a better stack trace.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-11-25:

#3

Using 2.6.32-26-server, I think is the same problem, but got in swapper process, anyways here goes:

[1286941.792445] ------------[ cut here ]------------
[1286941.792454] kernel BUG at /build/buildd/linux-2.6.32/arch/x86/xen/spinlock.c:343!
[1286941.792462] invalid opcode: 0000 [#1] SMP
[1286941.792469] last sysfs file: /sys/power/pm_trace
[1286941.792473] CPU 1
[1286941.792477] Modules linked in: xenfs lp xen_netfront parport xen_blkfront
[1286941.792490] Pid: 0, comm: swapper Not tainted 2.6.32-26-server #47-Ubuntu
[1286941.792495] RIP: e030:[<ffffffff8100fca4>] [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1286941.792507] RSP: e02b:ffff880003680e88 EFLAGS: 00010046
[1286941.792511] RAX: ffffffffff57b000 RBX: ffff88007fc1b2a0 RCX: 0000000000000000
[1286941.792520] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000007
[1286941.792527] RBP: ffff880003680e88 R08: 0000000000000000 R09: 0000000000000000
[1286941.792535] R10: ffff880003688028 R11: 0000000000012eb0 R12: 0000000000000000
[1286941.792541] R13: 0000000000000000 R14: 0000000000000007 R15: 0000000000000100
[1286941.792550] FS: 00007f639b1fa700(0000) GS:ffff88000367d000(0000) knlGS:0000000000000000
[1286941.792557] CS: e033 DS: 002b ES: 002b CR0: 000000008005003b
[1286941.792562] CR2: 0000000000000000 CR3: 0000000001001000 CR4: 0000000000002660
[1286941.792568] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[1286941.792574] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[1286941.792580] Process swapper (pid: 0, threadinfo ffff88007dfc8000, task ffff88007dfb2dc0)
[1286941.792585] Stack:
[1286941.792588] ffff880003680ed8 ffffffff810c4060 ffff880003680ec8 ffffffff8108e553
[1286941.792597] <0> ffff88000368d400 ffffffff817cb780 0000000000000007 0000000000000200
[1286941.792607] <0> 0000000000000001 0000000000000100 ffff880003680ef8 ffffffff810c6392
[1286941.792618] Call Trace:
[1286941.792622] <IRQ>
[1286941.792628] [<ffffffff810c4060>] handle_IRQ_event+0x60/0x170
[1286941.792637] [<ffffffff8108e553>] ? ktime_get+0x63/0xe0
[1286941.792643] [<ffffffff810c6392>] handle_percpu_irq+0x42/0x80
[1286941.792650] [<ffffffff81014d12>] handle_irq+0x22/0x30
[1286941.792658] [<ffffffff8131f189>] xen_evtchn_do_upcall+0x199/0x1c0
[1286941.792665] [<ffffffff8101333e>] xen_do_hypervisor_callback+0x1e/0x30
[1286941.792669] <EOI>
[1286941.792675] [<ffffffff810093aa>] ? hypercall_page+0x3aa/0x1010
[1286941.792681] [<ffffffff810093aa>] ? hypercall_page+0x3aa/0x1010
[1286941.792688] [<ffffffff8100ebd0>] ? xen_safe_halt+0x10/0x20
[1286941.792693] [<ffffffff8100c285>] ? xen_idle+0x35/0x50
[1286941.792699] [<ffffffff81010e63>] ? cpu_idle+0xb3/0x110
[1286941.792704] [<ffffffff8100f2a9>] ? xen_irq_enable_direct_end+0x0/0x7
[1286941.792711] [<ffffffff8154d2e5>] ? cpu_bringup_and_idle+0x13/0x15
[1286941.792716] Code: 89 e5 c9 0f 95 c0 c3 55 b8 01 00 00 00 86 07 84 c0 48 89 e5 0f 94 c0 c9 0f b6 c0 c3 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 <0f> 0b eb fe 0f 1f 84 00 00 00 00 00 55 ba ff ff ff ff 48 89 e5
[1286941.792788] RIP [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1286941.792794] RSP <ffff880003680e88>

Using 2.6.32-26-server, I think is the same problem, but got in swapper process, anyways here goes:

[1286941.792445] ------------[ cut here ]------------
[1286941.792454] kernel BUG at /build/buildd/linux-2.6.32/arch/x86/xen/spinlock.c:343!
[1286941.792462] invalid opcode: 0000 [#1] SMP 
[1286941.792469] last sysfs file: /sys/power/pm_trace
[1286941.792473] CPU 1 
[1286941.792477] Modules linked in: xenfs lp xen_netfront parport xen_blkfront
[1286941.792490] Pid: 0, comm: swapper Not tainted 2.6.32-26-server #47-Ubuntu 
[1286941.792495] RIP: e030:[<ffffffff8100fca4>]  [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1286941.792507] RSP: e02b:ffff880003680e88  EFLAGS: 00010046
[1286941.792511] RAX: ffffffffff57b000 RBX: ffff88007fc1b2a0 RCX: 0000000000000000
[1286941.792520] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000007
[1286941.792527] RBP: ffff880003680e88 R08: 0000000000000000 R09: 0000000000000000
[1286941.792535] R10: ffff880003688028 R11: 0000000000012eb0 R12: 0000000000000000
[1286941.792541] R13: 0000000000000000 R14: 0000000000000007 R15: 0000000000000100
[1286941.792550] FS:  00007f639b1fa700(0000) GS:ffff88000367d000(0000) knlGS:0000000000000000
[1286941.792557] CS:  e033 DS: 002b ES: 002b CR0: 000000008005003b
[1286941.792562] CR2: 0000000000000000 CR3: 0000000001001000 CR4: 0000000000002660
[1286941.792568] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[1286941.792574] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[1286941.792580] Process swapper (pid: 0, threadinfo ffff88007dfc8000, task ffff88007dfb2dc0)
[1286941.792585] Stack:
[1286941.792588]  ffff880003680ed8 ffffffff810c4060 ffff880003680ec8 ffffffff8108e553
[1286941.792597] <0> ffff88000368d400 ffffffff817cb780 0000000000000007 0000000000000200
[1286941.792607] <0> 0000000000000001 0000000000000100 ffff880003680ef8 ffffffff810c6392
[1286941.792618] Call Trace:
[1286941.792622]  <IRQ> 
[1286941.792628]  [<ffffffff810c4060>] handle_IRQ_event+0x60/0x170
[1286941.792637]  [<ffffffff8108e553>] ? ktime_get+0x63/0xe0
[1286941.792643]  [<ffffffff810c6392>] handle_percpu_irq+0x42/0x80
[1286941.792650]  [<ffffffff81014d12>] handle_irq+0x22/0x30
[1286941.792658]  [<ffffffff8131f189>] xen_evtchn_do_upcall+0x199/0x1c0
[1286941.792665]  [<ffffffff8101333e>] xen_do_hypervisor_callback+0x1e/0x30
[1286941.792669]  <EOI> 
[1286941.792675]  [<ffffffff810093aa>] ? hypercall_page+0x3aa/0x1010
[1286941.792681]  [<ffffffff810093aa>] ? hypercall_page+0x3aa/0x1010
[1286941.792688]  [<ffffffff8100ebd0>] ? xen_safe_halt+0x10/0x20
[1286941.792693]  [<ffffffff8100c285>] ? xen_idle+0x35/0x50
[1286941.792699]  [<ffffffff81010e63>] ? cpu_idle+0xb3/0x110
[1286941.792704]  [<ffffffff8100f2a9>] ? xen_irq_enable_direct_end+0x0/0x7
[1286941.792711]  [<ffffffff8154d2e5>] ? cpu_bringup_and_idle+0x13/0x15
[1286941.792716] Code: 89 e5 c9 0f 95 c0 c3 55 b8 01 00 00 00 86 07 84 c0 48 89 e5 0f 94 c0 c9 0f b6 c0 c3 66 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 <0f> 0b eb fe 0f 1f 84 00 00 00 00 00 55 ba ff ff ff ff 48 89 e5 
[1286941.792788] RIP  [<ffffffff8100fca4>] dummy_handler+0x4/0x10
[1286941.792794]  RSP <ffff880003680e88>

Revision history for this message

Jeremy Foshee (jeremyfoshee) wrote on 2010-12-01:

#4

Hi Luiz,

Please be sure to confirm this issue exists with the latest development release of Ubuntu. ISO CD images are available from http://cdimage.ubuntu.com/daily/current/ . If the issue remains, please run the following command from a Terminal (Applications->Accessories->Terminal). It will automatically gather and attach updated debug information to this report.

apport-collect -p linux 681083

Also, if you could test the latest upstream kernel available that would be great. It will allow additional upstream developers to examine the issue. Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag. This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text. Please let us know your results.

Thanks in advance.

[This is an automated message. Apologies if it has reached you inappropriately; please just reply to this message indicating so.]

tags:	added: needs-kernel-logs
tags:	added: needs-upstream-testing
tags:	added: kj-triage
Changed in linux (Ubuntu):
status:	New → Incomplete

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-12-01:

#5

Testing the PPA's kernel, before http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.32.16.5-lucid/ it works well, after this release 2.6.32.16.5 the problem appears.

tags:

removed: needs-upstream-testing

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-12-02:

#6

Testing the latest version (natty) I get:
xenfs: not registering filesystem on non-xen platform

So I cant do a migration between hosts to test this.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2010-12-16:

#7

Problem still happening in the XenServer 5.6 FP1 Beta

tags:

removed: kj-triage needs-kernel-logs

Revision history for this message

Muriel (tudamp) wrote on 2011-01-05:

#8

Hi all,
this is my experience with this bug.
Ubuntu server 10.04, x86_64, kernel from standard repo:
2.6.32.27 doesn't work
2.6.32.26 doesn't work
2.6.32.25 work unstable ( sometimes yes, often not)
2.6.32.24 and earlier works.

The problem is also present in mainline kernels:
2.6.32-0206321505 work
2.6.32-0206321606 work unstable
2.6.32-0206321709 work unstable
2.6.32-0206322210 doesn't boot
2.6.32-0206322310 doesn't boot
2.6.32-0206322411 doesn't work
2.6.32-0206322511 doesn't work
2.6.32-0206322611 doesn't work
2.6.32-0206322712 doesn't work

You need more logs to confirm the bug?

thanks for your work

Revision history for this message

Muriel (tudamp) wrote on 2011-01-18:

#9

events.c.patch Edit (1.1 KiB, text/plain)

This patch works for me, is derived from this thread http://xen.1045712.n5.nabble.com/PATCH-xen-events-do-not-unmask-polled-ipis-on-restore-td3241695.html

Andy Whitcroft (apw) on 2011-01-19

Changed in linux (Ubuntu):
status:	Incomplete → Triaged
tags:	added: regression-proposed

Revision history for this message

Stefan Bader (smb) wrote on 2011-01-19:

#10

For Natty the fix mentioned is included with 2.6.37 final:

commit 6903591f314b8947d0e362bda7715e90eb9df75e
Author: Ian Campbell <email address hidden>
Date: Mon Nov 1 16:30:09 2010 +0000

Now it needs to be backported to our ec2 topic branch for Lucid.
xen: events: do not unmask event channels on resume

Changed in linux (Ubuntu Lucid):
assignee:	nobody → Stefan Bader (stefan-bader-canonical)
importance:	Undecided → Medium
status:	New → Triaged
Changed in linux (Ubuntu):
importance:	Undecided → Medium
status:	Triaged → Fix Released

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-01-19:

#11

Hi Muriel,

Which processor are you using ?

It seems that the kernel doesnt crash, but still freezes at least for me.

Unable to get console output or IO problem causing the process hang warning.

But the wierd thing is that when I migrate to hostA -> hostB = console freeze.
HostB -> hostA = IO freeze but console still works and then after any IO request, it freezes turning into process hang.
These happens always.

I wonder, Why dont I get the HostB -> HostA AND HostA-> HostB the same problem....

Gonna look into the Xen mailling as well, maybe I`m getting a different error or something now.

Revision history for this message

Stefan Bader (smb) wrote on 2011-01-19:

#12

SRU Justification:

Impact: With the current ec2 kernels the kernel oops described in comment #3 is experienced as a result of enabling interrupts on the pv spinlock event channel.

Fix: The following patch is taken from upstream and is included in 2.6.37. It has been reported to successfully prevent the oops.

Testcase: Migration of a guest (using suspend)

Changed in linux (Ubuntu Lucid):
status:	Triaged → In Progress

Revision history for this message

Muriel (tudamp) wrote on 2011-01-20:

#13

Hi Luiz,
I have two different pools with two different processors
1) amd
when i migrate A -> B and B is the master of the pool the console (and the vm) freeze;
when i migrate B -> A and B is the master all works fine (with the patch);
when i migrate B -> C and all are members all works fine (with the patch;)

2) intel
all works fine with the patch

Ian Campbell says it could be a problem on my master: You have had the same problem with the same processor?
------------

Stefan,
the patch that i proposed is the Ian's patch changed for the ubuntu kernel 2.6.32-27.49. The only differences are in the row on which make the change.

Stefan Bader (smb) on 2011-01-21

Changed in linux (Ubuntu Lucid):
status:	In Progress → Fix Committed

Steve Conklin (sconklin) on 2011-02-04

tags:

added: verification-needed-lucid

Andy Whitcroft (apw) on 2011-02-11

tags:

added: regression-update
removed: regression-proposed

Revision history for this message

Bjorn Meijer (bm-realtimesolutions) wrote on 2011-03-15:

#14

Is there any news on this?

Revision history for this message

Stefan Bader (smb) wrote on 2011-03-15:

#15

Apparently we got hit by confusion. Usually Xen in Lucid means the ec2 topic branch. But in this case this is the generic kernel. Actually the patch has no effect when applied to the ec2 topic branch as the file does exist but is completely ignored in that build.
So at the moment it was only applied to the topic branch without effect and we need to pull it out to the master branch.

Revision history for this message

Muriel (tudamp) wrote on 2011-03-29:

#16

No news?

Revision history for this message

Guilherme M. Schroeder (lero420) wrote on 2011-03-29:

#17

I've done a test with Debian 6.0 and it works fine (no crash/lost network connection) on vmotion with XenServer 5.6 FP1.
Debian included this patch in their last kernel:

[ Ian Campbell ]
* xen: blkback: fix potential leak of kernel thread. (CVE-2010-3699)

Their prior last kernel linux-2.6 (2.6.32-30) didn't work.
Their actual kernel linux-2.6 (2.6.32-31), that includes this patch, works.

I've tested this Debian kernel with Ubuntu 10.04 and it works fine (no crash/lost network connection) on vmotion.

Can this patch be applied to the Ubuntu 10.04 kernel?

description:

updated

Revision history for this message

Fábio Gomes dos Santos (supergrilo) wrote on 2011-04-01:

#18

I have the same problem. Anyone got the solution?

Revision history for this message

Bjorn Meijer (bm-realtimesolutions) wrote on 2011-04-04:

#19

There is no solution yet I think, maybe in the next kernel release.

Although I have tried this kernel which does allow xen motion(like posted above):
http://kernel.ubuntu.com/~kernel-ppa/mainline/v2.6.33.5-lucid/linux-image-2.6.33-02063305-generic_2.6.33-02063305_amd64.deb
It works well although it probably has a few issues with certain applications.

Revision history for this message

Davim (davim) wrote on 2011-04-21:

#20

I have no problem moving VMs around but if I suspend one Ubuntu VM it freezes on resume...

Revision history for this message

Martin Pitt (pitti) wrote on 2011-04-25: Please test proposed package

#21

Accepted linux into lucid-proposed, the package will build now and be available in a few hours. Please test and give feedback here. See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you in advance!

Revision history for this message

jnoer (jnoer) wrote on 2011-04-26:

#22

Installed linux-image-2.6.32-32-server and migrated slave-master-slave and I didn't see any problems. Seem to work fine now.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-04-26:

#23

2.6.32-32-server #62-Ubuntu SMP Wed Apr 20 22:07:43 UTC 2011 x86_64 GNU/Linux

Working fine !!! Nice.

Gonna try more test, but simple migrations seems to work fine.

Tks

Martin Pitt (pitti) on 2011-04-26

tags:

added: verification-done-lucid
removed: verification-needed-lucid

Revision history for this message

Alain St-Denis (alain-st-denis) wrote on 2011-05-05:

#24

I installed 2.6.32-32-server on our lucid guests running under xenserver 5.6 fp1 and the migration now works randomly. So there is some progress because before that it wasn't working at all. When it doesn't work, the same xenwatch backtrace is shown. SLES 11 SP1 guests migrate flawlessly...

Revision history for this message

Davim (davim) wrote on 2011-05-08:

#25

Same problem here, most of the times it works but some times it crashes...
I would say it's crashing one out of 5 migrations...

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-05-08:

#26

Yea, seems that it continuing but sometimes it works.

First it seems that were crashing only when I migrate to the master.

Then it seems more random BUT for example. If I migrate to the destiny and that migration failed, all migrations for that same destiny will fail.

Like in some of my pools I have 1 "bugged" host, I can migrate from him and to other hosts, but if I try to migrate to that "bugged" host it will fail.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-05-13:

#27

I dont know if its the same bugs or another bug here its the stack trace on the ocasional failures:

[6368472.738379] Call Trace:
[6368472.738389] [<ffffffff81041abe>] ? pick_next_task_fair+0xca/0xd6
[6368472.738395] [<ffffffff812fae40>] ? thread_return+0x79/0xe0
[6368472.738401] [<ffffffff8100e160>] ? xen_vcpuop_set_next_event+0x0/0x60
[6368472.738405] [<ffffffff812fb1fd>] ? schedule_timeout+0x2e/0xdd
[6368472.738408] [<ffffffff8100e22f>] ? xen_restore_fl_direct_end+0x0/0x1
[6368472.738412] [<ffffffff812fc1da>] ? _spin_unlock_irqrestore+0xd/0xe
[6368472.738418] [<ffffffff810ad168>] ? cpupri_set+0x10c/0x135
[6368472.738425] [<ffffffff8121cc16>] ? serial8250_resume+0x0/0x3a
[6368472.738428] [<ffffffff812fb0b4>] ? wait_for_common+0xde/0x15b
[6368472.738433] [<ffffffff8104a42f>] ? default_wake_function+0x0/0x9
[6368472.738439] [<ffffffff81061b0c>] ? flush_cpu_workqueue+0x5e/0x75
[6368472.738442] [<ffffffff81064c2e>] ? kthread_stop+0x5d/0xa2
[6368472.738446] [<ffffffff81061b67>] ? cleanup_workqueue_thread+0x44/0x51
[6368472.738449] [<ffffffff81061c0e>] ? destroy_workqueue+0x76/0xad
[6368472.738454] [<ffffffff8108ad7f>] ? stop_machine_destroy+0x2e/0x47
[6368472.738458] [<ffffffff811efd2d>] ? shutdown_handler+0x230/0x25c
[6368472.738462] [<ffffffff812fb776>] ? mutex_lock+0xd/0x31
[6368472.738465] [<ffffffff811f1038>] ? xenwatch_thread+0x117/0x14a
[6368472.738469] [<ffffffff81064e96>] ? autoremove_wake_function+0x0/0x2e
[6368472.738472] [<ffffffff811f0f21>] ? xenwatch_thread+0x0/0x14a
[6368472.738474] [<ffffffff81064bc9>] ? kthread+0x79/0x81
[6368472.738479] [<ffffffff81011baa>] ? child_rip+0xa/0x20
[6368472.738482] [<ffffffff81010d61>] ? int_ret_from_sys_call+0x7/0x1b
[6368472.738485] [<ffffffff8101151d>] ? retint_restore_args+0x5/0x6
[6368472.738489] [<ffffffff8100e22f>] ? xen_restore_fl_direct_end+0x0/0x1
[6368472.738492] [<ffffffff81011ba0>] ? child_rip+0x0/0x20

But after some time it gets back to normal.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-05-13:

#28

Ummm... Disregard that stack trace... I was using the Debian 6... =/

Sorry.

Revision history for this message

Davim (davim) wrote on 2011-05-13:

#29

What do you mean by "bugged" host?
Hat is the problem with that host?
Reinstalling the host would solve the problem?
I have a pool with 10 hosts and all of my linux VMs (about 20VMs) are running the same ubuntu kernel (2.6.32-31-generic-pae) and some of them always crash on migration, some never crashed and some only crashed a few times...

I think that the VMs with most memory usage are the ones that crash more often.

Revision history for this message

Davim (davim) wrote on 2011-05-18:

#30

I've just tested the 2.6.35-25-generic-pae kernel on one of my VMs that always crash on xenmotion and the problem is the same :(

Does anyone have a solution for this???

Revision history for this message

Stefan Bader (smb) wrote on 2011-05-18:

#31

Seems the patch we added for 2.6.32 (Lucid) is still missing for 2.6.35 (Maverick). I will get it SRUed there as well.

Changed in linux (Ubuntu Maverick):
assignee:	nobody → Stefan Bader (stefan-bader-canonical)
importance:	Undecided → Medium
status:	New → In Progress

Revision history for this message

Davim (davim) wrote on 2011-05-18:

#32

But the 2.6.32 is not working either...
Before testing the 2.6.35 I was on the 2.6.32-31-generic-pae.

Revision history for this message

Davim (davim) wrote on 2011-05-18:

#33

Is there any other information I can add to help determine the cause/solution of this problem?

Revision history for this message

Stefan Bader (smb) wrote on 2011-05-18:

#34

It seems that there are probably two problems involved. On .32 some people were seeing improvement. But others still see a crash that seemed to look a bit different. So at least having that fix in .35 as well sounds reasonable. And maybe there no other problem exists and the remaining other problem only is in .32.

Revision history for this message

Koszta, Tamas (tamas-koszta) wrote on 2011-05-19:

#35

There is an article in Citrix knowledge center about this problem. Their workaround is to install natty's backported kernel, which is based on 2.6.38.
I've tested it on 5.6FP1 with 64bit Lucid guest, and works fine, every migration completed successfully.
However it would be nice if the official lucid kernel could do the same.

Revision history for this message

Koszta, Tamas (tamas-koszta) wrote on 2011-05-19:

#36

the Citrix link: http://support.citrix.com/article/CTX129472

Revision history for this message

Stefan Bader (smb) wrote on 2011-05-19:

#37

Davim, I placed some 2.6.35 kernels that include the proposed patch at http://people.canonical.com/~smb/lp681083/. If you could try one of those and let me know the result. So we can decide whether this is an issue with some setups over all releases or this is a secondary issue only with Lucid. Thanks.

Revision history for this message

Davim (davim) wrote on 2011-05-26:

#38

Thanks Stefan I haven't been able to test those kernels yet, I intend to test them until the end of this week...

I've noticed that the problem does not only occur on xenmotion but also on suspend/resume.

I will get back to you as soon as I have the chance to test those kernels.

Revision history for this message

Davim (davim) wrote on 2011-05-26:

#39

Success!!!

I've just tested the kernels provided by Stefan and they solve the problem :)

The test I made was:

* Installed a new Ubuntu 10.04.2 VM (net install) with two vCPUs
* Tried to migrate the VM to another Xenserver and confirmed it crashed
* Downloaded and installed the generic-pae kernels provided by Stefan and rebooted the VM into the new kernel.
* Migrated the VM around several hosts, including the master, and the VM never crashed.

Now the question is, when will we see this kernels as official for the Ubuntu 10.04.2 LTS ??

Thanks Stefan.

Revision history for this message

Bjorn Meijer (bm-realtimesolutions) wrote on 2011-05-27:

#40

Works for me.

I've migrated a test VM from and to the pool master a couple of times.
Everything still seems to work like it should.

Hoping to see this in the official kernel soon.

Revision history for this message

Stefan Bader (smb) wrote on 2011-05-27:

#41

Well, the kernels supplied are 2.6.35 ones (10.10 Maverick). And the patch is queued for the next update (after a currently pending one). The same patch is in the currently pending 2.6.32 (10.04 Lucid) kernel too. Just the feeling there from some feedback is that there might still be some (other) issue. Both updates are not yet in the normal place (for Lucid you would need to enable proposed to get it) but if there is still a problem with the proposed version of Lucid this will continue to be one because that then needs something more. But I would wait for this patch to get out on both, then check the fallout and probably work on the follow up in a new bug report.

Revision history for this message

Launchpad Janitor (janitor) wrote on 2011-05-30:

#42

Download full text (12.8 KiB)

This bug was fixed in the package linux - 2.6.32-32.62

---------------
linux (2.6.32-32.62) lucid-proposed; urgency=low

[ Brad Figg ]

* Release Tracking Bug
- LP: #767370

[ Stefan Bader ]

* (config) Disable CONFIG_NET_NS
- LP: #720095

[ Upstream Kernel Changes ]

  * Revert "drm/radeon/kms: Fix retrying ttm_bo_init() after it failed
    once."
    - LP: #736234
  * Revert "drm/radeon: fall back to GTT if bo creation/validation in VRAM
    fails."
    - LP: #736234
  * x86: pvclock: Move scale_delta into common header
  * KVM: x86: Fix a possible backwards warp of kvmclock
  * KVM: x86: Fix kvmclock bug
  * cpuset: add a missing unlock in cpuset_write_resmask()
    - LP: #736234
  * keyboard: integer underflow bug
    - LP: #736234
  * RxRPC: Fix v1 keys
    - LP: #736234
  * ixgbe: fix for 82599 erratum on Header Splitting
    - LP: #736234
  * mm: fix possible cause of a page_mapped BUG
    - LP: #736234
  * powerpc/kdump: CPUs assume the context of the oopsing CPU
    - LP: #736234
  * powerpc/kdump: Use chip->shutdown to disable IRQs
    - LP: #736234
  * powerpc: Use more accurate limit for first segment memory allocations
    - LP: #736234
  * powerpc/pseries: Add hcall to read 4 ptes at a time in real mode
    - LP: #736234
  * powerpc/kexec: Speedup kexec hash PTE tear down
    - LP: #736234
  * powerpc/crashdump: Do not fail on NULL pointer dereferencing
    - LP: #736234
  * powerpc/kexec: Fix orphaned offline CPUs across kexec
    - LP: #736234
  * netfilter: nf_log: avoid oops in (un)bind with invalid nfproto values
    - LP: #736234
  * nfsd: wrong index used in inner loop
    - LP: #736234
  * r8169: use RxFIFO overflow workaround for 8168c chipset.
    - LP: #736234
  * Staging: comedi: jr3_pci: Don't ioremap too much space. Check result.
    - LP: #736234
  * net: don't allow CAP_NET_ADMIN to load non-netdev kernel modules,
    CVE-2011-1019
    - LP: #736234
    - CVE-2011-1019
  * ip6ip6: autoload ip6 tunnel
    - LP: #736234
  * Linux 2.6.32.33
    - LP: #736234
  * drm/radeon: fall back to GTT if bo creation/validation in VRAM fails.
    - LP: #652934, #736234
  * drm/radeon/kms: Fix retrying ttm_bo_init() after it failed once.
    - LP: #652934, #736234
  * drm: fix unsigned vs signed comparison issue in modeset ctl ioctl,
    CVE-2011-1013
    - LP: #736234
    - CVE-2011-1013
  * Linux 2.6.32.33+drm33.15
    - LP: #736234
  * econet: Fix crash in aun_incoming(). CVE-2010-4342
    - LP: #736394
    - CVE-2010-4342
  * igb: only use vlan_gro_receive if vlans are registered, CVE-2010-4263
    - LP: #737024
    - CVE-2010-4263
  * irda: prevent integer underflow in IRLMP_ENUMDEVICES, CVE-2010-4529
    - LP: #737823
    - CVE-2010-4529
  * hwmon/f71882fg: Set platform drvdata to NULL later
    - LP: #742056
  * mtd: add "platform:" prefix for platform modalias
    - LP: #742056
  * libata: no special completion processing for EH commands
    - LP: #742056
  * MIPS: MTX-1: Make au1000_eth probe all PHY addresses
    - LP: #742056
  * x86/mm: Handle mm_fault_error() in kernel space
    - LP: #742056
  * ftrace: Fix memory leak with function graph and cpu hotplug
    - LP: #742056
  * x86: Fix panic when ...

This bug was fixed in the package linux - 2.6.32-32.62

---------------
linux (2.6.32-32.62) lucid-proposed; urgency=low

[ Brad Figg ]

* Release Tracking Bug
    - LP: #767370

[ Stefan Bader ]

* (config) Disable CONFIG_NET_NS
    - LP: #720095

[ Upstream Kernel Changes ]

* Revert "drm/radeon/kms: Fix retrying ttm_bo_init() after it failed
    once."
    - LP: #736234
  * Revert "drm/radeon: fall back to GTT if bo creation/validation in VRAM
    fails."
    - LP: #736234
  * x86: pvclock: Move scale_delta into common header
  * KVM: x86: Fix a possible backwards warp of kvmclock
  * KVM: x86: Fix kvmclock bug
  * cpuset: add a missing unlock in cpuset_write_resmask()
    - LP: #736234
  * keyboard: integer underflow bug
    - LP: #736234
  * RxRPC: Fix v1 keys
    - LP: #736234
  * ixgbe: fix for 82599 erratum on Header Splitting
    - LP: #736234
  * mm: fix possible cause of a page_mapped BUG
    - LP: #736234
  * powerpc/kdump: CPUs assume the context of the oopsing CPU
    - LP: #736234
  * powerpc/kdump: Use chip->shutdown to disable IRQs
    - LP: #736234
  * powerpc: Use more accurate limit for first segment memory allocations
    - LP: #736234
  * powerpc/pseries: Add hcall to read 4 ptes at a time in real mode
    - LP: #736234
  * powerpc/kexec: Speedup kexec hash PTE tear down
    - LP: #736234
  * powerpc/crashdump: Do not fail on NULL pointer dereferencing
    - LP: #736234
  * powerpc/kexec: Fix orphaned offline CPUs across kexec
    - LP: #736234
  * netfilter: nf_log: avoid oops in (un)bind with invalid nfproto values
    - LP: #736234
  * nfsd: wrong index used in inner loop
    - LP: #736234
  * r8169: use RxFIFO overflow workaround for 8168c chipset.
    - LP: #736234
  * Staging: comedi: jr3_pci: Don't ioremap too much space. Check result.
    - LP: #736234
  * net: don't allow CAP_NET_ADMIN to load non-netdev kernel modules,
    CVE-2011-1019
    - LP: #736234
    - CVE-2011-1019
  * ip6ip6: autoload ip6 tunnel
    - LP: #736234
  * Linux 2.6.32.33
    - LP: #736234
  * drm/radeon: fall back to GTT if bo creation/validation in VRAM fails.
    - LP: #652934, #736234
  * drm/radeon/kms: Fix retrying ttm_bo_init() after it failed once.
    - LP: #652934, #736234
  * drm: fix unsigned vs signed comparison issue in modeset ctl ioctl,
    CVE-2011-1013
    - LP: #736234
    - CVE-2011-1013
  * Linux 2.6.32.33+drm33.15
    - LP: #736234
  * econet: Fix crash in aun_incoming(). CVE-2010-4342
    - LP: #736394
    - CVE-2010-4342
  * igb: only use vlan_gro_receive if vlans are registered, CVE-2010-4263
    - LP: #737024
    - CVE-2010-4263
  * irda: prevent integer underflow in IRLMP_ENUMDEVICES, CVE-2010-4529
    - LP: #737823
    - CVE-2010-4529
  * hwmon/f71882fg: Set platform drvdata to NULL later
    - LP: #742056
  * mtd: add "platform:" prefix for platform modalias
    - LP: #742056
  * libata: no special completion processing for EH commands
    - LP: #742056
  * MIPS: MTX-1: Make au1000_eth probe all PHY addresses
    - LP: #742056
  * x86/mm: Handle mm_fault_error() in kernel space
    - LP: #742056
  * ftrace: Fix memory leak with function graph and cpu hotplug
    - LP: #742056
  * x86: Fix panic when handling "mem={invalid}" param
    - LP: #553464, #742056
  * x86: Emit "mem=nopentium ignored" warning when not supported
    - LP: #553464, #742056
  * ahci: AHCI and RAID mode SATA patch for Intel Patsburg DeviceIDs
    - LP: #742056
  * ahci: AHCI mode SATA patch for Intel DH89xxCC DeviceIDs
    - LP: #742056
  * ahci: AHCI mode SATA patch for Intel Patsburg SATA RAID controller
    - LP: #742056
  * RDMA/cma: Fix crash in request handlers
    - LP: #742056
  * IB/cm: Bump reference count on cm_id before invoking callback
    - LP: #742056
  * ath9k_hw: Fix incorrect macversion and macrev checks
    - LP: #742056
  * USB: serial/kobil_sct, fix potential tty NULL dereference
    - LP: #742056
  * USB: serial: ch341: add new id
    - LP: #742056
  * xhci: Fix cycle bit calculation during stall handling.
    - LP: #742056
  * ALSA: hda - fix digital mic selection in mixer on 92HD8X codecs
    - LP: #742056
  * PCI: remove quirk for pre-production systems
    - LP: #742056
  * PCI: add more checking to ICH region quirks
    - LP: #742056
  * PCI: do not create quirk I/O regions below PCIBIOS_MIN_IO for ICH
    - LP: #742056
  * PCI: sysfs: Fix failure path for addition of "vpd" attribute
    - LP: #742056
  * ALSA: ctxfi - Fix incorrect SPDIF status bit mask
    - LP: #742056
  * ALSA: ctxfi - Fix SPDIF status retrieval
    - LP: #742056
  * ALSA: ctxfi - Clear input settings before initialization
    - LP: #742056
  * SUNRPC: Ensure we always run the tk_callback before tk_action
    - LP: #742056
  * perf, powerpc: Handle events that raise an exception without
    overflowing
    - LP: #742056
  * ext3: Always set dx_node's fake_dirent explicitly.
    - LP: #742056
  * call_function_many: fix list delete vs add race
    - LP: #742056
  * call_function_many: add missing ordering
    - LP: #742056
  * x86: Flush TLB if PGD entry is changed in i386 PAE mode
    - LP: #742056
  * isdn: avoid calling tty_ldisc_flush() in atomic context
    - LP: #742056
  * smp_call_function_many: handle concurrent clearing of mask
    - LP: #742056
  * fix per-cpu flag problem in the cpu affinity checkers
    - LP: #742056
  * i2c: Fix typo in instantiating-devices document
    - LP: #742056
  * mmc: sdio: remember new card RCA when redetecting card
    - LP: #742056
  * powerpc/kexec: Fix race in kexec shutdown
    - LP: #742056
  * powerpc/kdump: Fix race in kdump shutdown
    - LP: #742056
  * powerpc: rtas_flash needs to use rtas_data_buf
    - LP: #742056
  * x86, binutils, xen: Fix another wrong size directive
    - LP: #742056
  * hwmon: (sht15) Fix integer overflow in humidity calculation
    - LP: #742056
  * Linux 2.6.32.34
    - LP: #742056
  * Linux 2.6.32.35
    - LP: #742056
  * aio: wake all waiters when destroying ctx
    - LP: #744921
  * shmem: let shared anonymous be nonlinear again
    - LP: #744921
  * PCI hotplug: acpiphp: set current_state to D0 in register_slot
    - LP: #744921
  * xen: set max_pfn_mapped to the last pfn mapped
    - LP: #744921
  * PCI: return correct value when writing to the "reset" attribute
    - LP: #744921
  * Prevent rt_sigqueueinfo and rt_tgsigqueueinfo from spoofing the signal
    code
    - LP: #744921
  * ext3: skip orphan cleanup on rocompat fs
    - LP: #744921
  * procfs: fix /proc/<pid>/maps heap check
    - LP: #744921
  * proc: protect mm start_code/end_code in /proc/pid/stat, CVE-2011-0726
    - LP: #744921
    - CVE-2011-0726
  * fbcon: Bugfix soft cursor detection in Tile Blitting
    - LP: #744921
  * nfsd41: modify the members value of nfsd4_op_flags
    - LP: #744921
  * nfsd: wrong index used in inner loop
    - LP: #744921
  * uvcvideo: Fix uvc_fixup_video_ctrl() format search
    - LP: #744921
  * ehci-hcd: Bug fix: don't set a QH's Halt bit
    - LP: #744921
  * USB: uss720 fixup refcount position
    - LP: #744921
  * USB: cdc-acm: fix memory corruption / panic
    - LP: #744921
  * USB: cdc-acm: fix potential null-pointer dereference
    - LP: #744921
  * USB: cdc-acm: fix potential null-pointer dereference on disconnect
    - LP: #744921
  * Input: xen-kbdfront - advertise either absolute or relative coordinates
    - LP: #744921
  * SUNRPC: Never reuse the socket port after an xs_close()
    - LP: #744921
  * fs: call security_d_instantiate in d_obtain_alias V2
    - LP: #744921
  * dcdbas: force SMI to happen when expected
    - LP: #744921
  * Linux 2.6.32.36
    - LP: #744921
  * drm/radeon/kms: check AA resolve registers on r300, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * drm/radeon: fix regression with AA resolve checking, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * xen: events: do not unmask event channels on resume
    - LP: #681083
  * drm/radeon/kms: check AA resolve registers on r300
    - LP: #754584
  * drm/radeon: fix regression with AA resolve checking
    - LP: #754584
  * Linux 2.6.32.36+drm33.16
    - LP: #754584
  * ALSA: hda - Fix SPDIF out regression on ALC889
    - LP: #764685
  * ALSA: Fix yet another race in disconnection
    - LP: #764685
  * perf: Better fit max unprivileged mlock pages for tools needs
    - LP: #764685
  * myri10ge: fix rmmod crash
    - LP: #764685
  * cciss: fix lost command issue
    - LP: #764685
  * sound/oss/opl3: validate voice and channel indexes
    - LP: #764685
  * mac80211: initialize sta->last_rx in sta_info_alloc
    - LP: #764685
  * ses: show devices for enclosures with no page 7
    - LP: #764685
  * ses: Avoid kernel panic when lun 0 is not mapped
    - LP: #764685
  * eCryptfs: Unlock page in write_begin error path
    - LP: #764685
  * eCryptfs: ecryptfs_keyring_auth_tok_for_sig() bug fix
    - LP: #764685
  * staging: usbip: bugfixes related to kthread conversion
    - LP: #764685
  * staging: usbip: bugfix add number of packets for isochronous frames
    - LP: #764685
  * staging: usbip: bugfix for isochronous packets and optimization
    - LP: #764685
  * staging: hv: Fix GARP not sent after Quick Migration
    - LP: #764685
  * staging: hv: use sync_bitops when interacting with the hypervisor
    - LP: #764685
  * Relax si_code check in rt_sigqueueinfo and rt_tgsigqueueinfo
    - LP: #764685
  * xfs: prevent leaking uninitialized stack memory in FSGEOMETRY_V1
    - LP: #764685
  * irda: validate peer name and attribute lengths
    - LP: #764685
  * irda: prevent heap corruption on invalid nickname
    - LP: #764685
  * nilfs2: fix data loss in mmap page write for hole blocks
    - LP: #764685
  * ASoC: Explicitly say registerless widgets have no register
    - LP: #764685
  * ALSA: ens1371: fix Creative Ectiva support
    - LP: #764685
  * ROSE: prevent heap corruption with bad facilities
    - LP: #764685
  * Btrfs: Fix uninitialized root flags for subvolumes
    - LP: #764685
  * x86, mtrr, pat: Fix one cpu getting out of sync during resume
    - LP: #764685
  * ath9k: fix a chip wakeup related crash in ath9k_start
    - LP: #764685
  * UBIFS: do not read flash unnecessarily
    - LP: #764685
  * UBIFS: fix oops on error path in read_pnode
    - LP: #764685
  * UBIFS: fix debugging failure in dbg_check_space_info
    - LP: #764685
  * quota: Don't write quota info in dquot_commit()
    - LP: #764685
  * mm: avoid wrapping vm_pgoff in mremap()
    - LP: #764685
  * p54usb: IDs for two new devices
    - LP: #764685
  * b43: allocate receive buffers big enough for max frame len + offset
    - LP: #764685
  * Bluetooth: sco: fix information leak to userspace
    - LP: #764685
  * bridge: netfilter: fix information leak
    - LP: #764685
  * Bluetooth: bnep: fix buffer overflow
    - LP: #764685
  * Bluetooth: add support for Apple MacBook Pro 8,2
    - LP: #764685
  * Treat writes as new when holes span across page boundaries
    - LP: #764685
  * char/tpm: Fix unitialized usage of data buffer
    - LP: #764685
  * netfilter: ip_tables: fix infoleak to userspace
    - LP: #764685
  * netfilter: arp_tables: fix infoleak to userspace
    - LP: #764685
  * netfilter: ipt_CLUSTERIP: fix buffer overflow
    - LP: #764685
  * ipv6: netfilter: ip6_tables: fix infoleak to userspace
    - LP: #764685
  * mfd: ab3100: world-writable debugfs *_priv files
    - LP: #764685
  * drivers/rtc/rtc-ds1511.c: world-writable sysfs nvram file
    - LP: #764685
  * drivers/misc/ep93xx_pwm.c: world-writable sysfs files
    - LP: #764685
  * econet: 4 byte infoleak to the network
    - LP: #764685
  * sound/oss: remove offset from load_patch callbacks
    - LP: #764685
  * sound: oss: midi_synth: check get_user() return value
    - LP: #764685
  * repair gdbstub to match the gdbserial protocol specification
    - LP: #764685
  * gro: Reset dev pointer on reuse
    - LP: #764685
  * gro: reset skb_iif on reuse
    - LP: #764685
  * x86, amd-ucode: Remove needless log messages
    - LP: #764685
  * x86, microcode, AMD: Extend ucode size verification
    - LP: #764685
  * powerpc/kexec: Add ifdef CONFIG_PPC_STD_MMU_64 to PPC64 code
    - LP: #764685
  * powerpc: Fix default_machine_crash_shutdown #ifdef botch
    - LP: #764685
  * Squashfs: handle corruption of directory structure
    - LP: #764685
  * sctp: fix to calc the INIT/INIT-ACK chunk length correctly is set
    - LP: #764685
  * atm/solos-pci: Don't include frame pseudo-header on transmit hex-dump
    - LP: #764685
  * ext4: fix credits computing for indirect mapped files
    - LP: #764685
  * nfsd: fix auth_domain reference leak on nlm operations
    - LP: #764685
  * CAN: Use inode instead of kernel address for /proc file
    - LP: #764685
  * exec: make argv/envp memory visible to oom-killer
    - LP: #764685
  * exec: copy-and-paste the fixes into compat_do_execve() paths
    - LP: #764685
  * xfs: zero proper structure size for geometry calls
    - LP: #764685
  * Linux 2.6.32.37
    - LP: #764685
  * Linux 2.6.32.38
    - LP: #764685
 -- Brad Figg <brad.figg@canonical.com>   Wed, 20 Apr 2011 08:28:25 -0700

Changed in linux (Ubuntu Lucid):
status:	Fix Committed → Fix Released

Revision history for this message

Davim (davim) wrote on 2011-06-01:

#43

The kernel released on this fix as a nastier bug tha causes the VMs to freeze on boot if they're configured with more than 512M of RAM, see this bug report:

https://bugs.launchpad.net/ubuntu/+source/linux/+bug/790747

Revision history for this message

Herton R. Krzesinski (herton) wrote on 2011-06-13:

#44

This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-needed-maverick' to 'verification-done-maverick'.

If verification is not done by one week from today, this fix will be dropped from the source code, and this bug will be closed.

See https://wiki.ubuntu.com/Testing/EnableProposed for documentation how to enable and use -proposed. Thank you!

tags:

added: verification-needed-maverick

Revision history for this message

Steve Conklin (sconklin) wrote on 2011-06-14:

#45

Note that comment #43 only applies to Lucid and not to maverick. So we are still awaiting verification that the problem is resolved in Maverick by the kernel in -proposed.

Thanks

Revision history for this message

Steve Conklin (sconklin) wrote on 2011-06-15:

#46

The verification period for this kernel will close soon, and we have not received verification that the problem is resolved in Maverick. Please test and post results to this bug or the fix will be reverted from the release.

Revision history for this message

Stefan Bader (smb) wrote on 2011-06-16:

#47

It is probably stretching things a bit, but Davim sort of does a verification for Maverick kernels in comment #39 as he installed the kernels that I did provide in comment #37 into a Lucid installation (and I only provided 2.6.35 kernels).

Revision history for this message

BlueT - Matthew Lien - 練喆明 (bluet) wrote on 2011-06-17:

#48

The bug got fixed by applying the HowTo in Citrix's website as mentioned at #36 in this thread.
The Lucid kernel 2.6.32-33.66 which released few days ago didn't do any help, but 2.6.38 solved this.

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2011-06-17:

#49

Download full text (6.4 KiB)

Okay, here goes my tests:

2.6.35-28-server #50-Ubuntu SMP Fri Mar 18 18:59:25 UTC 2011 x86_64 GNU/Linux
[2819125.429932] ------------[ cut here ]------------
[2819125.429943] kernel BUG at /build/buildd/linux-2.6.35/arch/x86/xen/spinlock.c:344!
[2819125.429950] invalid opcode: 0000 [#1] SMP
[2819125.429956] last sysfs file: /sys/kernel/uevent_seqnum
[2819125.429961] CPU 3
[2819125.429964] Modules linked in: xenfs lp parport xen_netfront xen_blkfront
[2819125.429977]
[2819125.429981] Pid: 12, comm: migration/3 Not tainted 2.6.35-28-server #50-Ubuntu /
[2819125.429987] RIP: e030:[<ffffffff81007d04>] [<ffffffff81007d04>] dummy_handler+0x4/0x10
[2819125.430000] RSP: e02b:ffff880003f60ea8 EFLAGS: 00010046
[2819125.430005] RAX: ffffffffff57b000 RBX: ffff88001fc1a8a0 RCX: 0000000000000000
[2819125.430011] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000013
[2819125.430016] RBP: ffff880003f60ea8 R08: 0000000000000600 R09: 0000000000000000
[2819125.430022] R10: ffff880003f68028 R11: 0000000000012ed0 R12: 0000000000000000
[2819125.430028] R13: 0000000000000000 R14: 0000000000000013 R15: 0000000000000100
[2819125.430038] FS: 00007f2137153700(0000) GS:ffff880003f5d000(0000) knlGS:0000000000000000
[2819125.430045] CS: e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[2819125.430050] CR2: 0000000000000000 CR3: 0000000003acf000 CR4: 0000000000002660
[2819125.430057] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[2819125.430063] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[2819125.430069] Process migration/3 (pid: 12, threadinfo ffff88001fdd2000, task ffff88001fd9dbc0)
[2819125.430076] Stack:
[2819125.430079] ffff880003f60ef8 ffffffff810ca310 0000000000000000 000000000072f59a
[2819125.430088] <0> ffff880003f60ef8 ffff88001fc05900 0000000000000013 0000000000000600
[2819125.430099] <0> 0000000000000001 0000000000000100 ffff880003f60f18 ffffffff810cca52
[2819125.430111] Call Trace:
[2819125.430115] <IRQ>
[2819125.430123] [<ffffffff810ca310>] handle_IRQ_event+0x50/0x160
[2819125.430130] [<ffffffff810cca52>] handle_percpu_irq+0x42/0x80
[2819125.430139] [<ffffffff81348706>] xen_evtchn_do_upcall+0x1d6/0x200
[2819125.430147] [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.430154] [<ffffffff8100b02e>] xen_do_hypervisor_callback+0x1e/0x30
[2819125.430159] <EOI>
[2819125.430164] [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.430172] [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.430179] [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.430186] [<ffffffff81006b9d>] ? xen_force_evtchn_callback+0xd/0x10
[2819125.430192] [<ffffffff81007332>] ? check_events+0x12/0x20
[2819125.430199] [<ffffffff810072d9>] ? xen_irq_enable_direct_end+0x0/0x7
[2819125.430206] [<ffffffff810b2301>] ? cpu_stopper_thread+0xd1/0x1d0
[2819125.430214] [<ffffffff8159f6f1>] ? schedule+0x3e1/0x830
[2819125.430221] [<ffffffff815a1aee>] ? _raw_spin_unlock_irqrestore+0x1e/0x30
[2819125.430227] [<ffffffff810b2230>] ? cpu_stopper_thread+0x0/0x1d0
[2819125.430235] [<ffffffff8107f616>] ? kthread+0x96/0xa0
[2819125.430240] [<ffffffff8100aee4>] ? kernel_thread_helper+0x4/0x10
[2...

Okay, here goes my tests:

2.6.35-28-server #50-Ubuntu SMP Fri Mar 18 18:59:25 UTC 2011 x86_64 GNU/Linux
[2819125.429932] ------------[ cut here ]------------
[2819125.429943] kernel BUG at /build/buildd/linux-2.6.35/arch/x86/xen/spinlock.c:344!
[2819125.429950] invalid opcode: 0000 [#1] SMP 
[2819125.429956] last sysfs file: /sys/kernel/uevent_seqnum
[2819125.429961] CPU 3 
[2819125.429964] Modules linked in: xenfs lp parport xen_netfront xen_blkfront
[2819125.429977] 
[2819125.429981] Pid: 12, comm: migration/3 Not tainted 2.6.35-28-server #50-Ubuntu /
[2819125.429987] RIP: e030:[<ffffffff81007d04>]  [<ffffffff81007d04>] dummy_handler+0x4/0x10
[2819125.430000] RSP: e02b:ffff880003f60ea8  EFLAGS: 00010046
[2819125.430005] RAX: ffffffffff57b000 RBX: ffff88001fc1a8a0 RCX: 0000000000000000
[2819125.430011] RDX: 0000000000400200 RSI: 0000000000000000 RDI: 0000000000000013
[2819125.430016] RBP: ffff880003f60ea8 R08: 0000000000000600 R09: 0000000000000000
[2819125.430022] R10: ffff880003f68028 R11: 0000000000012ed0 R12: 0000000000000000
[2819125.430028] R13: 0000000000000000 R14: 0000000000000013 R15: 0000000000000100
[2819125.430038] FS:  00007f2137153700(0000) GS:ffff880003f5d000(0000) knlGS:0000000000000000
[2819125.430045] CS:  e033 DS: 0000 ES: 0000 CR0: 000000008005003b
[2819125.430050] CR2: 0000000000000000 CR3: 0000000003acf000 CR4: 0000000000002660
[2819125.430057] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[2819125.430063] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[2819125.430069] Process migration/3 (pid: 12, threadinfo ffff88001fdd2000, task ffff88001fd9dbc0)
[2819125.430076] Stack:
[2819125.430079]  ffff880003f60ef8 ffffffff810ca310 0000000000000000 000000000072f59a
[2819125.430088] <0> ffff880003f60ef8 ffff88001fc05900 0000000000000013 0000000000000600
[2819125.430099] <0> 0000000000000001 0000000000000100 ffff880003f60f18 ffffffff810cca52
[2819125.430111] Call Trace:
[2819125.430115]  <IRQ> 
[2819125.430123]  [<ffffffff810ca310>] handle_IRQ_event+0x50/0x160
[2819125.430130]  [<ffffffff810cca52>] handle_percpu_irq+0x42/0x80
[2819125.430139]  [<ffffffff81348706>] xen_evtchn_do_upcall+0x1d6/0x200
[2819125.430147]  [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.430154]  [<ffffffff8100b02e>] xen_do_hypervisor_callback+0x1e/0x30
[2819125.430159]  <EOI> 
[2819125.430164]  [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.430172]  [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.430179]  [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.430186]  [<ffffffff81006b9d>] ? xen_force_evtchn_callback+0xd/0x10
[2819125.430192]  [<ffffffff81007332>] ? check_events+0x12/0x20
[2819125.430199]  [<ffffffff810072d9>] ? xen_irq_enable_direct_end+0x0/0x7
[2819125.430206]  [<ffffffff810b2301>] ? cpu_stopper_thread+0xd1/0x1d0
[2819125.430214]  [<ffffffff8159f6f1>] ? schedule+0x3e1/0x830
[2819125.430221]  [<ffffffff815a1aee>] ? _raw_spin_unlock_irqrestore+0x1e/0x30
[2819125.430227]  [<ffffffff810b2230>] ? cpu_stopper_thread+0x0/0x1d0
[2819125.430235]  [<ffffffff8107f616>] ? kthread+0x96/0xa0
[2819125.430240]  [<ffffffff8100aee4>] ? kernel_thread_helper+0x4/0x10
[2819125.430247]  [<ffffffff8100a313>] ? int_ret_from_sys_call+0x7/0x1b
[2819125.430254]  [<ffffffff815a205d>] ? retint_restore_args+0x5/0x6
[2819125.430260]  [<ffffffff8100aee0>] ? kernel_thread_helper+0x0/0x10
[2819125.430265] Code: 89 e5 c9 0f 95 c0 c3 55 b8 01 00 00 00 86 07 84 c0 48 89 e5 0f 94 c0 c9 0f b6 c0 c3 eb 09 90 90 90 90 90 90 90 90 90 55 48 89 e5 <0f> 0b eb fe eb 06 90 90 90 90 90 90 55 ba ff ff ff ff 48 89 e5 
[2819125.430341] RIP  [<ffffffff81007d04>] dummy_handler+0x4/0x10
[2819125.430348]  RSP <ffff880003f60ea8>
[2819125.430354] ---[ end trace d7c57d1d217dc8cc ]---
[2819125.430358] Kernel panic - not syncing: Fatal exception in interrupt
[2819125.430364] Pid: 12, comm: migration/3 Tainted: G      D     2.6.35-28-server #50-Ubuntu
[2819125.430370] Call Trace:
[2819125.430374]  <IRQ>  [<ffffffff8159ed8a>] panic+0x90/0x113
[2819125.430383]  [<ffffffff815a1aee>] ? _raw_spin_unlock_irqrestore+0x1e/0x30
[2819125.430390]  [<ffffffff815a301a>] oops_end+0xea/0xf0
[2819125.430396]  [<ffffffff8100de0b>] die+0x5b/0x90
[2819125.430402]  [<ffffffff815a2884>] do_trap+0xc4/0x170
[2819125.430408]  [<ffffffff8100ba35>] do_invalid_op+0x95/0xb0
[2819125.430414]  [<ffffffff81007d04>] ? dummy_handler+0x4/0x10
[2819125.430420]  [<ffffffff81006b9d>] ? xen_force_evtchn_callback+0xd/0x10
[2819125.430426]  [<ffffffff81007332>] ? check_events+0x12/0x20
[2819125.430432]  [<ffffffff8100ad5b>] invalid_op+0x1b/0x20
[2819125.430439]  [<ffffffff81007d04>] ? dummy_handler+0x4/0x10
[2819125.430445]  [<ffffffff810ca517>] ? irq_to_desc+0x17/0x20
[2819125.430450]  [<ffffffff810ca310>] handle_IRQ_event+0x50/0x160
[2819125.430457]  [<ffffffff810cca52>] handle_percpu_irq+0x42/0x80
[2819125.430463]  [<ffffffff81348706>] xen_evtchn_do_upcall+0x1d6/0x200
[2819125.529453]  [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.529464]  [<ffffffff8100b02e>] xen_do_hypervisor_callback+0x1e/0x30
[2819125.529469]  <EOI>  [<ffffffff810b2400>] ? stop_machine_cpu_stop+0x0/0xe0
[2819125.529481]  [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.529488]  [<ffffffff8100122a>] ? hypercall_page+0x22a/0x1010
[2819125.529495]  [<ffffffff81006b9d>] ? xen_force_evtchn_callback+0xd/0x10
[2819125.529501]  [<ffffffff81007332>] ? check_events+0x12/0x20
[2819125.529508]  [<ffffffff810072d9>] ? xen_irq_enable_direct_end+0x0/0x7
[2819125.529514]  [<ffffffff810b2301>] ? cpu_stopper_thread+0xd1/0x1d0
[2819125.529523]  [<ffffffff8159f6f1>] ? schedule+0x3e1/0x830
[2819125.529530]  [<ffffffff815a1aee>] ? _raw_spin_unlock_irqrestore+0x1e/0x30
[2819125.529536]  [<ffffffff810b2230>] ? cpu_stopper_thread+0x0/0x1d0
[2819125.529543]  [<ffffffff8107f616>] ? kthread+0x96/0xa0
[2819125.529549]  [<ffffffff8100aee4>] ? kernel_thread_helper+0x4/0x10
[2819125.529556]  [<ffffffff8100a313>] ? int_ret_from_sys_call+0x7/0x1b
[2819125.529563]  [<ffffffff815a205d>] ? retint_restore_args+0x5/0x6
[2819125.529569]  [<ffffffff8100aee0>] ? kernel_thread_helper+0x0/0x10

2.6.35-30-server #54-Ubuntu SMP Tue Jun 7 20:13:05 UTC 2011 x86_64 GNU/Linux
..... Working fine !!

Tks !

At least in my case, lucid still have some glitches while xenmotion, but its much better with the latest kernel release.

Maverick seems to be working fine using the 35-30 proposed kernel. All 8 xenmotions to different hosts worked, no fail.

Steve Conklin (sconklin) on 2011-06-17

tags:

added: verification-done-maverick
removed: verification-needed-maverick

Revision history for this message

Launchpad Janitor (janitor) wrote on 2011-06-27:

#50

Download full text (30.0 KiB)

This bug was fixed in the package linux - 2.6.35-30.54

---------------
linux (2.6.35-30.54) maverick-proposed; urgency=low

[ Brad Figg ]

* Release Tracking Bug
- LP: #794114

[ Upstream Kernel Changes ]

  * Revert "xhci: Fix full speed bInterval encoding."
  * Revert "USB: xhci - also free streams when resetting devices"
  * Revert "USB: xhci - fix math in xhci_get_endpoint_interval()"
  * Revert "USB: xhci - fix unsafe macro definitions"

linux (2.6.35-30.53) maverick-proposed; urgency=low

[ Upstream Kernel Changes ]

* xhci: Fix full speed bInterval encoding.
- LP: #792959

linux (2.6.35-30.52) maverick-proposed; urgency=low

[ Herton R. Krzesinski ]

* Release Tracking Bug
- LP: #790653

[ Stefan Bader ]

* Include nls_iso8859-1 for virtual images
- LP: #732046

[ Thomas Schlichter ]

* SAUCE: vesafb: mtrr module parameter is uint, not bool
- LP: #778043

[ Tim Gardner ]

* [Config] Add cachefiles.ko to virtual flavour
- LP: #770430

[ Upstream Kernel Changes ]

  * Revert "intel_idle: PCI quirk to prevent Lenovo Ideapad s10-3 boot
    hang"
    - LP: #772560
  * Revert "TPM: Long default timeout fix"
    - LP: #772560
  * Revert "tpm_tis: Use timeouts returned from TPM"
    - LP: #772560
  * Revert "xen: set max_pfn_mapped to the last pfn mapped"
  * CAN: Use inode instead of kernel address for /proc file, CVE-2010-4565
    - LP: #765007
    - CVE-2010-4565
  * xfs: prevent leaking uninitialized stack memory in FSGEOMETRY_V1,
    CVE-2011-0711
    - LP: #767740
    - CVE-2011-0711
  * Treat writes as new when holes span across page boundaries,
    CVE-2011-0463
    - LP: #770483
    - CVE-2011-0463
  * fs/partitions/ldm.c: fix oops caused by corrupted partition table,
    CVE-2011-1017
    - LP: #771382
    - CVE-2011-1017
  * qla2xxx: Make the FC port capability mutual exclusive.
    - LP: #772560
  * staging: usbip: bugfixes related to kthread conversion
    - LP: #772560
  * staging: usbip: bugfix add number of packets for isochronous frames
    - LP: #772560
  * staging: usbip: bugfix for isochronous packets and optimization
    - LP: #772560
  * staging: hv: Fix GARP not sent after Quick Migration
    - LP: #772560
  * staging: hv: use sync_bitops when interacting with the hypervisor
    - LP: #772560
  * irda: validate peer name and attribute lengths
    - LP: #772560
  * irda: prevent heap corruption on invalid nickname
    - LP: #772560
  * nilfs2: fix data loss in mmap page write for hole blocks
    - LP: #772560
  * ASoC: Explicitly say registerless widgets have no register
    - LP: #772560
  * ALSA: ens1371: fix Creative Ectiva support
    - LP: #772560
  * ROSE: prevent heap corruption with bad facilities
    - LP: #772560
  * Btrfs: Fix uninitialized root flags for subvolumes
    - LP: #772560
  * x86, mtrr, pat: Fix one cpu getting out of sync during resume
    - LP: #772560
  * UBIFS: do not read flash unnecessarily
    - LP: #772560
  * UBIFS: fix oops on error path in read_pnode
    - LP: #772560
  * UBIFS: fix debugging failure in dbg_check_space_info
    - LP: #772560
  * quota: Don't write quota info in dquot_commit()
    - LP: #772560
  * mm: avoid wrapping vm_...

This bug was fixed in the package linux - 2.6.35-30.54

---------------
linux (2.6.35-30.54) maverick-proposed; urgency=low

[ Brad Figg ]

* Release Tracking Bug
    - LP: #794114

[ Upstream Kernel Changes ]

* Revert "xhci: Fix full speed bInterval encoding."
  * Revert "USB: xhci - also free streams when resetting devices"
  * Revert "USB: xhci - fix math in xhci_get_endpoint_interval()"
  * Revert "USB: xhci - fix unsafe macro definitions"

linux (2.6.35-30.53) maverick-proposed; urgency=low

[ Upstream Kernel Changes ]

* xhci: Fix full speed bInterval encoding.
    - LP: #792959

linux (2.6.35-30.52) maverick-proposed; urgency=low

[ Herton R. Krzesinski ]

* Release Tracking Bug
    - LP: #790653

[ Stefan Bader ]

* Include nls_iso8859-1 for virtual images
    - LP: #732046

[ Thomas Schlichter ]

* SAUCE: vesafb: mtrr module parameter is uint, not bool
    - LP: #778043

[ Tim Gardner ]

* [Config] Add cachefiles.ko to virtual flavour
    - LP: #770430

[ Upstream Kernel Changes ]

* Revert "intel_idle: PCI quirk to prevent Lenovo Ideapad s10-3 boot
    hang"
    - LP: #772560
  * Revert "TPM: Long default timeout fix"
    - LP: #772560
  * Revert "tpm_tis: Use timeouts returned from TPM"
    - LP: #772560
  * Revert "xen: set max_pfn_mapped to the last pfn mapped"
  * CAN: Use inode instead of kernel address for /proc file, CVE-2010-4565
    - LP: #765007
    - CVE-2010-4565
  * xfs: prevent leaking uninitialized stack memory in FSGEOMETRY_V1,
    CVE-2011-0711
    - LP: #767740
    - CVE-2011-0711
  * Treat writes as new when holes span across page boundaries,
    CVE-2011-0463
    - LP: #770483
    - CVE-2011-0463
  * fs/partitions/ldm.c: fix oops caused by corrupted partition table,
    CVE-2011-1017
    - LP: #771382
    - CVE-2011-1017
  * qla2xxx: Make the FC port capability mutual exclusive.
    - LP: #772560
  * staging: usbip: bugfixes related to kthread conversion
    - LP: #772560
  * staging: usbip: bugfix add number of packets for isochronous frames
    - LP: #772560
  * staging: usbip: bugfix for isochronous packets and optimization
    - LP: #772560
  * staging: hv: Fix GARP not sent after Quick Migration
    - LP: #772560
  * staging: hv: use sync_bitops when interacting with the hypervisor
    - LP: #772560
  * irda: validate peer name and attribute lengths
    - LP: #772560
  * irda: prevent heap corruption on invalid nickname
    - LP: #772560
  * nilfs2: fix data loss in mmap page write for hole blocks
    - LP: #772560
  * ASoC: Explicitly say registerless widgets have no register
    - LP: #772560
  * ALSA: ens1371: fix Creative Ectiva support
    - LP: #772560
  * ROSE: prevent heap corruption with bad facilities
    - LP: #772560
  * Btrfs: Fix uninitialized root flags for subvolumes
    - LP: #772560
  * x86, mtrr, pat: Fix one cpu getting out of sync during resume
    - LP: #772560
  * UBIFS: do not read flash unnecessarily
    - LP: #772560
  * UBIFS: fix oops on error path in read_pnode
    - LP: #772560
  * UBIFS: fix debugging failure in dbg_check_space_info
    - LP: #772560
  * quota: Don't write quota info in dquot_commit()
    - LP: #772560
  * mm: avoid wrapping vm_pgoff in mremap()
    - LP: #772560
  * p54usb: IDs for two new devices
    - LP: #772560
  * b43: allocate receive buffers big enough for max frame len + offset
    - LP: #772560
  * Bluetooth: sco: fix information leak to userspace
    - LP: #772560
  * bridge: netfilter: fix information leak
    - LP: #772560
  * Bluetooth: bnep: fix buffer overflow
    - LP: #772560
  * Bluetooth: add support for Apple MacBook Pro 8,2
    - LP: #772560
  * char/tpm: Fix unitialized usage of data buffer
    - LP: #772560
  * netfilter: ip_tables: fix infoleak to userspace
    - LP: #772560
  * netfilter: arp_tables: fix infoleak to userspace
    - LP: #772560
  * netfilter: ipt_CLUSTERIP: fix buffer overflow
    - LP: #772560
  * ipv6: netfilter: ip6_tables: fix infoleak to userspace
    - LP: #772560
  * mfd: ab3100: world-writable debugfs *_priv files
    - LP: #772560
  * drivers/rtc/rtc-ds1511.c: world-writable sysfs nvram file
    - LP: #772560
  * drivers/misc/ep93xx_pwm.c: world-writable sysfs files
    - LP: #772560
  * econet: 4 byte infoleak to the network
    - LP: #772560
  * sound/oss: remove offset from load_patch callbacks
    - LP: #772560
  * sound: oss: midi_synth: check get_user() return value
    - LP: #772560
  * gro: Reset dev pointer on reuse
    - LP: #772560
  * gro: reset skb_iif on reuse
    - LP: #772560
  * x86, microcode, AMD: Extend ucode size verification
    - LP: #772560
  * Squashfs: handle corruption of directory structure
    - LP: #772560
  * atm/solos-pci: Don't include frame pseudo-header on transmit hex-dump
    - LP: #772560
  * ext4: fix credits computing for indirect mapped files
    - LP: #772560
  * nfsd: fix auth_domain reference leak on nlm operations
    - LP: #772560
  * inet_diag: Make sure we actually run the same bytecode we audited.
    - LP: #772560
  * xfs: zero proper structure size for geometry calls
    - LP: #772560
  * cifs: always do is_path_accessible check in cifs_mount
    - LP: #772560
  * video: sn9c102: world-wirtable sysfs files
    - LP: #772560
  * UBIFS: restrict world-writable debugfs files
    - LP: #772560
  * NET: cdc-phonet, handle empty phonet header
    - LP: #772560
  * x86: Fix a bogus unwind annotation in lib/semaphore_32.S
    - LP: #772560
  * tioca: Fix assignment from incompatible pointer warnings
    - LP: #772560
  * mca.c: Fix cast from integer to pointer warning
    - LP: #772560
  * ramfs: fix memleak on no-mmu arch
    - LP: #772560
  * MAINTAINERS: update STABLE BRANCH info
    - LP: #772560
  * UBIFS: fix oops when R/O file-system is fsync'ed
    - LP: #772560
  * x86, cpu: AMD errata checking framework
    - LP: #772560
  * x86, cpu: Clean up AMD erratum 400 workaround
    - LP: #772560
  * x86, AMD: Set ARAT feature on AMD processors
    - LP: #772560
  * x86, amd: Disable GartTlbWlkErr when BIOS forgets it
    - LP: #772560
  * USB: ftdi_sio: Added IDs for CTI USB Serial Devices
    - LP: #772560
  * USB: ftdi_sio: add PID for OCT DK201 docking station
    - LP: #772560
  * USB: ftdi_sio: add ids for Hameg HO720 and HO730
    - LP: #772560
  * USB: option: Add new ONDA vendor id and product id for ONDA MT825UP
    - LP: #772560
  * USB: option: Added support for Samsung GT-B3730/GT-B3710 LTE USB modem.
    - LP: #772560
  * next_pidmap: fix overflow condition
    - LP: #772560
  * proc: do proper range check on readdir offset
    - LP: #772560
  * USB: EHCI: unlink unused QHs when the controller is stopped
    - LP: #772560
  * USB: fix formatting of SuperSpeed endpoints in /proc/bus/usb/devices
    - LP: #772560
  * USB: xhci - fix unsafe macro definitions
    - LP: #772560
  * USB: xhci - fix math in xhci_get_endpoint_interval()
    - LP: #772560
  * x86, cpu: Fix regression in AMD errata checking code
    - LP: #772560
  * Input: synaptics - fix crash in synaptics_module_init()
    - LP: #772560
  * ath9k: fix a chip wakeup related crash in ath9k_start
    - LP: #772560
  * ath: add missing regdomain pair 0x5c mapping
    - LP: #772560
  * block, blk-sysfs: Fix an err return path in blk_register_queue()
    - LP: #772560
  * p54: Initialize extra_len in p54_tx_80211
    - LP: #772560
  * x86, gart: Make sure GART does not map physmem above 1TB
    - LP: #772560
  * intel-iommu: Unlink domain from iommu
    - LP: #772560
  * intel-iommu: Fix get_domain_for_dev() error path
    - LP: #772560
  * drm/radeon/kms: fix bad shift in atom iio table parser
    - LP: #772560
  * NFS: nfs_wcc_update_inode() should set nfsi->attr_gencount
    - LP: #772560
  * serial/imx: read cts state only after acking cts change irq
    - LP: #772560
  * ASoC: Fix output PGA enabling in wm_hubs CODECs
    - LP: #772560
  * kconfig: Avoid buffer underrun in choice input
    - LP: #772560
  * UBIFS: fix master node recovery
    - LP: #772560
  * Remove extra struct page member from the buffer info structure
    - LP: #772560
  * dasd: correct device table
    - LP: #772560
  * iwlagn: Support new 5000 microcode.
    - LP: #772560
  * uvcvideo: Fix descriptor parsing for video output devices
    - LP: #772560
  * ALSA: hda - VIA: Add missing support for VT1718S in A-A path
    - LP: #772560
  * ALSA: hda - VIA: Fix stereo mixer recording no sound issue
    - LP: #772560
  * iwlwifi: fix skb usage after free
    - LP: #772560
  * intel-iommu: Fix use after release during device attach
    - LP: #772560
  * USB: Fix unplug of device with active streams
    - LP: #772560
  * USB: xhci - also free streams when resetting devices
    - LP: #772560
  * 2.6.35.y: Revert "SH: Add missing consts to sys_execve() declaration"
    - LP: #772560
  * 2.6.35.13 longterm review
    - LP: #772560
  * release-2.6.35.13
    - LP: #772560
  * xen: events: do not unmask event channels on resume
    - LP: #681083
  * ALSA: sound/pci/asihpi: check adapter index in hpi_ioctl, CVE-2011-1169
    - LP: #785331
    - CVE-2011-1169
  * mpt2sas: prevent heap overflows and unchecked reads, CVE-2011-1494
    - LP: #787145
    - CVE-2011-1494
  * can: add missing socket check in can/raw release, CVE-2011-1748
    - LP: #788694
    - CVE-2011-1748
  * agp: fix arbitrary kernel memory writes, CVE-1011-2022
    - LP: #788684
    - CVE-1011-2022

linux (2.6.35-29.51) maverick-proposed; urgency=low

[ Steve Conklin ]

* Release Tracking Bug
    - LP: #760928

[ Brad Figg ]

* [Config] Set CONFIG_NR_CPUS=256 for amd64 generic
    - LP: #737124

[ Kees Cook ]

* SAUCE: nx-emu: further clarify dmesg reporting
    - LP: #745181

[ Loïc Minier ]

* Include nls_cp437 module in virtual for fat
    - LP: #732046

[ Manoj Iyer ]

* SAUCE: thinkpad-acpi: module autoloading for newer Lenovo ThinkPads.
    - LP: #745217
  * SAUCE: (drop after 2.6.38) add support for Lenovo tablet ID (0xE6)
    - LP: #746652

[ Mel Gorman ]

* (pre-stable) mm: page allocator: adjust the per-cpu counter threshold
    when memory is low
    - LP: #719446

[ Tim Gardner ]

* [Config] updateconfigs after 2.6.35.12
    - LP: #747520
  * SAUCE: staging: hv: Fixed issue with scheduling while atomic in
    hv_vmbus
    - LP: #752064
  * SAUCE: Backport of mainline loss of network fix for Hyper-V
    - LP: #752064

[ Upstream Kernel Changes ]

* Revert "slab: Fix missing DEBUG_SLAB last user"
    - LP: #747520
  * PM / Hibernate: Improve comments in hibernate_preallocate_memory()
    - LP: #737208
  * PM / Hibernate: Make default image size depend on total RAM size
    - LP: #737208
  * irda: prevent integer underflow in IRLMP_ENUMDEVICES, CVE-2010-4529
    - LP: #737823
    - CVE-2010-4529
  * perf session: Invalidate last_match when removing threads from rb_tree
    - LP: #747520
  * Fix cred leak in AF_NETLINK
    - LP: #747520
  * staging: usbip: remove double giveback of URB
    - LP: #747520
  * USB: EHCI: ASPM quirk of ISOC on AMD SB800
    - LP: #747520
  * rt2x00: add device id for windy31 usb device
    - LP: #747520
  * ALSA: snd-usb-us122l: Fix missing NULL checks
    - LP: #747520
  * hwmon: (via686a) Initialize fan_div values
    - LP: #747520
  * USB: serial: handle Data Carrier Detect changes
    - LP: #747520
  * USB: CP210x Add two device IDs
    - LP: #747520
  * USB: CP210x Removed incorrect device ID
    - LP: #747520
  * USB: usb-storage: unusual_devs update for Cypress ATACB
    - LP: #747520
  * USB: usb-storage: unusual_devs update for TrekStor DataStation maxi g.u
    external hard drive enclosure
    - LP: #747520
  * USB: usb-storage: unusual_devs entry for CamSport Evo
    - LP: #747520
  * USB: usb-storage: unusual_devs entry for Coby MP3 player
    - LP: #747520
  * USB: serial: Updated support for ICOM devices
    - LP: #747520
  * USB: adding USB support for Cinterion's HC2x, EU3 and PH8 products
    - LP: #747520
  * USB: EHCI: ASPM quirk of ISOC on AMD Hudson
    - LP: #747520
  * USB: EHCI: fix DMA deallocation bug
    - LP: #747520
  * USB: g_printer: fix bug in module parameter definitions
    - LP: #747520
  * USB: io_edgeport: fix the reported firmware major and minor
    - LP: #747520
  * USB: ti_usb: fix module removal
    - LP: #747520
  * USB: Storage: Add unusual_devs entry for VTech Kidizoom
    - LP: #747520
  * USB: ftdi_sio: add ST Micro Connect Lite uart support
    - LP: #747520
  * USB: cdc-acm: Adding second ACM channel support for Nokia N8
    - LP: #747520
  * USB: ftdi_sio: Add VID=0x0647, PID=0x0100 for Acton Research
    spectrograph
    - LP: #747520
  * USB: prevent buggy hubs from crashing the USB stack
    - LP: #747520
  * staging: comedi: add support for newer jr3 1-channel pci board
    - LP: #747520
  * staging: comedi: ni_labpc: Use shared IRQ for PCMCIA card
    - LP: #747520
  * Staging: hv: fix sysfs symlink on hv block device
    - LP: #747520
  * staging: hv: Enable sending GARP packet after live migration
    - LP: #747520
  * iwlagn: enable only rfkill interrupt when device is down
    - LP: #747520
  * ath9k: Fix bug in delimiter padding computation
    - LP: #747520
  * fix medium error problems with some arrays which can cause data
    corruption
    - LP: #747520
  * libsas: fix runaway error handler problem
    - LP: #747520
  * mpt2sas: Fix device removal handshake for zoned devices
    - LP: #747520
  * mpt2sas: Correct resizing calculation for max_queue_depth
    - LP: #747520
  * mpt2sas: Kernel Panic during Large Topology discovery
    - LP: #747520
  * radio-aimslab.c: Fix gcc 4.5+ bug
    - LP: #747520
  * em28xx: Fix audio input for Terratec Grabby
    - LP: #747520
  * ALSA : au88x0 - Limit number of channels to fix Oops via OSS emu
    - LP: #747520
  * ALSA: HDA: Fix dmesg output of HDMI supported bits
    - LP: #747520
  * ALSA: hda - Fix memory leaks in conexant jack arrays
    - LP: #747520
  * input: bcm5974: Add support for MacBookAir3
    - LP: #747520
  * ALSA: hrtimer: handle delayed timer interrupts
    - LP: #747520
  * ASoC: WM8990: msleep() takes milliseconds not jiffies
    - LP: #747520
  * ASoC: Blackfin AC97: fix build error after multi-component update
    - LP: #747520
  * NFS: Fix "kernel BUG at fs/aio.c:554!"
    - LP: #747520
  * rtc-cmos: fix suspend/resume
    - LP: #747520
  * iwlagn: Re-enable RF_KILL interrupt when down
    - LP: #747520
  * rapidio: fix hang on RapidIO doorbell queue full condition
    - LP: #747520
  * PCI: pci-stub: ignore zero-length id parameters
    - LP: #747520
  * virtio: remove virtio-pci root device
    - LP: #747520
  * ds2760_battery: Fix calculation of time_to_empty_now
    - LP: #747520
  * p54: fix sequence no. accounting off-by-one error
    - LP: #747520
  * i2c: Unregister dummy devices last on adapter removal
    - LP: #747520
  * serial: unbreak billionton CF card
    - LP: #747520
  * ptrace: use safer wake up on ptrace_detach()
    - LP: #747520
  * x86, mtrr: Avoid MTRR reprogramming on BP during boot on UP platforms
    - LP: #747520
  * fix jiffy calculations in calibrate_delay_direct to handle overflow
    - LP: #747520
  * drivers: update to pl2303 usb-serial to support Motorola cables
    - LP: #747520
  * klist: Fix object alignment on 64-bit.
    - LP: #747520
  * powerpc: Fix some 6xx/7xxx CPU setup functions
    - LP: #747520
  * parisc : Remove broken line wrapping handling pdc_iodc_print()
    - LP: #747520
  * kernel/smp.c: fix smp_call_function_many() SMP race
    - LP: #747520
  * hostap_cs: fix sleeping function called from invalid context
    - LP: #747520
  * md: fix regression with re-adding devices to arrays with no metadata
    - LP: #747520
  * pata_mpc52xx: inherit from ata_bmdma_port_ops
    - LP: #747520
  * TPM: Long default timeout fix
    - LP: #747520
  * tpm_tis: Use timeouts returned from TPM
    - LP: #747520
  * SELinux: define permissions for DCB netlink messages
    - LP: #747520
  * SELinux: do not compute transition labels on mountpoint labeled
    filesystems
    - LP: #747520
  * ieee80211: correct IEEE80211_ADDBA_PARAM_BUF_SIZE_MASK macro
    - LP: #747520
  * dm: dont take i_mutex to change device size
    - LP: #747520
  * dm mpath: disable blk_abort_queue
    - LP: #747520
  * drm/radeon/kms: add quirk for Mac Radeon HD 2600 card
    - LP: #747520
  * drm/radeon/kms: make the mac rv630 quirk generic
    - LP: #747520
  * drm/radeon/kms: add pll debugging output
    - LP: #747520
  * drm/radeon: remove 0x4243 pci id
    - LP: #747520
  * drm/radeon/kms: fix s/r issues with bios scratch regs
    - LP: #747520
  * drm/i915/lvds: Add AOpen i915GMm-HFS to the list of false-positive LVDS
    - LP: #747520
  * drm/i915: Add dependency on CONFIG_TMPFS
    - LP: #747520
  * x86, mm: avoid possible bogus tlb entries by clearing prev mm_cpumask
    after switching mm
    - LP: #747520
  * usb: Realloc xHCI structures after a hub is verified.
    - LP: #747520
  * sched: Move sched_avg_update() to update_cpu_load()
    - LP: #747520
  * sched: Increment cache_nice_tries only on periodic lb
    - LP: #747520
  * sched: Try not to migrate higher priority RT tasks
    - LP: #747520
  * sched: Give CPU bound RT tasks preference
    - LP: #747520
  * sched: suppress RCU lockdep splat in task_fork_fair
    - LP: #747520
  * sched: Do not consider SCHED_IDLE tasks to be cache hot
    - LP: #747520
  * sched: Set group_imb only a task can be pulled from the busiest cpu
    - LP: #747520
  * sched: Force balancing on newidle balance if local group has capacity
    - LP: #747520
  * sched: Drop group_capacity to 1 only if local group has extra capacity
    - LP: #747520
  * sched: Fix softirq time accounting
    - LP: #747520
  * sched: Consolidate account_system_vtime extern declaration
    - LP: #747520
  * sched: Remove unused PF_ALIGNWARN flag
    - LP: #747520
  * sched: Add a PF flag for ksoftirqd identification
    - LP: #747520
  * sched: Add IRQ_TIME_ACCOUNTING, finer accounting of irq time
    - LP: #747520
  * x86: Add IRQ_TIME_ACCOUNTING
    - LP: #747520
  * sched: Do not account irq time to current task
    - LP: #747520
  * sched: Remove irq time from available CPU power
    - LP: #747520
  * sched: Call tick_check_idle before __irq_enter
    - LP: #747520
  * sched: Export account_system_vtime()
    - LP: #747520
  * sched, cgroup: Fixup broken cgroup movement
    - LP: #747520
  * sched: Use group weight, idle cpu metrics to fix imbalances during idle
    - LP: #747520
  * kernel/user.c: add lock release annotation on free_user()
    - LP: #747520
  * NFSD: memory corruption due to writing beyond the stat array
    - LP: #747520
  * mptfusion: mptctl_release is required in mptctl.c
    - LP: #747520
  * mptfusion: Fix Incorrect return value in mptscsih_dev_reset
    - LP: #747520
  * ocfs2_connection_find() returns pointer to bad structure
    - LP: #747520
  * x86/pvclock: Zero last_value on resume
    - LP: #747520
  * av7110: check for negative array offset
    - LP: #747520
  * bonding/vlan: Avoid mangled NAs on slaves without VLAN tag insertion
    - LP: #747520
  * CRED: Fix kernel panic upon security_file_alloc() failure.
    - LP: #747520
  * CRED: Fix BUG() upon security_cred_alloc_blank() failure
    - LP: #747520
  * CRED: Fix memory and refcount leaks upon security_prepare_creds()
    failure
    - LP: #747520
  * isdn: hisax: Replace the bogus access to irq stats
    - LP: #747520
  * scsi_dh_alua: add netapp to dev list
    - LP: #747520
  * scsi_dh_alua: Add IBM Power Virtual SCSI ALUA device to dev list
    - LP: #747520
  * nfsd: correctly handle return value from nfsd_map_name_to_*
    - LP: #747520
  * s390: remove task_show_regs
    - LP: #747520
  * PM / Hibernate: Return error code when alloc_image_page() fails
    - LP: #747520
  * fs/partitions: Validate map_count in Mac partition tables
    - LP: #747520
  * ALSA: HDA: Add position_fix quirk for an Asus device
    - LP: #718402, #747520
  * ALSA: caiaq - Fix possible string-buffer overflow
    - LP: #747520
  * radio-aimslab.c needs #include <linux/delay.h>
    - LP: #747520
  * ARM: Ensure predictable endian state on signal handler entry
    - LP: #747520
  * acer-wmi: Fix capitalisation of GUID
    - LP: #747520
  * eCryptfs: Copy up lower inode attrs in getattr
    - LP: #747520
  * platform: x86: acer-wmi: world-writable sysfs threeg file
    - LP: #747520
  * platform: x86: asus_acpi: world-writable procfs files
    - LP: #747520
  * platform: x86: tc1100-wmi: world-writable sysfs wireless and jogdial
    files
    - LP: #747520
  * genirq: Disable the SHIRQ_DEBUG call in request_threaded_irq for now
    - LP: #747520
  * usb: musb: omap2430: fix kernel panic on reboot
    - LP: #747520
  * USB: add quirks entry for Keytouch QWERTY Panel
    - LP: #747520
  * USB: Add Samsung SGH-I500/Android modem ID switch to visor driver
    - LP: #747520
  * USB: Add quirk for Samsung Android phone modem
    - LP: #747520
  * p54pci: update receive dma buffers before and after processing
    - LP: #747520
  * sierra: add new ID for Airprime/Sierra USB IP modem
    - LP: #747520
  * staging: usbip: vhci: update reference count for usb_device
    - LP: #747520
  * staging: usbip: vhci: give back URBs from in-flight unlink requests
    - LP: #747520
  * staging: usbip: vhci: refuse to enqueue for dead connections
    - LP: #747520
  * staging: usbip: vhci: use urb->dev->portnum to find port
    - LP: #747520
  * epoll: prevent creating circular epoll structures
    - LP: #747520
  * ldm: corrupted partition table can cause kernel oops
    - LP: #747520
  * md: correctly handle probe of an 'mdp' device.
    - LP: #747520
  * x86 quirk: Fix polarity for IRQ0 pin2 override on SB800 systems
    - LP: #747520
  * xhci: Avoid BUG() in interrupt context
    - LP: #747520
  * xhci: Clarify some expressions in the TRB math
    - LP: #747520
  * xhci: Fix errors in the running total calculations in the TRB math
    - LP: #747520
  * xhci: Fix an error in count_sg_trbs_needed()
    - LP: #747520
  * x25: Do not reference freed memory.
    - LP: #747520
  * Ocfs2/refcounttree: Fix a bug for refcounttree to writeback clusters in
    a right number.
    - LP: #747520
  * drm: fix unsigned vs signed comparison issue in modeset ctl ioctl.
    - LP: #747520
  * mfd: Fix NULL pointer due to non-initialized ucb1x00-ts absinfo
    - LP: #747520
  * x86: Use u32 instead of long to set reset vector back to 0
    - LP: #747520
  * fuse: fix hang of single threaded fuseblk filesystem
    - LP: #747520
  * clockevents: Prevent oneshot mode when broadcast device is periodic
    - LP: #747520
  * ext2: Fix link count corruption under heavy link+rename load
    - LP: #747520
  * p54usb: add Senao NUB-350 usbid
    - LP: #747520
  * dccp: fix oops on Reset after close
    - LP: #747520
  * e1000e: disable broken PHY wakeup for ICH10 LOMs, use MAC wakeup
    instead
    - LP: #747520
  * r8169: disable ASPM
    - LP: #747520
  * usb: iowarrior: don't trust report_size for buffer size
    - LP: #747520
  * arp_notify: unconditionally send gratuitous ARP for
    NETDEV_NOTIFY_PEERS.
    - LP: #747520
  * CIFS: Fix oplock break handling (try #2)
    - LP: #747520
  * cpuset: add a missing unlock in cpuset_write_resmask()
    - LP: #747520
  * keyboard: integer underflow bug
    - LP: #747520
  * RxRPC: Fix v1 keys
    - LP: #747520
  * ixgbe: fix for 82599 erratum on Header Splitting
    - LP: #747520
  * mm: fix possible cause of a page_mapped BUG
    - LP: #747520
  * powerpc/kexec: Fix orphaned offline CPUs across kexec
    - LP: #747520
  * netfilter: nf_log: avoid oops in (un)bind with invalid nfproto values
    - LP: #747520
  * nfsd: wrong index used in inner loop
    - LP: #747520
  * r8169: use RxFIFO overflow workaround for 8168c chipset.
    - LP: #747520
  * net: don't allow CAP_NET_ADMIN to load non-netdev kernel modules
    - LP: #747520
  * ip6ip6-autoload-ip6-tunnel
    - LP: #747520
  * hwmon/f71882fg: Set platform drvdata to NULL later
    - LP: #747520
  * mtd: add "platform:" prefix for platform modalias
    - LP: #747520
  * libata: no special completion processing for EH commands
    - LP: #747520
  * MIPS: MTX-1: Make au1000_eth probe all PHY addresses
    - LP: #747520
  * x86/mm: Handle mm_fault_error() in kernel space
    - LP: #747520
  * ftrace: Fix memory leak with function graph and cpu hotplug
    - LP: #747520
  * x86: Fix panic when handling "mem={invalid}" param
    - LP: #553464, #747520
  * x86: Emit "mem=nopentium ignored" warning when not supported
    - LP: #553464, #747520
  * ahci: AHCI and RAID mode SATA patch for Intel Patsburg DeviceIDs
    - LP: #747520
  * ahci: AHCI mode SATA patch for Intel DH89xxCC DeviceIDs
    - LP: #747520
  * ahci: AHCI mode SATA patch for Intel Patsburg SATA RAID controller
    - LP: #747520
  * RDMA/cma: Fix crash in request handlers
    - LP: #747520
  * IB/cm: Bump reference count on cm_id before invoking callback
    - LP: #747520
  * x86, quirk: Fix SB600 revision check
    - LP: #747520
  * ath9k_hw: Fix incorrect macversion and macrev checks
    - LP: #747520
  * USB: serial/kobil_sct, fix potential tty NULL dereference
    - LP: #747520
  * USB: serial: ch341: add new id
    - LP: #747520
  * xhci: Fix cycle bit calculation during stall handling.
    - LP: #747520
  * ALSA: hda - fix digital mic selection in mixer on 92HD8X codecs
    - LP: #747520
  * PCI: add more checking to ICH region quirks
    - LP: #747520
  * PCI: do not create quirk I/O regions below PCIBIOS_MIN_IO for ICH
    - LP: #747520
  * PCI: sysfs: Fix failure path for addition of "vpd" attribute
    - LP: #747520
  * ALSA: ctxfi - Fix incorrect SPDIF status bit mask
    - LP: #747520
  * ALSA: ctxfi - Fix SPDIF status retrieval
    - LP: #747520
  * ALSA: ctxfi - Clear input settings before initialization
    - LP: #747520
  * SUNRPC: Ensure we always run the tk_callback before tk_action
    - LP: #747520
  * perf, powerpc: Handle events that raise an exception without
    overflowing
    - LP: #747520
  * ext3: Always set dx_node's fake_dirent explicitly.
    - LP: #747520
  * call_function_many: fix list delete vs add race
    - LP: #747520
  * call_function_many: add missing ordering
    - LP: #747520
  * x86: Flush TLB if PGD entry is changed in i386 PAE mode
    - LP: #747520
  * smp_call_function_many: handle concurrent clearing of mask
    - LP: #747520
  * fix per-cpu flag problem in the cpu affinity checkers
    - LP: #747520
  * i2c: Fix typo in instantiating-devices document
    - LP: #747520
  * mmc: sdio: remember new card RCA when redetecting card
    - LP: #747520
  * x86, binutils, xen: Fix another wrong size directive
    - LP: #747520
  * hwmon: (sht15) Fix integer overflow in humidity calculation
    - LP: #747520
  * aio: wake all waiters when destroying ctx
    - LP: #747520
  * shmem: let shared anonymous be nonlinear again
    - LP: #747520
  * PCI hotplug: acpiphp: set current_state to D0 in register_slot
    - LP: #747520
  * xen: set max_pfn_mapped to the last pfn mapped
    - LP: #747520
  * Prevent rt_sigqueueinfo and rt_tgsigqueueinfo from spoofing the signal
    code
    - LP: #747520
  * ext3: skip orphan cleanup on rocompat fs
    - LP: #747520
  * procfs: fix /proc/<pid>/maps heap check
    - LP: #747520
  * proc: protect mm start_code/end_code in /proc/pid/stat
    - LP: #747520
  * fbcon: Bugfix soft cursor detection in Tile Blitting
    - LP: #747520
  * nfsd41: modify the members value of nfsd4_op_flags
    - LP: #747520
  * uvcvideo: Fix uvc_fixup_video_ctrl() format search
    - LP: #747520
  * ehci-hcd: Bug fix: don't set a QH's Halt bit
    - LP: #747520
  * USB: uss720 fixup refcount position
    - LP: #747520
  * USB: cdc-acm: fix memory corruption / panic
    - LP: #747520
  * USB: cdc-acm: fix potential null-pointer dereference
    - LP: #747520
  * USB: cdc-acm: fix potential null-pointer dereference on disconnect
    - LP: #747520
  * Input: xen-kbdfront - advertise either absolute or relative coordinates
    - LP: #747520
  * SUNRPC: Never reuse the socket port after an xs_close()
    - LP: #747520
  * fs: call security_d_instantiate in d_obtain_alias V2
    - LP: #747520
  * dcdbas: force SMI to happen when expected
    - LP: #747520
  * ALSA: hda - Fix SPDIF out regression on ALC889
    - LP: #747520
  * ALSA: Fix yet another race in disconnection
    - LP: #747520
  * perf: Better fit max unprivileged mlock pages for tools needs
    - LP: #747520
  * myri10ge: fix rmmod crash
    - LP: #747520
  * cciss: fix lost command issue
    - LP: #747520
  * sound/oss/opl3: validate voice and channel indexes
    - LP: #747520
  * mac80211: initialize sta->last_rx in sta_info_alloc
    - LP: #747520
  * ses: show devices for enclosures with no page 7
    - LP: #747520
  * ses: Avoid kernel panic when lun 0 is not mapped
    - LP: #747520
  * eCryptfs: Unlock page in write_begin error path
    - LP: #747520
  * eCryptfs: ecryptfs_keyring_auth_tok_for_sig() bug fix
    - LP: #747520
  * classmate-laptop: depends on RFKILL or RFKILL=n
    - LP: #747520
  * netfilter: arpt_mangle: fix return values of checkentry
    - LP: #747520
  * Patch cab9e9848b9a8283b0504a2d7c435a9f5ba026de to the 2.6.35.y stable
    tree
    - LP: #747520
  * revert misc: uss720.c: add another vendor/product ID
    - LP: #747520
  * cfg80211: fix can_beacon_sec_chan, reenable HT40
    - LP: #747520
  * USB: isp1760: Implement solution for erratum 2
    - LP: #747520
  * xhci: Update internal dequeue pointers after stalls.
    - LP: #747520
  * perf: Fix tear-down of inherited group events
    - LP: #747520
  * net: Fix ip link add netns oops
    - LP: #747520
  * hwmon: (w83627ehf) Driver cleanup
    - LP: #747520
  * md: Fix - again - partition detection when array becomes active
    - LP: #747520
  * iwl3945: remove plcp check
    - LP: #747520
  * KVM: enlarge number of possible CPUID leaves
    - LP: #747520
  * KVM: i8259: initialize isr_ack
    - LP: #747520
  * KVM: VMX: Fix host userspace gsbase corruption
    - LP: #747520
  * Relax si_code check in rt_sigqueueinfo and rt_tgsigqueueinfo
    - LP: #747520
  * Release 2.6.35.12
    - LP: #747520
  * drm/radeon/kms: check AA resolve registers on r300, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * drm/radeon: fix regression with AA resolve checking, CVE-2011-1016
    - LP: #745686
    - CVE-2011-1016
  * staging: hv: fix netvsc sleeping while atomic
    - LP: #752064
  * staging: hv: Fix the WARN_ON condition in free_net_device()
    - LP: #752064
  * Yama: fix default relationship to check thread group
    - LP: #737676
 -- Brad Figg <brad.figg@canonical.com>   Tue, 07 Jun 2011 08:42:13 -0700

Changed in linux (Ubuntu Maverick):
status:	In Progress → Fix Released

Revision history for this message

Davim (davim) wrote on 2011-11-10:

#51

This problem returned in 2.6.32-35....

Revision history for this message

Stefan Bader (smb) wrote on 2011-11-10:

#52

Likely the regression reported in bug #881542. Trying to get it fixed in the next update.

Revision history for this message

David Ehle (ehle-p) wrote on 2011-11-13:

#53

I'm also seeing this with the 2.6.32-35 kernel. When I try to live migrate a system with that kernel, it hangs in some fashion. Console is black screen. When I try to ssh in, it responds but cannot finish auth process.
2.6.32-34 had a kernel panic when I migrated.

Installing linux-image-server-lts-backport-maverick package to get 2.6.35.30.38 seems to let the VMs migrate without crashing.

Xen environment is XenServer 5.6SP2

Revision history for this message

jagudo (jagudo) wrote on 2011-12-02:

#54

Same problem for Lucid (10.04) in today update: 2.6.32-36-generic

Revision history for this message

Davim (davim) wrote on 2011-12-23:

#55

I've just tested 2.6.32-37-generic-pae on Lucid and the problem is solved.

Please make sure this problem stays solved, It has been a pain having this problem coming back on almost every update...

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2012-01-02:

#56

Hmmm... Now it doesn't crash. But I lose network access and I get a stack trace

[7744382.224271] eth0: no IPv6 routers present
[7744715.716789] PM: suspend of devices complete after 0.119 msecs
[7744715.716794] suspending xenstore...
[7744715.716825] PM: late suspend of devices complete after 0.026 msecs
[7746693.008996] trying to map vcpu_info 0 at ffff8800034ab020, mfn 18aa756, offset 32
[7746693.009004] cpu 0 using vcpu_info at ffff8800034ab020
[7746693.009359] PM: early resume of devices complete after 0.034 msecs
[7746693.016403] PM: resume of devices complete after 5.629 msecs
[7746827.319483] INFO: task xenwatch:11 blocked for more than 120 seconds.
[7746827.319498] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[7746827.319508] xenwatch D ffff8800034b5f00 0 11 2 0x00000000
[7746827.319516] ffff88001fd9bbf0 0000000000000246 0000000000015f00 0000000000015f00
[7746827.319524] ffff88001fda03d0 ffff88001fd9bfd8 0000000000015f00 ffff88001fda0000
[7746827.319531] 0000000000015f00 ffff88001fd9bfd8 0000000000015f00 ffff88001fda03d0
[7746827.319539] Call Trace:
[7746827.319551] [<ffffffff81559aed>] schedule_timeout+0x22d/0x300
[7746827.319559] [<ffffffff81062549>] ? __enqueue_rt_entity+0x129/0x240
[7746827.319565] [<ffffffff81558c66>] wait_for_common+0xd6/0x180
[7746827.319577] [<ffffffff8105deb0>] ? default_wake_function+0x0/0x20
[7746827.319583] [<ffffffff81558dcd>] wait_for_completion+0x1d/0x20
[7746827.319589] [<ffffffff81085e1b>] kthread_stop+0x4b/0xd0
[7746827.319595] [<ffffffff8108914f>] ? hrtimer_force_reprogram+0x7f/0x90
[7746827.319600] [<ffffffff81081aae>] cleanup_workqueue_thread+0x3e/0x80
[7746827.319605] [<ffffffff81081c93>] destroy_workqueue+0x93/0xe0
[7746827.319612] [<ffffffff810b7d44>] stop_machine_destroy+0x34/0x50
[7746827.319619] [<ffffffff8132431f>] do_suspend+0xaf/0x120
[7746827.319623] [<ffffffff81324499>] shutdown_handler+0x109/0x160
[7746827.319628] [<ffffffff81325812>] xenwatch_thread+0xc2/0x190
[7746827.319634] [<ffffffff81086140>] ? autoremove_wake_function+0x0/0x40
[7746827.319639] [<ffffffff81325750>] ? xenwatch_thread+0x0/0x190
[7746827.319644] [<ffffffff81085dc6>] kthread+0x96/0xa0
[7746827.319650] [<ffffffff810141aa>] child_rip+0xa/0x20
[7746827.319658] [<ffffffff81013391>] ? int_ret_from_sys_call+0x7/0x1b
[7746827.319663] [<ffffffff81013b1d>] ? retint_restore_args+0x5/0x6
[7746827.319668] [<ffffffff810141a0>] ? child_rip+0x0/0x20

Linux x 2.6.32-37-server #81-Ubuntu SMP Fri Dec 2 20:49:12 UTC 2011 x86_64 GNU/Linux @ XenServer 5.6 SP2

Gonna test on XS 6.0

Hmmm... Now it doesn't crash. But I lose network access and I get a stack trace

[7744382.224271] eth0: no IPv6 routers present
[7744715.716789] PM: suspend of devices complete after 0.119 msecs
[7744715.716794] suspending xenstore...
[7744715.716825] PM: late suspend of devices complete after 0.026 msecs
[7746693.008996] trying to map vcpu_info 0 at ffff8800034ab020, mfn 18aa756, offset 32
[7746693.009004] cpu 0 using vcpu_info at ffff8800034ab020
[7746693.009359] PM: early resume of devices complete after 0.034 msecs
[7746693.016403] PM: resume of devices complete after 5.629 msecs
[7746827.319483] INFO: task xenwatch:11 blocked for more than 120 seconds.
[7746827.319498] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
[7746827.319508] xenwatch      D ffff8800034b5f00     0    11      2 0x00000000
[7746827.319516]  ffff88001fd9bbf0 0000000000000246 0000000000015f00 0000000000015f00
[7746827.319524]  ffff88001fda03d0 ffff88001fd9bfd8 0000000000015f00 ffff88001fda0000
[7746827.319531]  0000000000015f00 ffff88001fd9bfd8 0000000000015f00 ffff88001fda03d0
[7746827.319539] Call Trace:
[7746827.319551]  [<ffffffff81559aed>] schedule_timeout+0x22d/0x300
[7746827.319559]  [<ffffffff81062549>] ? __enqueue_rt_entity+0x129/0x240
[7746827.319565]  [<ffffffff81558c66>] wait_for_common+0xd6/0x180
[7746827.319577]  [<ffffffff8105deb0>] ? default_wake_function+0x0/0x20
[7746827.319583]  [<ffffffff81558dcd>] wait_for_completion+0x1d/0x20
[7746827.319589]  [<ffffffff81085e1b>] kthread_stop+0x4b/0xd0
[7746827.319595]  [<ffffffff8108914f>] ? hrtimer_force_reprogram+0x7f/0x90
[7746827.319600]  [<ffffffff81081aae>] cleanup_workqueue_thread+0x3e/0x80
[7746827.319605]  [<ffffffff81081c93>] destroy_workqueue+0x93/0xe0
[7746827.319612]  [<ffffffff810b7d44>] stop_machine_destroy+0x34/0x50
[7746827.319619]  [<ffffffff8132431f>] do_suspend+0xaf/0x120
[7746827.319623]  [<ffffffff81324499>] shutdown_handler+0x109/0x160
[7746827.319628]  [<ffffffff81325812>] xenwatch_thread+0xc2/0x190
[7746827.319634]  [<ffffffff81086140>] ? autoremove_wake_function+0x0/0x40
[7746827.319639]  [<ffffffff81325750>] ? xenwatch_thread+0x0/0x190
[7746827.319644]  [<ffffffff81085dc6>] kthread+0x96/0xa0
[7746827.319650]  [<ffffffff810141aa>] child_rip+0xa/0x20
[7746827.319658]  [<ffffffff81013391>] ? int_ret_from_sys_call+0x7/0x1b
[7746827.319663]  [<ffffffff81013b1d>] ? retint_restore_args+0x5/0x6
[7746827.319668]  [<ffffffff810141a0>] ? child_rip+0x0/0x20

Linux x 2.6.32-37-server #81-Ubuntu SMP Fri Dec 2 20:49:12 UTC 2011 x86_64 GNU/Linux @ XenServer 5.6 SP2

Gonna test on XS 6.0

Revision history for this message

Stefan Bader (smb) wrote on 2012-01-03:

#57

Hm, that stack trace looks a bit like that task somehow got starved off on its way into suspend. There has been regression reports (unfortunately after testing in proposed) on real hardware which was tracked to

commit f0cf1db8f15e8f95f5085f191313694cb623a558
Author: Thomas Gleixner <email address hidden>
Date: Fri Dec 2 16:02:45 2011 +0100

    clockevents: Set noop handler in clockevents_exchange_device()
    BugLink: http://bugs.launchpad.net/bugs/902317
    commit de28f25e8244c7353abed8de0c7792f5f883588c upstream.

That has been reverted now upstream and it will come down via stable. Luiz, does that happen to you all the time or at least often enough to verify with a kernel that has that patch reverted (which I would provide)?

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2012-01-03:

#58

Yep, all the time.

Sure Stefan, give me the kernel and I'll test it.

Cheers.

Revision history for this message

Stefan Bader (smb) wrote on 2012-01-03:

#59

Seems I got confused on the kernel versions this morning. The patch I was suspecting actually was not present in 2.6.32-37.81. The code with it did not yet get moved into updates. So clearly not the problem here. That will make it take a bit longer until I may post a kernel to try... :(

Revision history for this message

Luiz Ozaki (luiz-ozaki) wrote on 2012-01-03:

#60

Revert this makes any sense ?

http://patchwork.ozlabs.org/patch/129004/

I saw the kernel source it seems still applied in this release, but I'm not sure if thats causing the problem.

btw in XS 6.0 works fine, problem only occurs in 5.6

Revision history for this message

Stefan Bader (smb) wrote on 2012-01-04:

#61

Tentiatively I would say no (makes not sense). The story there is that for 2.6.35 the code was at a point where save/restore seems to work. Upstream changed after that to use functionality introduced into the generic interrupt handling code. But that had the problem of not reactivating interrupts early enough. This has now been fixed in a backport to 2.6.32, but 2.6.35 dropped off being really being cared upstream. So there it will make sense to revert (or not apply the change) but for .32 not so much.

Beside, you say XS 6.0 works. So there seems to be a dependency at least to the host code. I am not that familiar with various XS versions. Maybe you could check what Xen version the guest reports in dmesg (for 5.6 and 6.0)?

Ubuntu
linux package

Ubuntu Crashes/Freeze on XenMotion

Bug Description

Related branches

CVE References

Other bug subscribers

Patches

Remote bug watches

	Status	Importance	Assigned to
linux (Ubuntu)	Fix Released	Medium	Unassigned
Lucid	Fix Released	Medium	Stefan Bader
Maverick	Fix Released	Medium	Stefan Bader

Ubuntulinux package

Ubuntu Crashes/Freeze on XenMotion

Bug Description

Related branches

CVE References

Other bug subscribers

Patches

Remote bug watches

Ubuntu
linux package