e1000 irq problems after live migration with qemu-kvm 0.12.4
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
qemu-kvm |
New
|
Undecided
|
Unassigned |
Bug Description
After live migrating ubuntu 9.10 server (2.6.31-14-server) and suse linux 10.1 (2.6.16.13-4-smp)
it happens sometimes that the guest runs into irq problems. i mention these 2 guest oss
since i have seen the error there. there are likely others around with the same problem.
on the host i run 2.6.33.3 (kernel+mod) and qemu-kvm 0.12.4.
i started a vm with:
/usr/bin/
for testing i have a clean ubuntu 9.10 server 64-bit install and created a small script with fetches a dvd iso from a local server and checking md5sum in an endless loop.
the download performance is approx. 50MB/s on that vm.
to trigger the error i did several migrations of the vm throughout the last days. finally I ended up in the following oops in the guest:
[64442.298521] irq 10: nobody cared (try booting with the "irqpoll" option)
[64442.299175] Pid: 0, comm: swapper Not tainted 2.6.31-14-server #48-Ubuntu
[64442.299179] Call Trace:
[64442.299185] <IRQ> [<ffffffff810b4
[64442.299227] [<ffffffff810b4
[64442.299232] [<ffffffff810b5
[64442.299244] [<ffffffff81014
[64442.299246] [<ffffffff81014
[64442.299249] [<ffffffff81012
[64442.299266] [<ffffffff810b3
[64442.299269] [<ffffffff810b5
[64442.299271] [<ffffffff81014
[64442.299273] [<ffffffff81014
[64442.299275] [<ffffffff81012
[64442.299290] [<ffffffff81526
[64442.299302] [<ffffffff81332
[64442.299307] [<ffffffff81339
[64442.299322] [<ffffffff8125f
[64442.299324] [<ffffffff8125f
[64442.299327] [<ffffffff81338
[64442.299336] [<ffffffff81339
[64442.299338] [<ffffffff8133a
[64442.299340] [<ffffffff8133a
[64442.299348] [<ffffffff81036
[64442.299351] [<ffffffff81332
[64442.299353] [<ffffffff8133a
[64442.299356] [<ffffffff81264
[64442.299368] [<ffffffff81065
[64442.299370] [<ffffffff81013
[64442.299372] [<ffffffff81014
[64442.299374] [<ffffffff81064
[64442.299376] [<ffffffff81014
[64442.299379] [<ffffffff81012
[64442.299380] <EOI> [<ffffffff81035
[64442.299390] [<ffffffff8101a
[64442.299395] [<ffffffff81529
[64442.299398] [<ffffffff81010
[64442.299406] [<ffffffff81512
[64442.299424] [<ffffffff81838
[64442.299427] [<ffffffff81837
[64442.299429] [<ffffffff81837
[64442.299433] handlers:
[64442.299840] [<ffffffffa0000
[64442.300046] Disabling IRQ #10
After this the guest is still allive, but download performance is down to approx. 500KB/s
This error is definetly not triggerable with option -no-kvm-irqchip. I have seen this error occasionally
since my first experiments with qemu-kvm-88 and also without hugetablefs.
Help appreciated.
I did 2 additional tests
1) Stop VM, Live Migrate, Continue -> Triggers BUG
2) Stop VM, Continue -> Does NOT trigger BUG.
My guess it seems that pending interrupts are incorrectly transferred with kernel irqchip.
As said earlier userspace irqchip does not trigger the bug.