VM is suspended after live migrate in Karmic

Bug #448674 reported by EAB
This bug affects 8 people
Affects: libvirt (Ubuntu)
Status: Fix Released
Importance: Medium
Assigned to: Unassigned

Bug Description

Ubuntu Karmic 9.10
libvirt-bin 0.7.0-1ubuntu10
qemu-kvm 0.11.0-0ubuntu1
2.6.31-13-server
VM running Ubuntu Jaunty 9.04

On hostA:
virsh migrate fqdn.com qemu+ssh://hostb.fqdn.com/system
Migration completed in about 8 seconds.

Virsh tells me the VM is running:
virsh list | grep fqdn.com
Connecting to uri: qemu:///system
  1 fqdn.com running

The VM seems to be frozen after migration on hostB.
After executing the following on hostB, the VM works fine:
virsh suspend fqdn.com
virsh resume fqdn.com

It's expected behavior that the VM is suspended before migration, but it should be resumed automatically when the migration is completed.
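Until that resume happens automatically, the workaround can be scripted. A minimal sketch (hedged and untested; it simply chains the commands from this report, so adjust the domain name and target URI to your setup):

#!/bin/sh
# Hypothetical wrapper: migrate, then suspend/resume on the target
# host to un-freeze the guest, as described above.
DOMAIN=fqdn.com
TARGET=qemu+ssh://hostb.fqdn.com/system

virsh migrate "$DOMAIN" "$TARGET" && \
virsh --connect "$TARGET" suspend "$DOMAIN" && \
virsh --connect "$TARGET" resume "$DOMAIN"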

Revision history for this message
Chuck Short (zulcss) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. Please answer these questions:
1. Is this reproducible?
2. If so, what specific steps should we take to recreate this bug? Be as detailed as possible.
This will help us to find and resolve the problem.

Changed in libvirt (Ubuntu):
importance: Undecided → Low
status: New → Incomplete
Revision history for this message
EAB (erwin-true) wrote :

Hosts:
CPU: Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz
RAM: 2GB
Disk: Gbit NFS-mount on NetApp FAS3040 (/etc/libvirt/qemu)
10.0.40.100:/vol/hl/disk_images /etc/libvirt/qemu/disks nfs rsize=32768,wsize=32768,hard,intr,tcp,timeo=600,rw 0 0

Installed both hosts with Ubuntu Jaunty 9.04.
aptitude install libvirt-bin qemu kvm host sysstat iptraf iptables portmap nfs-common realpath bridge-utils vlan ubuntu-virt-server python-vm-builder whois postfix hdparm

After some testing with migration (all failed because of several errors/bugs) I upgraded to Ubuntu Karmic 9.10 Beta.

cat /etc/network/interfaces:
auto lo
iface lo inet loopback

auto eth1
iface eth1 inet manual
        up ifconfig eth1 0.0.0.0 up
        up ip link set eth1 promisc on

auto eth1.1503
iface eth1.1503 inet manual
        up ifconfig eth1.1503 0.0.0.0 up
        up ip link set eth1.1503 promisc on

auto br_extern
iface br_extern inet static
        address 123.123.32.252 # HOSTA
        address 123.123.32.253 # HOSTB
        network 123.123.32.0
        netmask 255.255.252.0
        broadcast 123.123.35.255
        gateway 123.123.32.1
        bridge_ports eth1.1503
        bridge_stp off

/etc/resolv.conf is correct
/etc/hosts is correct
Hostnames are correct and resolvable
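For reference, name resolution can be double-checked on each host (a hedged example using the `host` utility installed above; the hostname is the one from this report):

getent hosts hostb.fqdn.com
host hostb.fqdn.com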

VM running Ubuntu Jaunty 9.04:
fqdn.com.xml:
<?xml version="1.0"?>
<domain type="kvm">
  <name>fqdn.com</name>
  <uuid>70a1c1f2-9a3e-4ee5-9f95-69e7e2682e15</uuid>
  <memory>1048576</memory>
  <currentMemory>1048576</currentMemory>
  <vcpu>1</vcpu>
  <features>
    <acpi/>
    <apic/>
    <pae/>
  </features>
  <os>
    <type>hvm</type>
    <boot dev="cdrom"/>
    <boot dev="hd"/>
  </os>
  <clock offset="utc"/>
  <on_poweroff>destroy</on_poweroff>
  <on_reboot>restart</on_reboot>
  <on_crash>restart</on_crash>
  <devices>
    <emulator>/usr/bin/kvm</emulator>
    <disk type="file" device="disk">
      <source file="/etc/libvirt/qemu/disks/1378/fqdn.com/disk0.qcow2"/>
      <target dev="hda" bus="ide"/>
      <driver cache="writethrough"/>
    </disk>
    <interface type="bridge">
      <mac address="56:16:43:76:ab:09"/>
      <source bridge="br_extern"/>
    </interface>
    <disk type="file" device="cdrom">
      <target dev="hdc" bus="ide"/>
      <readonly/>
    </disk>
    <input type="mouse" bus="ps2"/>
    <graphics type="vnc" port="-1" listen="127.0.0.1"/>
  </devices>
</domain>

Define instance:
/usr/bin/virsh define /etc/libvirt/qemu/xml/1378/fqdn.com.xml

Start instance:
/usr/bin/virsh start fqdn.com

ps auxf | grep kvm:
/usr/bin/kvm -S -M pc-0.11 -m 1024 -smp 1 -name fqdn.com -uuid 70a1c1f2-9a3e-4ee5-9f95-69e7e2682e15 -monitor unix:/var/run/libvirt/qemu/fqdn.com.monitor,server,nowait -boot dc -drive file=/etc/libvirt/qemu/disks/1378/fqdn.com/disk0.qcow2,if=ide,index=0,boot=on -drive file=,if=ide,media=cdrom,index=2 -net nic,macaddr=56:16:43:76:ab:09,vlan=0,name=nic.0 -net tap,fd=17,vlan=0,name=tap.0 -serial none -parallel none -usb -vnc 127.0.0.1:0 -vga cirrus

Migrate instance:
/usr/bin/virsh migrate fqdn.com qemu+ssh://hostb.fqdn.com/system

Migration completes, but the instance seems to be suspended.
On hostB, to resume the instance:
/usr/bin/virsh...

Revision history for this message
Dmitry Ljautov (dljautov) wrote :

I have reproduced the bug.
I have two hosts, "asus" and "kvm", with Karmic as the host OS (everything is OK on Jaunty).
# uname -a
Linux kvm 2.6.31-14-generic #48-Ubuntu SMP Fri Oct 16 14:05:01 UTC 2009 x86_64 GNU/Linux

There's no problem with DNS: "asus" and "kvm" resolve correctly on both hosts.
Both hosts have (a verification sketch follows this list):
1.
listen_tls = 0
listen_tcp = 1
auth_tcp = "none"
in /etc/libvirt/libvirtd.conf
2.
libvirtd_opts="-d -l"
in /etc/default/libvirt-bin
3.
AppArmor turned off with the command `sudo invoke-rc.d apparmor stop`
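After changing those two files, libvirtd needs a restart for the TCP listener to take effect. A minimal verification sketch (hedged; it assumes the Karmic init script is named libvirt-bin and reuses the hostnames from this comment):

sudo invoke-rc.d libvirt-bin restart
# the daemon should now answer over plain TCP:
virsh --connect qemu+tcp://kvm/system list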

I have a fresh install of XP as the guest (also tried Windows 2008 x64, with the same results).

# virsh --connect=qemu+tcp://kvm/system list
Connecting to uri: qemu+tcp://kvm/system
 Id Name State
----------------------------------
  5 xp running

It answers pings (an RDP session works too), and of course it works through VNC.

When I try to migrate it:
# virsh --connect=qemu+tcp://kvm/system migrate --live xp qemu+tcp://asus/system

I get the following in /var/log/syslog and /var/log/libvirt/qemu/xp.log (time on both hosts is synchronized):

Oct 29 12:31:39 asus kernel: [ 7868.432787] device vnet0 entered promiscuous mode
Oct 29 12:31:39 asus kernel: [ 7868.434144] breth0: port 2(vnet0) entering learning state

==> /var/log/libvirt/qemu/xp.log <==
LC_ALL=C LD_LIBRARY_PATH=/usr/local/lib PATH=/usr/local/sbin:/usr/local/bin:/sbin:/bin:/usr/sbin:/usr/bin HOME=/root USER=root LOGNAME=root /usr/bin/kvm -S -M pc-0.11 -m 512 -smp 1 -name xp -uuid 02f32130-6933-6594-544e-7b12fa1bbd34 -monitor unix:/var/run/libvirt/qemu/xp.monitor,server,nowait -boot c -drive file=/mnt/nfs/images/xp.img,if=ide,index=0,boot=on -drive file=/mnt/nfs/iso/R10.iso,if=ide,media=cdrom,index=2 -net nic,macaddr=54:52:00:17:57:79,vlan=0,name=nic.0 -net tap,fd=18,vlan=0,name=tap.0 -serial pty -parallel none -usb -usbdevice tablet -vnc 0.0.0.0:0 -k en-us -vga cirrus -incoming tcp:0.0.0.0:49154
char device redirected to /dev/pts/0

==> /var/log/syslog <==
Oct 29 12:31:48 asus kernel: [ 7877.430637] breth0: port 2(vnet0) entering forwarding state
Oct 29 12:31:49 asus kernel: [ 7878.472528] vnet0: no IPv6 routers present

==> /var/log/syslog <==
Oct 29 12:33:06 kvm kernel: [ 4912.152966] breth0: port 2(vnet0) entering disabled state
Oct 29 12:33:06 kvm kernel: [ 4912.192109] device vnet0 left promiscuous mode
Oct 29 12:33:06 kvm kernel: [ 4912.192112] breth0: port 2(vnet0) entering disabled state

Just after migration, the XP guest hangs (no response to keyboard or mouse in the VNC console) and `ping xp` gets no reply anymore.

# virsh --connect=qemu+tcp://kvm/system list
Connecting to uri: qemu+tcp://kvm/system
 Id Name State
----------------------------------

# virsh --connect=qemu+tcp://asus/system list
Connecting to uri: qemu+tcp://asus/system
 Id Name State
----------------------------------
  2 xp running

But if we do:

# virsh --connect=qemu+tcp://asus/system suspend xp
Connecting to uri: qemu+tcp://asus/system
Domain xp suspended

# virsh --connect=qemu+tcp://asus/system resume xp
Connecting to uri: qemu+tcp://asus/system
Domain xp resumed

XP comes alive in VNC and starts answering ICMP requests again (or RDP sessions continue working -- no matte...

Chuck Short (zulcss)
Changed in libvirt (Ubuntu):
importance: Low → Medium
status: Incomplete → Confirmed
Revision history for this message
EAB (erwin-true) wrote :

Seems to be a known issue and patches are available:
https://www.redhat.com/archives/libvir-list/2009-October/msg00019.html

Revision history for this message
Dmitry Ljautov (dljautov) wrote :

By the way, `virsh save` is _very slow_ on Karmic (~1 MB of RAM per second).
Is it the same bug or not?

Revision history for this message
Tessa (unit3) wrote :

I'm seeing behaviour that looks like this on karmic/amd64, only a suspend/resume doesn't "fix" the VM. It stays hard locked. I can connect to it on the virtual serial console or via VNC, and it shows what was there before the migration, but it never unlocks and starts working.

Revision history for this message
Dmitry Ljautov (dljautov) wrote :

What guest OS are you running?

I've just rolled back my host OSes from Karmic to Jaunty, and found that migration works with Windows XP/2003 guests but fails for Ubuntu and CentOS guests (the guests hang after migration). I'll try to reproduce it with the same guests on Karmic later...

Revision history for this message
EAB (erwin-true) wrote :

I tested migrations on Karmic with guest OSes Ubuntu Hardy, Ubuntu Jaunty, and Ubuntu Karmic.
The guests hang, and suspend+resume fixes this.

Revision history for this message
Tessa (unit3) wrote :

This was with a hardy/amd64 guest OS. I haven't tried any other guests, because the bulk of our VMs are supposed to be LTS installs.

Revision history for this message
Jordan Desroches (jordan-d-desroches) wrote :

Some host and guest updates ago, suspend/resume worked on my Karmic 64 hosts with a variety of guests, including Windows 2008 R2, Windows XP, and various Ubuntu releases. Now, when I try to suspend and resume, the machine reboots upon resume instead of resuming.

virsh # version
Compiled against library: libvir 0.7.0
Using library: libvir 0.7.0
Using API: QEMU 0.7.0
Running hypervisor: QEMU 0.11.0

$ uname -a
Linux kvm1 2.6.31-15-server #50-Ubuntu SMP Tue Nov 10 15:50:36 UTC 2009 x86_64 GNU/Linux

Revision history for this message
Dmitry Ljautov (dljautov) wrote :

I've just tested live migration (as I wrote above) with Karmic hosts and a Karmic guest. The guest still hangs after migration, but virsh suspend + virsh resume on the destination host lets the guest continue working. The bug is still reproducible on Karmic...

# uname -a
Linux kvm 2.6.31-16-generic #53-Ubuntu SMP Tue Dec 8 04:02:15 UTC 2009 x86_64 GNU/Linux
# virsh version
Connecting to uri: qemu:///system
Compiled against library: libvir 0.7.0
Using library: libvir 0.7.0
Using API: QEMU 0.7.0
Running hypervisor: QEMU 0.11.0

Revision history for this message
Mark Burgo (burgo-mark) wrote :

Is there any status update on this?

Will Launchpad be updating the libvirt packages with the patches described above? Or have they been released in a different repo?

Revision history for this message
Tessa (unit3) wrote :

OK, updated to the latest libvirt packages from PPA:dnjl/virtualization. Now the migrate-then-suspend/resume suggestion works for me. Still less than ideal, since it kills the whole "live migration" thing, but at least it doesn't totally kill my VMs anymore.
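For anyone wanting to try the same packages, adding that PPA on Karmic looks roughly like this (a sketch, assuming add-apt-repository from python-software-properties is installed; the PPA name is the one mentioned above):

sudo add-apt-repository ppa:dnjl/virtualization
sudo apt-get update
sudo apt-get install libvirt-bin libvirt0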

Revision history for this message
EAB (erwin-true) wrote :

Finished some new tests.

The test is pretty much the same as the bug description and comment 2 (https://bugs.launchpad.net/ubuntu/+source/libvirt/+bug/448674/comments/2), only hostB is Lucid.

Brought HostA up-to-date:
Ubuntu Karmic 9.10
libvirt-bin 0.7.0-1ubuntu13/.1
qemu-kvm 0.11.0-0ubuntu6.3
2.6.31-16-server

Upgraded HostB to:
Ubuntu Lucid 10.04 (development branch)
libvirt-bin 0.7.2-4ubuntu5
qemu-kvm 0.11.0-0ubuntu6.3
2.6.32-10-server

VM running Ubuntu Jaunty 9.04

- Karmic -> Lucid : Migration works without suspend/resume workaround.

- Lucid -> Lucid : Migration works without suspend/resume workaround.

For fun:
- Lucid -> Karmic (i.e. back again): Migration works, but the suspend/resume workaround is needed. The instance is migrated, but all partitions are gone, so I/O errors appear and everything crashes ;)

Revision history for this message
Tessa (unit3) wrote :

Here's an interesting addition to this problem:

In my cluster, one system is based on a Core 2 Duo CPU, while the other is based on an i7 920 CPU. When the VM is originally started on the i7, migration doesn't work. When it's originally started on the Core 2, it does. From this, I'm guessing that the VM uses CPU features only available on the newer CPU when starting, and when it then migrates to the older CPU some of those instructions fail, which causes the lockups I've been seeing.

I imagine this is also why this hasn't been reported a ton: people with matched systems wouldn't see the problem.

Can anyone else still experiencing this confirm or deny mismatched CPUs?
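If the mismatched-CPU theory holds, one possible mitigation is to pin the guest to the older CPU model so it never sees the i7-only features. Note this is an assumption: the <cpu> element requires a newer libvirt (0.7.5+) than the 0.7.0 shipped in Karmic. A sketch for the domain XML:

<cpu match="exact">
  <!-- restrict guest-visible CPU features to the core2duo baseline -->
  <model>core2duo</model>
</cpu>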

Revision history for this message
Jordan Desroches (jordan-d-desroches) wrote :

For better or worse, I've been having this problem across four identical machines, each with dual quad-core processors:

$ cat /proc/cpuinfo
processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 23
model name : Intel(R) Xeon(R) CPU E5440 @ 2.83GHz
stepping : 6
cpu MHz : 2000.000
cache size : 6144 KB
physical id : 0
siblings : 4
core id : 0
cpu cores : 4
apicid : 0
initial apicid : 0
fpu : yes
fpu_exception : yes
cpuid level : 10
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 lahf_lm tpr_shadow vnmi flexpriority
bogomips : 5653.12
clflush size : 64
cache_alignment : 64
address sizes : 38 bits physical, 48 bits virtual
power management:

Revision history for this message
EAB (erwin-true) wrote :

Migrating between these CPU types:
Testserver01: 2 X Intel(R) Core(TM)2 CPU 6300 @ 1.86GHz
Productionserver01: 16 X Intel(R) Xeon(R) CPU X5570 @ 2.93GHz
Works for me with:
Karmic -> Lucid
Lucid -> Lucid

This was on 2010-01-13.
Now migration from Karmic to Lucid fails. There have been a lot of KVM/QEMU updates on Lucid in the last few days.

Revision history for this message
EAB (erwin-true) wrote :

Migrating from Karmic -> Karmic seems to have been working for some time now.
This bug can be closed.

Revision history for this message
Mark Burgo (burgo-mark) wrote :

EAB --> Migrating from Karmic -> Karmic seems to have been working for some time now.
               This bug can be closed.

Can you tell me where the updated libvirt packages are that make this work, as it is still broken on my servers?

2x Dell PE 805, dual 6-core AMD Opterons
      64 GB RAM
      SAN-attached

Also, when was this fixed?

Forgot to add: libvirt is 0.7.0-1ubuntu13.1.

Revision history for this message
EAB (erwin-true) wrote :

Ah, my bad.
It is indeed not working without the suspend/resume workaround.
I used a bash script that contained the suspend/resume workaround and was not aware of that.

Revision history for this message
Dustin Kirkland  (kirkland) wrote :

Based on comment #14, this appears to be fixed in Lucid. Please reopen if you're experiencing this problem there.

Revision history for this message
Mark Burgo (burgo-mark) wrote :

Sorry, but I disagree.

  Lucid is only in beta; Karmic needs to be fixed! Lucid will not be released for another 30 days. But since Lucid will be released before a fix for Karmic is complete, I guess all of us will need to wait.

Revision history for this message
Dustin Kirkland  (kirkland) wrote : Re: [Bug 448674] Re: VM is suspended after live migrate in Karmic

Mark-

Can you reproduce this problem in Lucid?

Lucid is still open for development, whereas Karmic is not.

If you want to ensure that this gets fixed in Ubuntu, it would be best
to test it in Lucid.

Given the nature of the bug, I don't think it meets the SRU
requirements defined in:
 * https://wiki.ubuntu.com/StableReleaseUpdates

Revision history for this message
EAB (erwin-true) wrote :

In Karmic there is a workaround.
In Lucid this problem is not reproducible.

I also think it doesn't meet the SRU requirements.

Six months ago, when I reported this bug, I hoped it would be fixed within a couple of weeks; now I would rather wait a month and keep testing all the needed features in Lucid (as I have been doing for some months already).
The migrate feature in Karmic works 9 times out of 10 with the workaround (the failures come with random errors).
In Lucid I have migrated 6 VMs hundreds of times without failures, so KVM/QEMU/libvirt is much more stable in Lucid.

Why do I also want to upgrade to Lucid? Because Lucid is LTS and has features like KSM:
http://www.linux-kvm.com/content/using-ksm-kernel-samepage-merging-kvm

It's absolutely worth waiting now.
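As an aside, KSM on Lucid's 2.6.32 kernel is controlled through sysfs; a hedged sketch for checking and enabling it:

cat /sys/kernel/mm/ksm/run                  # 1 means KSM is enabled
echo 1 | sudo tee /sys/kernel/mm/ksm/run    # enable page merging
cat /sys/kernel/mm/ksm/pages_sharing        # pages currently being shared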

Revision history for this message
Jamie Strandboge (jdstrand) wrote :

Marking Fix Released per reporter's feedback.

Changed in libvirt (Ubuntu):
status: Confirmed → Fix Released
Revision history for this message
Mark Burgo (burgo-mark) wrote :

Well, a question then:

Karmic is to be supported for another year, correct?

This is a bug in Karmic, correct?

The workaround is to suspend and then resume the VM, which removes live migration from the mix if I have to do the suspend and resume. Why not just backport libvirt 0.7.2 from the working Lucid version to Karmic, so that live migration works as advertised?

Lucid is still at Beta 1; while I will try it for you, it is not usable in production, because too many of the Pacemaker utilities are not in the Lucid release at this time, requiring PPAs for fully functional systems.

Whether the bug meets the SRU requirements is not the question. The question is that we have a bug for which Red Hat released patches months ago, and it has not been fixed in an Ubuntu release that is to be supported for another year. This needs to be fixed, as not everyone will install Lucid the day it is released. I enjoy working with Ubuntu, and this is a problem that should be corrected on the Karmic platform before the release of Lucid.

Unless you have a full workaround that does not require someone connecting and suspending/resuming the migrated VM, it is not a valid workaround; we need the VM to migrate without human intervention.

Thank you

P.S. Lucid Beta 1 is being installed as we speak on a set of test boxes to attempt the test. However, as I stated above, not all of the pacemaker+openais/heartbeat packages are in the beta release, so it is not even a valid environment to run anything on at this time.

Revision history for this message
EAB (erwin-true) wrote :

Mark, I fully agree.
It should be fixed in Karmic.

I'm not going to use Lucid in production for the next 3-4 months.
It has to prove itself stable first.

Is it so hard to fix this bug? Probably it's just not high on the list to be fixed.

Revision history for this message
Jamie Strandboge (jdstrand) wrote :

Mark, EAB,

Dustin already commented on the status of this bug for Karmic and believes it does not qualify for an SRU. Please feel free to read https://wiki.ubuntu.com/StableReleaseUpdates and, if you would like, submit a debdiff and open a task against Karmic.

Revision history for this message
Mark Burgo (burgo-mark) wrote :

Jamie,

       This bug was originally filed on 2009-10-13; Karmic was released on 2009-10-29. This should have been fixed.

       It was confirmed on 2009-10-20 and again on 2009-10-29, then marked as medium importance and confirmed.

        I understand that you don't want to fix it, so I will now wait a month and see if it works correctly in Lucid. Remember that the change from Alpha 2 to Alpha 3 broke it again. Now Beta 1 breaks pacemaker+openais/heartbeat; I hope everything is included in Beta 2 and the RC when they come out.

The status of this right now is that it is broken and will not be fixed. I will get the current version of libvirt and build my own packages from source, as I had to do 5 years ago.

Thank you (hope Lucid is operational, or we will need to move on)

Revision history for this message
Jamie Strandboge (jdstrand) wrote :

Mark, I understand your frustration; however, keep in mind that the final week before release will only take the highest-impact bug fixes, to reduce the chance of regressions in the final release. Bugs are deferred to '-updates' all the time in the week(s) prior to release for exactly this reason.

Also note that the bug is fixed in Lucid (or should be), so there was progress on this bug. The question is whether to fix a previous release, which was discussed before. Rather than hoping Lucid is fixed, I highly recommend trying Lucid out, confirming this is fixed, and reporting any other bugs you might find.

While I do not recommend running the development release on a production machine at this time, Beta 2 is next week, and the focus of development at this point in the cycle is stability and bug fixes. If there are bugs in Lucid, now is the best time to report them so we can get them fixed.

Revision history for this message
TomaszChmielewski (mangoo-wpkg) wrote :

I see this issue with 10.10. But I also see it when:

- KVM guest is saved
- KVM guest is restored

Although "virsh list" shows the guest is running, it is not. I have to suspend / resume the guest to make it run again.

It happens with ~50% of save/restores.
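For anyone trying to reproduce this, a minimal save/restore sequence with the suspend/resume workaround from this bug (a sketch; the guest name and state-file path are hypothetical):

virsh save guest /var/tmp/guest.state
virsh restore /var/tmp/guest.state
# if the guest comes back frozen, as described above:
virsh suspend guest
virsh resume guest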

Revision history for this message
frankie (frankie-etsetb) wrote :

Hi. My KVM domains didn't migrate either, until I noticed I was missing the package "kvm-pxe" on the destination server.
Now it works like a charm with Ubuntu Server 10.10 64-bit.
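If you suspect the same missing package, a quick hedged check on the destination host:

dpkg -s kvm-pxe >/dev/null 2>&1 || sudo apt-get install kvm-pxe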
