LXD - stateful stop hanging

Bug #1642558 reported by Rick Box
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
criu (Ubuntu)
New
Undecided
Unassigned

Bug Description

Originally when performing a migration or an LXD stateful stop the system simply sits and waits around. It looks like CRIU has executed a command then nothing happens after waiting any length of time, until the command is stopped with ctrl + c.

After leaving the system overnight now just redid it to try this and it actually stopped the system but then failed to start it, so the following error is from the system failing to restart the container.

root@mist3:/var/log/lxd/repo2# lsb_release -rd
Description: Ubuntu 16.04.1 LTS
Release: 16.04

Output of “lxc info” (*)

root@mist3:~# lxc info
apiextensions: []
apistatus: stable
apiversion: "1.0"
auth: trusted
environment:
  addresses:
  - :8443
  architectures:
  - x86_64
  - i686
  certificate: |
    -----BEGIN CERTIFICATE-----
    MIIFjzCCA3egAwIBAgIRAOzxMWhdSQ6vsB1TdN78U3kwDQYJKoZIhvcNAQELBQAw
    MzEcMBoGA1UEChMTbGludXhjb250YWluZXJzLm9yZzETMBEGA1UEAwwKcm9vdEBt
    aXN0MzAeFw0xNjExMTYxNjM0MjNaFw0yNjExMTQxNjM0MjNaMDMxHDAaBgNVBAoT
    E2xpbnV4Y29udGFpbmVycy5vcmcxEzARBgNVBAMMCnJvb3RAbWlzdDMwggIiMA0G
    CSqGSIb3DQEBAQUAA4ICDwAwggIKAoICAQC5mWbzTP23LbBhlcImNJbThMLF7apy
    sd1q7xel/6Ot2WUmJYdXBK54R9KplMEY8zqjvLbGqwZopqcSc4FaIs3mg1bmpL5o
    OZVzTjMC6oENV1dGIvLJp5zKHusZNnBLAFCkGuk43O0OozC+ylEFMM8nAlkAq9tI
    fEZD8JvS0Fs5kT0t8xNg6g1Q6osqN32OPmG3+3ZRCukpxrrsv4cuxYIpWqq84s4u
    zGiQJd8wIr1ZOZTTCv3X+a8qmDFuxfyXVMg3pkujIahWAB8yaK+tYQVr/xp2iF40
    yjmB8YlcXxiUQ5FsYd9S9Lghx+flAy2XkEL/YHFkRo2hKkfTC4x3S60PqQfHPBOt
    LkjUZ4UGM8zocub2W5jznoI5rv9nfO31VpzanKj62vyZsNQCrXG1LYPKaa6jJ6Sb
    IkyBeqxjRLh9c/JMk+i9oymXyZSSWz8XHd6U129vX0i8EvNHbPScM0Q9UukHk3Ne
    lX2D95AL0FQn4lz27oRBDVmVGzpW5YIRzDLxd7noVK2YCda7Sk8LUs8xID7/fQQe
    COeAZ1H5j6lneCGugAkABjRlOQQVz6ISrZdsgWfYyo9E8G61FQ5/pZTLTT/Oq0tw
    o2MDpEJuva515/TqzACkMx352oYa7ZKT3u88jBIIhAWVR7p3H0OWYE6X7QKx4J2h
    GUKZRLVeTiEo/QIDAQABo4GdMIGaMA4GA1UdDwEB/wQEAwIFoDATBgNVHSUEDDAK
    BggrBgEFBQcDATAMBgNVHRMBAf8EAjAAMGUGA1UdEQReMFyCBW1pc3Qzgg8xNzIu
    MjkuODkuMTcvMjSCJTIwMDE6NDFjMDpjMDo4OTplYzQ6N2FmZjpmZWFhOjEyZjIv
    NjSCG2ZlODA6OmVjNDo3YWZmOmZlYWE6MTJmMi82NDANBgkqhkiG9w0BAQsFAAOC
    AgEAGmNVUxnqO598hzX6+RRmirEu0mQAxu6L+ruNGHY4fh/YCObB80TO31UESs6c
    krIAnLMfeLKdEtB5KNYAmq/Pf84YuRou/iDob5VszBmCEeGd7vXl7siWUWRbfra9
    6uDurpG5rxnyPyfVmF/w+LvpXEG/5CQV0YzAOdKX5h+w3QEPfb2q1B6X+eDXBwAZ
    zVpdo0EmNxwL7wjjPgA8uFSGX4u7W+mGuYScrbLydiQQbiNmJj6GtPP+wSvO4m8l
    O8HRGhuGMW1fNuaRrZ/z1OO7Z2zVzfvbYZqfcEWt7e7Dz+EryEhoPXu8mhRM2hNt
    FMeFgTp06xkMU6blJ520a9onbIiXSY3bkOSsRK9X+wKlU3m65+3/x3v9oYP1chf+
    Q59qSZG61iVum5bLmFxaG6SuRL6HeYMWbTyO5rSP3NUDrfdhhpppZGfE3aXl5TwS
    S7KK8K9ESbdwBYZztfHaXE3AWlbpUZtABcGSasTrPzAxUU7+89tpD2tvOEC61n6S
    ntOwlhXn3cE5ztggYoQM+9w4o9PyBFTu7n32BedfWKbDequ8fFQgkjRdULbhl9O9
    ZN9cFZ2l94+k9FK4MehD62rK4ITDvQgRs/ZUA4N3bkUgQPKrzjucJiOUyScrzOOl
    tcqu+IP15gsz/2+jaF+4UrtcBis39qlqHz79/SGnMgX5XVM=
    -----END CERTIFICATE-----
  certificatefingerprint: 97fe7f57020eea6747de7229ab2eec579dd87ec5794c850c3824cf0effca9ee0
  driver: lxc
  driverversion: 2.0.5
  kernel: Linux
  kernelarchitecture: x86_64
  kernelversion: 4.4.0-47-generic
  server: lxd
  serverpid: 16876
  serverversion: 2.0.5
  storage: zfs
  storageversion: "5"
config:
  core.https_address: :8443
  core.proxy_http: http://www-cache.rd.bbc.co.uk:8080
  core.trust_password: true
  storage.zfs_pool_name: zfs_pool/lxd
public: false

Output of “lxc info <container name>”

root@mist3:~# lxc info repo2
Name: repo2
Remote: unix:/var/lib/lxd/unix.socket
Architecture: x86_64
Created: 2016/11/17 10:56 UTC
Status: Stopped
Type: persistent
Profiles: single-interface

Output of “lxc config show –expanded <container name>”

root@mist3:~# lxc config show --expanded repo2
name: repo2
profiles:
- single-interface
config:
  environment.RSYNC_PROXY: http://www-cache.rd.bbc.co.uk:8080
  environment.http_proxy: http://www-cache.rd.bbc.co.uk:8080
  environment.https_proxy: http://www-cache.rd.bbc.co.uk:8080
  volatile.base_image: b9cba741ef621f14e6e8c306c7f87637a59d7d09c1a3ec8ef717ab0d6e43430a
  volatile.eth0.hwaddr: 52:54:00:1d:59:a0
  volatile.last_state.idmap: '[{"Isuid":true,"Isgid":false,"Hostid":427680,"Nsid":0,"Maprange":65536},{"Isuid":false,"Isgid":true,"Hostid":427680,"Nsid":0,"Maprange":65536}]'
devices:
  eth0:
    name: eth0
    nictype: bridged
    parent: br-lxd
    type: nic
  root:
    path: /
    type: disk
ephemeral: false

Output of “dmesg” (*)

See attached tarball

Output of “lxc exec <container name>”

This shows nothing due to container no longer running

Output of “uname -a” (*)

root@mist3:~# uname -a
Linux mist3 4.4.0-47-generic #68-Ubuntu SMP Wed Oct 26 19:39:52 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

The content of /var/log/lxd.log (*)

t=2016-11-17T11:00:02+0000 lvl=info msg="Shutting down container" name=ns1 action=shutdown creation date=2016-11-17T10:56:58+0000 ephemeral=false timeout=-1s
t=2016-11-17T11:00:03+0000 lvl=info msg="Shut down container" name=ns1 action=shutdown creation date=2016-11-17T10:56:58+0000 ephemeral=false timeout=-1s
t=2016-11-17T11:00:10+0000 lvl=info msg="Deleting container" name=ns1 creation date=2016-11-17T10:56:58+0000 ephemeral=false
t=2016-11-17T11:00:11+0000 lvl=info msg="Deleted container" name=ns1 creation date=2016-11-17T10:56:58+0000 ephemeral=false
t=2016-11-17T11:00:46+0000 lvl=info msg="Stopping container" ephemeral=false stateful=true name=repo2 action=stop creation date=2016-11-17T10:56:33+0000
t=2016-11-17T11:00:46+0000 lvl=info msg="Migrating container" actionscript=false stop=true name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state
t=2016-11-17T11:00:50+0000 lvl=info msg="Migrated container" stop=true name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false
t=2016-11-17T11:00:50+0000 lvl=info msg="Stopped container" ephemeral=false stateful=true name=repo2 action=stop creation date=2016-11-17T10:56:33+0000
t=2016-11-17T11:02:14+0000 lvl=info msg="Starting container" ephemeral=false stateful=true name=repo2 action=start creation date=2016-11-17T10:56:33+0000
t=2016-11-17T11:02:14+0000 lvl=info msg="Migrating container" statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false
t=2016-11-17T11:02:15+0000 lvl=info msg="Failed migrating container" name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false
t=2016-11-17T11:02:15+0000 lvl=info msg="Migrated container" name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false
t=2016-11-17T11:02:23+0000 lvl=info msg="Starting container" stateful=true name=repo2 action=start creation date=2016-11-17T10:56:33+0000 ephemeral=false
t=2016-11-17T11:02:23+0000 lvl=info msg="Migrating container" stop=false name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false
t=2016-11-17T11:02:24+0000 lvl=info msg="Failed migrating container" created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false name=repo2
t=2016-11-17T11:02:24+0000 lvl=info msg="Migrated container" name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false
t=2016-11-17T11:09:16+0000 lvl=info msg="Starting container" action=start creation date=2016-11-17T10:56:33+0000 ephemeral=false stateful=true name=repo2
t=2016-11-17T11:09:16+0000 lvl=info msg="Migrating container" actionscript=false stop=false name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state
t=2016-11-17T11:09:17+0000 lvl=info msg="Failed migrating container" name=repo2 created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false
t=2016-11-17T11:09:17+0000 lvl=info msg="Migrated container" created=2016-11-17T10:56:33+0000 ephemeral=false statedir=/var/lib/lxd/containers/repo2/state actionscript=false stop=false name=repo2

The content of /etc/default/lxd-bridge (*)

root@mist3:~# cat /etc/default/lxd-bridge
# WARNING: This file is generated by a debconf template!
# It is recommended to update it by using "dpkg-reconfigure -p medium lxd"

# Whether to setup a new bridge or use an existing one
USE_LXD_BRIDGE="false"

# Bridge name
# This is still used even if USE_LXD_BRIDGE is set to false
# set to an empty value to fully disable
LXD_BRIDGE=""

# Update the "default" LXD profile
UPDATE_PROFILE="true"

# Path to an extra dnsmasq configuration file
LXD_CONFILE=""

# DNS domain for the bridge
LXD_DOMAIN="lxd"

# IPv4
## IPv4 address (e.g. 10.0.8.1)
LXD_IPV4_ADDR=""

## IPv4 netmask (e.g. 255.255.255.0)
LXD_IPV4_NETMASK=""

## IPv4 network (e.g. 10.0.8.0/24)
LXD_IPV4_NETWORK=""

## IPv4 DHCP range (e.g. 10.0.8.2,10.0.8.254)
LXD_IPV4_DHCP_RANGE=""

## IPv4 DHCP number of hosts (e.g. 250)
LXD_IPV4_DHCP_MAX=""

## NAT IPv4 traffic
LXD_IPV4_NAT="false"

# IPv6
## IPv6 address (e.g. 2001:470:b368:4242::1)
LXD_IPV6_ADDR=""

## IPv6 CIDR mask (e.g. 64)
LXD_IPV6_MASK=""

## IPv6 network (e.g. 2001:470:b368:4242::/64)
LXD_IPV6_NETWORK=""

## NAT IPv6 traffic
LXD_IPV6_NAT="false"

# Run a minimal HTTP PROXY server
LXD_IPV6_PROXY="false"

A tarball of /var/log/lxd/<container name>/ (*) - see attachement

Revision history for this message
Rick Box (boxrick) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.