Activity log for bug #1196295

Date Who What changed Old value New value Message
2013-06-30 18:44:33 Pavel Bennett bug added bug
2013-07-01 14:39:15 Serge Hallyn lxc (Ubuntu): importance Undecided High
2013-07-01 14:39:15 Serge Hallyn lxc (Ubuntu): status New Incomplete
2013-07-02 22:06:32 Serge Hallyn bug task added linux
2013-07-03 00:59:57 Joseph Salisbury linux: importance Undecided High
2013-07-03 01:00:43 Joseph Salisbury bug task added linux (Ubuntu)
2013-07-03 01:00:57 Joseph Salisbury linux (Ubuntu): importance Undecided High
2013-07-03 01:01:29 Joseph Salisbury tags kernel-da-key raring
2013-07-03 01:30:09 Brad Figg linux: status New Incomplete
2013-07-03 01:42:15 Pavel Bennett tags kernel-da-key raring apport-collected kernel-da-key raring
2013-07-03 01:42:16 Pavel Bennett description After running and terminating around 6000 containers overnight, something happened on my box that is affecting every new LXC container I try to start. The DEBUG log file looks like: lxc-start 1372615570.399 WARN lxc_start - inherited fd 9 lxc-start 1372615570.399 INFO lxc_apparmor - aa_enabled set to 1 lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/302' (5/6) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/303' (7/8) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/304' (10/11) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/305' (12/13) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/306' (14/15) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/307' (16/17) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/308' (18/19) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/309' (20/21) lxc-start 1372615570.399 INFO lxc_conf - tty's configured lxc-start 1372615570.399 DEBUG lxc_start - sigchild handler set lxc-start 1372615570.399 INFO lxc_start - 'vm-59' is initialized lxc-start 1372615570.404 DEBUG lxc_start - Not dropping cap_sys_boot or watching utmp lxc-start 1372615570.404 INFO lxc_start - stored saved_nic #0 idx 12392 name vethP59 lxc-start 1372615570.404 INFO lxc_conf - opened /home/x/vm/vm-59.hold as fd 25 It stops there. In 'ps faux', it looks like: root 31621 0.0 0.0 25572 1272 ? D 14:06 0:00 \_ lxc-start -n vm-59 -f /tmp/tmp.fG6T6ERZpS -l DEBUG -o /home/x/lxcdebug/vm-59.txt -- /usr/sbin/dropbear -F -E -m On a successful LXC run (prior to the server getting into this state), this hangs just before: lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/' (rootfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys' (sysfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/proc' (proc) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/dev' (devtmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/dev/pts' (devpts) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/run' (tmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/' (btrfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys/fs/cgroup' (tmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys/fs/cgroup/cpuset' (cgroup) lxc-start 1372394092.208 INFO lxc_cgroup - [1] found cgroup mounted at '/sys/fs/cgroup/cpuset',opts='rw,relatime,cpuset,clone_children' lxc-start 1372394092.208 DEBUG lxc_cgroup - get_init_cgroup: found init cgroup for subsys (null) at / It looks like a resource leak, but I'm not yet sure of what that would be. If it matters, I SIGKILL my lxc-start processes instead of using lxc-stop. Could that have any negative implications? Oh, and cgroups had almost 6000 entries for VMs that are long dead (I'm guessing it's due to my SIGKILL). I've run cgclear and my /sys/fs/cgroup/*/ dirs are now totally empty, but the new containers still hang. After running and terminating around 6000 containers overnight, something happened on my box that is affecting every new LXC container I try to start. The DEBUG log file looks like: lxc-start 1372615570.399 WARN lxc_start - inherited fd 9 lxc-start 1372615570.399 INFO lxc_apparmor - aa_enabled set to 1 lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/302' (5/6) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/303' (7/8) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/304' (10/11) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/305' (12/13) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/306' (14/15) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/307' (16/17) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/308' (18/19) lxc-start 1372615570.399 DEBUG lxc_conf - allocated pty '/dev/pts/309' (20/21) lxc-start 1372615570.399 INFO lxc_conf - tty's configured lxc-start 1372615570.399 DEBUG lxc_start - sigchild handler set lxc-start 1372615570.399 INFO lxc_start - 'vm-59' is initialized lxc-start 1372615570.404 DEBUG lxc_start - Not dropping cap_sys_boot or watching utmp lxc-start 1372615570.404 INFO lxc_start - stored saved_nic #0 idx 12392 name vethP59 lxc-start 1372615570.404 INFO lxc_conf - opened /home/x/vm/vm-59.hold as fd 25 It stops there. In 'ps faux', it looks like: root 31621 0.0 0.0 25572 1272 ? D 14:06 0:00 \_ lxc-start -n vm-59 -f /tmp/tmp.fG6T6ERZpS -l DEBUG -o /home/x/lxcdebug/vm-59.txt -- /usr/sbin/dropbear -F -E -m On a successful LXC run (prior to the server getting into this state), this hangs just before: lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/' (rootfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys' (sysfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/proc' (proc) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/dev' (devtmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/dev/pts' (devpts) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/run' (tmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/' (btrfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys/fs/cgroup' (tmpfs) lxc-start 1372394092.208 DEBUG lxc_cgroup - checking '/sys/fs/cgroup/cpuset' (cgroup) lxc-start 1372394092.208 INFO lxc_cgroup - [1] found cgroup mounted at '/sys/fs/cgroup/cpuset',opts='rw,relatime,cpuset,clone_children' lxc-start 1372394092.208 DEBUG lxc_cgroup - get_init_cgroup: found init cgroup for subsys (null) at / It looks like a resource leak, but I'm not yet sure of what that would be. If it matters, I SIGKILL my lxc-start processes instead of using lxc-stop. Could that have any negative implications? Oh, and cgroups had almost 6000 entries for VMs that are long dead (I'm guessing it's due to my SIGKILL). I've run cgclear and my /sys/fs/cgroup/*/ dirs are now totally empty, but the new containers still hang. --- Architecture: amd64 DistroRelease: Ubuntu 13.04 MarkForUpload: True Package: lxc 0.9.0-0ubuntu3.3 PackageArchitecture: amd64 ProcEnviron: TERM=screen PATH=(custom, no user) LANG=en_US.UTF-8 SHELL=/bin/bash Uname: Linux 3.8.0-25-generic x86_64 UserGroups:
2013-07-03 01:42:17 Pavel Bennett attachment added Dependencies.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722559/+files/Dependencies.txt
2013-07-03 01:42:19 Pavel Bennett attachment added HookError_cloud_archive.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722560/+files/HookError_cloud_archive.txt
2013-07-03 01:42:21 Pavel Bennett attachment added HookError_generic.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722561/+files/HookError_generic.txt
2013-07-03 01:42:23 Pavel Bennett attachment added HookError_source_linux.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722562/+files/HookError_source_linux.txt
2013-07-03 01:42:24 Pavel Bennett attachment added HookError_source_lxc.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722563/+files/HookError_source_lxc.txt
2013-07-03 01:42:26 Pavel Bennett attachment added HookError_ubuntu.txt https://bugs.launchpad.net/bugs/1196295/+attachment/3722564/+files/HookError_ubuntu.txt
2013-07-03 01:49:59 Pavel Bennett lxc (Ubuntu): status Incomplete Confirmed
2013-07-03 02:24:41 Pavel Bennett attachment added apport.lxc.txt https://bugs.launchpad.net/ubuntu/+source/lxc/+bug/1196295/+attachment/3722586/+files/apport.lxc.txt
2013-07-03 06:11:19 Pavel Bennett linux: status Incomplete Confirmed
2013-07-03 17:41:47 Joseph Salisbury linux (Ubuntu): status New Confirmed
2013-07-05 19:34:50 Pavel Bennett tags apport-collected kernel-da-key raring apport-collected kernel-bug-exists-upstream kernel-da-key raring
2013-07-19 08:56:13 Stefan Bader bug added subscriber Stefan Bader
2013-10-01 14:54:06 Joseph Salisbury linux (Ubuntu): status Confirmed Incomplete
2013-10-24 10:07:46 markoa bug added subscriber markoa
2014-01-16 03:26:25 Stéphane Graber lxc (Ubuntu): status Confirmed Invalid