LXD machines on AWS occasionally get stuck with no IP
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Canonical Juju | Fix Released | High | John A Meinel |
2.1 | Fix Released | High | John A Meinel |
Bug Description
When deploying a unit with `--to lxd:0` on AWS, I occasionally see machine 0/lxd/0 get stuck in a "pending" state forever.
I do not recall this happening in 2.1-beta4, but I have seen it several times in 2.1-beta5; roughly 30% of my deployments get stuck here.
From machine 0, I'm able to `lxc exec` into juju-90d10d-0-lxd-0 and see that it came up with no IP:
$ sudo lxc exec juju-90d10d-0-lxd-0 ifconfig eth0
eth0 Link encap:Ethernet HWaddr 00:16:3e:da:02:74
inet6 addr: fe80::216:
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:8 errors:0 dropped:0 overruns:0 frame:0
TX packets:8 errors:0 dropped:0 overruns:0 carrier:0
RX bytes:648 (648.0 B) TX bytes:648 (648.0 B)
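The manual check above (look at the `ifconfig` output and notice that only an `inet6` line is present) can be scripted. The following is a minimal sketch, not part of Juju or LXD: `has_ipv4` is a hypothetical helper name, and the pattern assumes `ifconfig`-style output in either the older `inet addr:10.0.0.5` form or the newer `inet 10.0.0.5` form.

```shell
#!/bin/sh
# has_ipv4 (hypothetical helper): reads ifconfig-style output on stdin and
# succeeds (exit 0) only if a line carrying an IPv4 address is present.
# "inet6" lines do not match, because the pattern requires "inet" to be
# followed by a space.
has_ipv4() {
    grep -Eq 'inet (addr:)?[0-9]+\.[0-9]+\.[0-9]+\.[0-9]+'
}

# Example use on the host machine (container name taken from the report):
#   sudo lxc exec juju-90d10d-0-lxd-0 -- ifconfig eth0 | has_ipv4 \
#       || echo "container has no IPv4 address"
```

Keeping the check as a filter on stdin means it can be tried against saved output without access to the affected machine.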
I haven't been able to find any leads on this, though the output of /var/log/
Changed in juju:
milestone: 2.1.0 → 2.2.0-alpha1
Changed in juju:
assignee: nobody → John A Meinel (jameinel)
Changed in juju:
status: Triaged → Incomplete
Changed in juju:
status: Incomplete → Triaged
Changed in juju:
status: Triaged → Incomplete
milestone: 2.2.0-alpha1 → none
Changed in juju:
status: Incomplete → Fix Committed
milestone: none → 2.2-rc1
Changed in juju:
status: Fix Committed → Fix Released
I have not found a way to reproduce this reliably; however, I see it occasionally when I run `juju deploy cs:~containers/kubernetes-core` in an AWS model.