Artful to Bionic upgrade kills all LXD container networking

Bug #1768118 reported by David Favor on 2018-04-30
10
This bug affects 2 people
Affects Status Importance Assigned to Milestone
cloud-init (Ubuntu)
Undecided
Unassigned

Bug Description

Upgrading machine from Artful -> Bionic all LXD containers fail, as no DHCP addresses are assigned.

David Favor (davidfavor) wrote :

This is a high priority bug as it means any machine running many containers, like hosting companies will destroy all connectivity to containers if upgrade is done.

This means Bionic upgrades should not occur (when LXD containers running), till this bug is fixed.

Scott Moser (smoser) wrote :

Hi David,
Thank you for filing the bug.

Could you please run 'cloud-init collect-logs' and attach the output?

We're interested also in knowing what platform you are running on.
Is cloud-init installed on your container host ? Is that running on a cloud platform?

Or, are you only having problems network problems in the containers?

Changed in cloud-init (Ubuntu):
status: New → Incomplete
Scott Moser (smoser) wrote :

David,
After you've provided the information requested, please set the bug back to 'New' status.
Thanks,
Scott

Ryan Harper (raharper) wrote :

What's the process for upgrading from Artful to Bionic?

do-release-upgrade -d ?

David Favor (davidfavor) wrote :

Additional info.

Correct. I did a do-release-upgrade -d to update from Artful to Bionic.

Also, this machine installed Artful from scratch, so was using Netplan networking.

I've run cloud-init collect-logs + this produces no output + even using strace it's not clear where logs end up being deposited.

Let me know + I'll provide the logs.

Only network problems are in containers.

Also, if I simply copy a failing container over to another Artful machine the container works fine, so it appears something's wrong with container <-> machine plumbing.

David Favor (davidfavor) wrote :

Ah... Maybe you're referring to standard cloud-init logs.

I've attached these two logs. If other logs are required, let me know.

David Favor (davidfavor) wrote :

If you look closely at the cloud-init-output.log you'll see the DHCP address never gets assigned + there's no errors or warnings about any problems.

Scott Moser (smoser) wrote :

cloud-init collect-logs

writes output to the current working directory named 'cloud-init.tar.gz'.

Can you please attach that?

thank you.

David Favor (davidfavor) wrote :
Scott Moser (smoser) wrote :

Hi David,
Thank you for the information.

Its not clear to me what the problem is.

Please answer:
   Which OS did you ugprade from artful to bionic? The host? the container? both?

I just successfully attempted a upgrade from an artful lxd container to bionic lxd container (my host is bionic, my lxd is from the snap packaging of lxd version 3.0.0).

% lxc launch ubuntu-daily:artful a1
% lxc exec a1 -- do-release-upgrade -d
# say 'Y' (yes) several times, reboot
% lxc list a1
$ lxc list a1
+------+---------+-----------------------+-----------------------------------------------+------------+-----------+
| NAME | STATE | IPV4 | IPV6 | TYPE | SNAPSHOTS |
+------+---------+-----------------------+-----------------------------------------------+------------+-----------+
| a1 | RUNNING | 10.145.210.191 (eth0) | fd42:6470:c1cd:249f:216:3eff:fe85:80bb (eth0) | PERSISTENT | 0 |
+------+---------+-----------------------+-----------------------------------------------+------------+-----------+

David Favor (davidfavor) wrote :

Upgrade was done at host/machine level only.

When this is done all containers fail.

https://github.com/lxc/lxd/issues/4510 covers more information.

There appear to be many problems with bionic + LXD.

David Favor (davidfavor) wrote :

Problem seems to be in Ubuntu packaging, as this fixes problem.

1) copy all containers to another LXD machine

2) do fresh Bionic install

3) copy all containers back

At this point all problems resolve, so some sort of packaging problem in the Artful to Bionic upgrade.

summary: - /etc/netplan/50-cloud-init.yaml fails silently with no dhcp address
- assigned
+ Artful to Bionic upgrade kills all LXD container networking
David Favor (davidfavor) wrote :

https://github.com/lxc/lxd/issues/4510 is the LXD related bug with additional information + fix LXD dev is rolling in to attempt avoiding problem.

To post a comment you must log in.
This report contains Public information  Edit
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.