instance cleanup needs to be more defensive

Bug #1699552 reported by James Page
This bug affects 2 people
Affects: nova-lxd
Status: Confirmed
Importance: High
Assigned to: Unassigned
Milestone: none

Bug Description

If an instance moves to ERROR or STOPPED state in the underlying LXD hypervisor, the driver sometimes fails to clean it up, resulting in instance leakage: the instance is gone in nova, but is clearly still present in lxc list:

+--------------------------+---------+------+------+------------+-----------+
| NAME | STATE | IPV4 | IPV6 | TYPE | SNAPSHOTS |
+--------------------------+---------+------+------+------------+-----------+
| instance-00000042-rescue | STOPPED | | | PERSISTENT | 0 |
+--------------------------+---------+------+------+------------+-----------+
| instance-0000004c-rescue | STOPPED | | | PERSISTENT | 0 |
+--------------------------+---------+------+------+------------+-----------+

Profiles also get left behind when this occurs: they can't be deleted while the container is still present, because LXD blocks the deletion due to the dependency.

This is pretty easy to reproduce on a standard tempest run with devstack+nova-lxd.
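
For illustration only, here is a rough sketch of what a more defensive cleanup could look like. It is not the nova-lxd driver code. The client wrapper, its get_container/get_profile methods, the NotFound exception and the assumption that the profile shares the instance name are all hypothetical stand-ins. The idea is to tolerate containers that are already gone, to delete STOPPED or ERROR containers without trying to stop them first, and to remove the profile only after every container that depends on it has been deleted.

```python
class NotFound(Exception):
    """Hypothetical stand-in for the client's 'object does not exist' error."""


def cleanup_instance(client, name, suffixes=("", "-rescue")):
    """Best-effort removal of an instance's containers, then its profile.

    `client` is a hypothetical wrapper exposing get_container(name) and
    get_profile(name); both are assumed to raise NotFound when the object
    is missing. Returns the names of the containers that were deleted.
    """
    deleted = []
    for suffix in suffixes:
        container_name = name + suffix
        try:
            container = client.get_container(container_name)
        except NotFound:
            continue  # already gone: nothing to clean up

        # Only a running container needs stopping; containers in STOPPED
        # or ERROR state are deleted directly.
        if container.status.upper() == "RUNNING":
            container.stop()
        container.delete()
        deleted.append(container_name)

    # The profile can only be removed once no container depends on it
    # (assumption: the profile is named after the instance).
    try:
        client.get_profile(name).delete()
    except NotFound:
        pass  # profile already removed
    return deleted
```

Calling cleanup_instance a second time for the same instance should be a no-op under these assumptions, which is the property that matters when a cleanup is retried after a partial failure.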

Chris MacNaughton (chris.macnaughton) wrote :

This bug is also now blocking the nova-lxd gate.

Changed in nova-lxd:
status: New → Confirmed
Changed in nova-lxd:
assignee: nobody → Alex Kavanagh (ajkavanagh)
Changed in nova-lxd:
importance: Undecided → Critical
importance: Critical → High
Changed in nova-lxd:
assignee: Alex Kavanagh (ajkavanagh) → nobody
