kill-controller is stuck, lots of "lease manager stopped" errors
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Expired
|
Medium
|
Unassigned | ||
juju-core |
Won't Fix
|
High
|
Unassigned | ||
1.25 |
Won't Fix
|
Undecided
|
Unassigned |
Bug Description
juju-2.0-beta5, lxd controller, lxd 2.0.0-0ubuntu4
`juju kill-controller` is stuck and not making progress in tearing down my models & controller.
Messages writing to machine-0.log on the controller every 3 seconds like this:
2016-04-21 16:47:47 ERROR juju.worker.
2016-04-21 16:47:50 ERROR juju.worker.
2016-04-21 16:47:53 ERROR juju.worker.
2016-04-21 16:47:56 ERROR juju.worker.
2016-04-21 16:47:59 ERROR juju.worker.
2016-04-21 16:48:02 ERROR juju.worker.
2016-04-21 16:48:05 ERROR juju.worker.
2016-04-21 16:48:08 ERROR juju.worker.
2016-04-21 16:48:11 ERROR juju.worker.
2016-04-21 16:48:14 ERROR juju.worker.
2016-04-21 16:48:17 ERROR juju.worker.
2016-04-21 16:48:20 ERROR juju.worker.
2016-04-21 16:48:23 ERROR juju.worker.
2016-04-21 16:48:26 ERROR juju.worker.
I'm going to nuke the lxc containers and try again with master to see if the issue has been fixed...
summary: |
- kill-controller is stuck + kill-controller is stuck, lots of "lease manager stopped" errors |
Changed in juju-core: | |
milestone: | 2.0-beta6 → 2.0-beta7 |
Changed in juju-core: | |
milestone: | 2.0-beta7 → 2.0-beta8 |
Changed in juju-core: | |
status: | Fix Committed → Fix Released |
affects: | juju-core → juju |
Changed in juju: | |
milestone: | 2.0-beta8 → none |
milestone: | none → 2.0-beta8 |
Changed in juju-core: | |
importance: | Undecided → High |
status: | New → Triaged |
Changed in juju-core: | |
status: | Triaged → Won't Fix |
Changed in juju: | |
status: | Fix Released → Triaged |
milestone: | 2.0-beta8 → 2.0-rc1 |
assignee: | William Reade (fwereade) → nobody |
importance: | High → Medium |
Changed in juju: | |
milestone: | 2.0-rc1 → 2.0.1 |
Changed in juju: | |
milestone: | 2.0.1 → none |
I saw the same problem when trying to clean up the environment in bug #1572237, and I think this is the same underlying issue. I think for that bug, the fix William is working on is specific to the pinger, but there is a more general issue of sub-state workers not being managed workers yet. We may use this bug for that issue.
And, there's also bug #1566426 which asks for kill-controller to time out when destroying through the API when we have cases like this where the environment is broken.