Upgrade to 2.3.8 caused unit agents to become unresponsive
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Expired
|
Undecided
|
Unassigned |
Bug Description
After upgrading the controller and workload model from 2.3.5 to 2.3.8, on an environment with Maas 2.3 and a bunch of machines with lxd containers, hooks in units are not running and even 'juju run --unit $thing' is unresponsive. 'juju run --machine X' is fine.
When I use 'juju run' on a unit, the unit agent reports:
2018-06-26 00:49:46 DEBUG juju.worker.
juju debug-log also shows the same message, but nothing else.
engine-report from the unit machine agent: https:/
machine log on the controller machine, with the PRIMARY mongo role (there's 2 more controllers): https:/
Complete machine log on the keystone/0 unit (lxc): https:/
last 5000 lines of the unit log for keystone/0: https:/
When I restarted the subordinate agents on the unit, the log showed some change (restart was at 01:57): https:/ /pastebin. canonical. com/p/fV4vBFnJS n/
Looks like hooks are running, maybe there was some log not being released till a service got restarted?