agent restarted as part of machine jobs update
Bug #1597830 reported by
Anastasia
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju |
Fix Released
|
High
|
Anastasia | ||
2.0 |
Fix Released
|
High
|
Anastasia | ||
2.2 |
Fix Released
|
High
|
Anastasia | ||
juju-core |
Fix Released
|
High
|
Anastasia | ||
1.25 |
Fix Released
|
High
|
Anastasia |
Bug Description
This was discovered when running HA on xenial machines with units.
Steps to reproduce:
1. bootstrap a new maas environment (used MAAS 1.9.2 with 4 nodes)
2. juju deploy -n 2 ubuntu
3. juju ensure-availability --to 1,2
Xenial machines 1 and 2 hang with "adding-vote" instead of "has-vote".
Changed in juju-core: | |
status: | New → Triaged |
assignee: | nobody → Anastasia (anastasia-macmood) |
importance: | Undecided → High |
tags: | added: blocker |
tags: |
added: ha removed: blocker |
Changed in juju-core: | |
milestone: | none → 2.0.0 |
assignee: | Anastasia (anastasia-macmood) → nobody |
affects: | juju-core → juju |
Changed in juju: | |
milestone: | 2.0.0 → none |
milestone: | none → 2.0.0 |
Changed in juju-core: | |
assignee: | nobody → Anastasia (anastasia-macmood) |
importance: | Undecided → High |
status: | New → Fix Committed |
Changed in juju: | |
assignee: | nobody → Alexis Bruemmer (alexis-bruemmer) |
tags: | added: ateam |
Changed in juju: | |
milestone: | 2.0-rc3 → 2.0.0 |
Changed in juju: | |
milestone: | 2.0.0 → 2.0.1 |
Changed in juju: | |
milestone: | 2.0.1 → none |
Changed in juju-core: | |
status: | Fix Committed → Fix Released |
Changed in juju: | |
milestone: | none → 2.1.0 |
assignee: | Alexis Bruemmer (alexis-bruemmer) → Anastasia (anastasia-macmood) |
Changed in juju: | |
status: | In Progress → Fix Committed |
Changed in juju: | |
milestone: | 2.1-rc1 → 2.1-beta3 |
Changed in juju: | |
status: | Fix Committed → Fix Released |
To post a comment you must log in.
On further investigation, the worker that picks up change in machine jobs (conv2state worker) issues agent.restart which seems to hang with these external dependencies:
github. com/coreos/ go-systemd 7b2428fec400335 49c68f54e26e89e 7ca9a9ce31 com/godbus/ dbus 32c6cc29c14570d e4cf6d7e7737d68 fb2d01ad15
github.
We should not restart agent from worker.
The fix on 1.25.5 will to throw a fatal error instead.
The fix on master will need to use flags in manifold dependencies and throw context-specific error that we can check for.