inappropriate "relation-joined" for unit bounces uniter repeatedly
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Canonical Juju | Fix Released | High | Ian Booth | | |
Bug Description
This is from a solutions QA test run of k8s on vsphere. It seems that we have a race condition somewhere.
Three units showed up as "failed", and on inspection the uniters are repeatedly restarting.
https:/
See graylog/0, easyrsa/0 and vsphere-integrator
There is potentially more than one bug here.
From graylog/0 we see the following in the unit logs:
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
2020-05-27 16:34:36 INFO juju.worker.
From the machine-lock.log file, we only see relation-created for two of those relations:
2020-05-27 15:54:25 === agent unit-graylog-0 started ===
2020-05-27 15:54:40 unit-graylog-0: meterstatus (meter-
2020-05-27 16:10:30 unit-graylog-0: uniter (run install hook), waited 0s, held 14m16s
2020-05-27 16:12:05 unit-graylog-0: uniter (run relation-created (19; app: mongodb) hook), waited 0s, held 2s
2020-05-27 16:12:06 unit-graylog-0: uniter (run relation-created (22; app: filebeat) hook), waited 0s, held 2s
2020-05-27 16:12:25 unit-graylog-0: uniter (run leader-elected hook), waited 0s, held 19s
2020-05-27 16:12:29 unit-graylog-0: uniter (run config-changed hook), waited 0s, held 4s
2020-05-27 16:13:14 unit-graylog-0: uniter (run start hook), waited 0s, held 45s
2020-05-27 16:13:14 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:18 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:23 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:28 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:36 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
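The pattern in the excerpt above (relation-joined firing repeatedly for relation 17, which never got a relation-created) can be spotted mechanically. The following is a minimal sketch, not part of Juju: the function names `relation_hook_counts` and `joined_without_created` are hypothetical, and the regular expression assumes the machine-lock.log line format exactly as it appears in this report.

```python
import re
from collections import Counter

# Assumed line format, taken from the excerpt in this report, e.g.:
#   ... uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
HOOK_RE = re.compile(
    r"uniter \(run (?P<hook>relation-[a-z]+) \((?P<rel_id>\d+); [^)]*\) hook\)"
)


def relation_hook_counts(lines):
    """Count (relation id, hook name) pairs found in machine-lock.log lines."""
    counts = Counter()
    for line in lines:
        match = HOOK_RE.search(line)
        if match:
            counts[(int(match.group("rel_id")), match.group("hook"))] += 1
    return counts


def joined_without_created(counts):
    """Return relation ids that ran relation-joined but never relation-created."""
    created = {rel for (rel, hook) in counts if hook == "relation-created"}
    joined = {rel for (rel, hook) in counts if hook == "relation-joined"}
    return sorted(joined - created)


# Relation hook lines copied from the graylog/0 machine-lock.log excerpt.
SAMPLE = """\
2020-05-27 16:12:05 unit-graylog-0: uniter (run relation-created (19; app: mongodb) hook), waited 0s, held 2s
2020-05-27 16:12:06 unit-graylog-0: uniter (run relation-created (22; app: filebeat) hook), waited 0s, held 2s
2020-05-27 16:13:14 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:18 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:23 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:28 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
2020-05-27 16:13:36 unit-graylog-0: uniter (run relation-joined (17; unit: apache2/0) hook), waited 0s, held 0s
"""

if __name__ == "__main__":
    counts = relation_hook_counts(SAMPLE.splitlines())
    for (rel_id, hook), n in sorted(counts.items()):
        print(f"relation {rel_id}: {hook} x{n}")
    print("joined without created:", joined_without_created(counts))
```

Run against the sample lines, this flags relation 17 as the only relation with relation-joined but no relation-created, which matches what the log shows by eye.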
The uniter bounces with the following error:
2020-05-27 16:13:19 ERROR juju.worker.
2020-05-27 16:13:19 DEBUG juju.worker.
relation is broken and cannot be changed further
/workspace/
/workspace/
/workspace/
/workspace/
tags: added: cdo-qa cdo-release-blocker foundations-engine
Changed in juju:
assignee: nobody → Ian Booth (wallyworld)
status: Triaged → In Progress
Changed in juju:
status: In Progress → Fix Committed
Changed in juju:
status: Fix Committed → Fix Released
https://github.com/juju/juju/pull/11634