"juju run" timeouts on single units of an application
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Canonical Juju | Expired | High | Unassigned |
Bug Description
We've been observing a strange problem with Juju 2.2.6 (controller and model) where a single unit of an application does not seem to run its command until many minutes have elapsed. Here is an example with the timeout extended so that we get output from all commands; with the default 5m timeout, the unit would not get to run the command in time for "juju run" to collect the results. The problem occurred with multiple applications in the model, not just this one, and with both primary and subordinate applications.
(mojo-ols-
01:29:31 INFO juju.cmd supercommand.go:63 running juju [2.2.6 gc go1.8]
01:29:31 DEBUG juju.cmd supercommand.go:64 args: []string{"juju", "run", "--timeout", "15m", "--debug", "--application", "sca-conn-check", "date -u"}
01:29:31 INFO juju.juju api.go:67 connecting to API addresses: [10.25.2.110:17070 10.25.2.109:17070 10.25.2.111:17070]
01:29:31 DEBUG juju.api apiclient.go:863 successfully dialed "wss://
01:29:31 INFO juju.api apiclient.go:617 connection established to "wss://
- Stdout: |
Tue Nov 21 01:29:39 UTC 2017
UnitId: sca-conn-check/20
- Stdout: |
Tue Nov 21 01:29:40 UTC 2017
UnitId: sca-conn-check/21
- Stdout: |
Tue Nov 21 01:29:40 UTC 2017
UnitId: sca-conn-check/23
- Stdout: |
Tue Nov 21 01:29:40 UTC 2017
UnitId: sca-conn-check/24
- Stdout: |
Tue Nov 21 01:29:40 UTC 2017
UnitId: sca-conn-check/26
- Stdout: |
Tue Nov 21 01:29:40 UTC 2017
UnitId: sca-conn-check/27
- Stdout: |
Tue Nov 21 01:29:50 UTC 2017
UnitId: sca-conn-check/25
- Stdout: |
Tue Nov 21 01:39:49 UTC 2017
UnitId: sca-conn-check/22
01:39:50 DEBUG juju.api monitor.go:35 RPC connection died
01:39:50 INFO cmd supercommand.go:465 command finished
And here is the corresponding action for the command that took 10 minutes to begin execution:
(mojo-ols-
results:
Code: "0"
Stderr: ""
Stdout: |
Tue Nov 21 01:39:49 UTC 2017
status: completed
timing:
completed: 2017-11-21 01:39:49 +0000 UTC
enqueued: 2017-11-21 01:29:33 +0000 UTC
started: 2017-11-21 01:39:46 +0000 UTC
(mojo-ols-
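To put a number on the delay, the timing fields from the action output above can be subtracted directly. This is only a rough sketch: the helper names (`parse_action_time`, `queue_delay`, `run_time`) are made up for illustration, and the timestamp layout is assumed to match what was pasted here.

```python
from datetime import datetime

# Timing values copied from the action output above.
timing = {
    "enqueued": "2017-11-21 01:29:33 +0000 UTC",
    "started": "2017-11-21 01:39:46 +0000 UTC",
    "completed": "2017-11-21 01:39:49 +0000 UTC",
}


def parse_action_time(text):
    """Parse a timestamp such as '2017-11-21 01:29:33 +0000 UTC'.

    Hypothetical helper: drops the trailing zone name and relies on the
    numeric offset for the timezone.
    """
    stamp = text.rsplit(" ", 1)[0]
    return datetime.strptime(stamp, "%Y-%m-%d %H:%M:%S %z")


def queue_delay(timing):
    """Time the action spent waiting between being enqueued and starting."""
    return parse_action_time(timing["started"]) - parse_action_time(timing["enqueued"])


def run_time(timing):
    """Time the command itself took once it started."""
    return parse_action_time(timing["completed"]) - parse_action_time(timing["started"])


print("queued for:", queue_delay(timing))  # queued for: 0:10:13
print("ran for:", run_time(timing))        # ran for: 0:00:03
```

So the command itself finished in about 3 seconds; almost all of the elapsed time was spent waiting to start.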
description: | updated |
Changed in juju: | |
importance: | Undecided → High |
status: | New → Triaged |
no longer affects: | juju/2.2 |
no longer affects: | juju/2.3 |
Changed in juju: | |
assignee: | Eric Claude Jones (ecjones) → nobody |
AIUI juju run with a unit will take the lock that's used for hook execution. Was there a hook that was running on the given unit during the period that it was waiting?
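If that understanding is right, the symptom would look like ordinary lock contention. The following is a toy model only, not Juju's code: `execution_lock`, `run_hook` and `run_command` are hypothetical names, and a plain `threading.Lock` stands in for whatever lock the unit agent actually uses.

```python
import threading
import time

# Hypothetical stand-in for the lock shared by hook execution and "juju run".
execution_lock = threading.Lock()


def run_hook(duration, started):
    """Simulate a long-running hook that holds the lock for `duration` seconds."""
    with execution_lock:
        started.set()  # signal that the hook now owns the lock
        time.sleep(duration)


def run_command(command):
    """Simulate a "juju run" command: wait for the lock, then execute.

    Returns how long the command waited for the lock and its result.
    """
    enqueued = time.monotonic()
    with execution_lock:
        waited = time.monotonic() - enqueued
        result = command()
    return waited, result


started = threading.Event()
hook = threading.Thread(target=run_hook, args=(0.2, started))
hook.start()
started.wait()  # make sure the hook holds the lock before the command is queued

waited, result = run_command(lambda: "done")
hook.join()

print(f"command waited {waited:.2f}s for the lock, result: {result}")
```

In this model the command is quick but cannot start until the hook releases the lock, which would match a large gap between "enqueued" and "started" with a short gap between "started" and "completed".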