Agent shutdown can cause cert updater channel already closed panic
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
juju-core | Fix Released | High | Ian Booth |
1.24 | Fix Released | High | Ian Booth |
Bug Description
A Landscape cloud deployment is stuck right after bootstrap. The Landscape juju client has been getting an "upgrade in progress" error from juju for about 20 minutes now. This is the first occurrence:
Jul 8 17:26:16 job-handler-1 INFO Traceback (failure with no frames): <class 'canonical.
This environment was bootstrapped with the setting "agent-version: 1.24.1", so it shouldn't have even tried to upgrade the tools. Bootstrap was kicked off at 17:17:52.
A juju status in that env works fine now:
$ juju status
environment: "3"
machines:
"0":
agent-state: started
agent-version: 1.24.1
dns-name: barley.scapestack
instance-id: /MAAS/api/
series: trusty
hardware: arch=amd64 cpu-cores=4 mem=16384M
state-
services: {}
As does a simple ubuntu deployment with --to lxc:0:
environment: "3"
machines:
"0":
agent-state: started
agent-version: 1.24.1
dns-name: barley.scapestack
instance-id: /MAAS/api/
series: trusty
containers:
0/lxc/0:
dns-name: 10.96.7.179
series: trusty
hardware: arch=amd64
hardware: arch=amd64 cpu-cores=4 mem=16384M
state-
services:
ubuntu:
charm: cs:trusty/ubuntu-3
exposed: false
service-status:
current: unknown
since: 08 Jul 2015 17:45:12Z
units:
ubuntu/0:
current: unknown
since: 08 Jul 2015 17:45:12Z
current: idle
since: 08 Jul 2015 17:45:15Z
version: 1.24.1
machine: 0/lxc/0
I also see a panic in machine-0.log after a lot of EOF errors:
(...)
2015-07-08 17:26:09 ERROR juju.worker runner.go:219 exited "authentication
2015-07-08 17:26:09 ERROR juju.worker runner.go:219 exited "authentication
panic: runtime error: send on closed channel
panic: runtime error: close of closed channel
goroutine 626 [running]:
runtime.
github.
(...)
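For reference, the two panics in the log are what the Go runtime raises when a goroutine sends on, or closes, a channel that has already been closed. The sketch below is not juju's code and not the fix that was released; `certUpdates`, `newCertUpdates`, and `panicMessage` are hypothetical names used only for illustration. It first reproduces both panics on a plain channel, then shows one common way to make shutdown safe, assuming the sender and the closer can share a mutex-guarded "closed" flag.

```go
package main

import (
	"fmt"
	"sync"
)

// certUpdates is a hypothetical stand-in for a channel that carries
// certificate updates to a worker. It is not a juju type.
type certUpdates struct {
	mu     sync.Mutex
	closed bool
	ch     chan string
}

func newCertUpdates(buf int) *certUpdates {
	return &certUpdates{ch: make(chan string, buf)}
}

// Send reports false instead of panicking once Close has been called.
func (c *certUpdates) Send(addr string) bool {
	c.mu.Lock()
	defer c.mu.Unlock()
	if c.closed {
		return false
	}
	select {
	case c.ch <- addr:
		return true
	default:
		// Buffer full: drop the update rather than block while holding the lock.
		return false
	}
}

// Close is idempotent: a second call is a no-op rather than a panic.
func (c *certUpdates) Close() {
	c.mu.Lock()
	defer c.mu.Unlock()
	if !c.closed {
		c.closed = true
		close(c.ch)
	}
}

// panicMessage runs f and returns the recovered panic text, or "" if f
// returned normally.
func panicMessage(f func()) (msg string) {
	defer func() {
		if r := recover(); r != nil {
			msg = fmt.Sprint(r)
		}
	}()
	f()
	return ""
}

func main() {
	// Unguarded channel: both operations panic after the first close.
	raw := make(chan string, 1)
	close(raw)
	fmt.Println("send: ", panicMessage(func() { raw <- "example-address" }))
	fmt.Println("close:", panicMessage(func() { close(raw) }))

	// Guarded wrapper: repeated Close and a late Send are both harmless.
	u := newCertUpdates(1)
	u.Close()
	u.Close()
	fmt.Println("late send accepted:", u.Send("example-address"))
}
```

The exact panic text varies between Go releases, so the sketch prints whatever the runtime reports instead of hard-coding it. Dropping an update when the buffer is full is a simplification; a real worker would more likely select on a separate shutdown signal.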
juju logs attached, and bits of landscape logs too, useful for timestamps.
summary: |
- juju stuck in "upgrade in progress " for 20min
+ Deploy with pinned agent-version still tried to upgrade |
summary: |
- Deploy with pinned agent-version still tried to upgrade
+ Deploy with pinned agent-version still tried to upgrade, and panic()ed |
summary: |
- Deploy with pinned agent-version still tried to upgrade, and panic()ed
+ Deploy with pinned agent-version still tried to upgrade, panic()ed |
tags: | added: regression |
Changed in juju-core: | |
importance: | Medium → High |
milestone: | none → 1.25.0 |
Changed in juju-core: | |
assignee: | nobody → Ian Booth (wallyworld) |
status: | Triaged → Fix Committed |
Changed in juju-core: | |
status: | Fix Committed → Fix Released |
Landscape's job-handler.log. Shows the bootstrap starting at 17:17:52 and subsequent errors.