juju-upgrade to 1.24.7 leaves juju state server unreachable
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
juju-core |
Won't Fix
|
High
|
Unassigned |
Bug Description
Running upgrade (from 1.23.3) seems fine:
juju upgrade-juju --debug
2015-11-19 16:47:54 INFO juju.cmd supercommand.go:37 running juju [1.25.0-
2015-11-19 16:47:54 DEBUG juju.api api.go:154 trying cached API connection settings - endpoints [192.168.
2015-11-19 16:47:54 INFO juju.api api.go:266 connecting to API addresses: [192.168.
2015-11-19 16:47:54 INFO juju.api apiclient.go:262 dialing "wss://
2015-11-19 16:47:54 INFO juju.api apiclient.go:194 connection established to "wss://
2015-11-19 16:47:54 DEBUG juju.api api.go:476 API hostnames unchanged - not resolving
2015-11-19 16:47:54 DEBUG juju.api api.go:506 cacheChangedAPI
2015-11-19 16:47:59 DEBUG juju.cmd.juju upgradejuju.go:363 found a more recent stable version 1.24.7
2015-11-19 16:47:59 INFO juju.cmd cmd.go:129 available tools:
1.15.
<snip>
1.25.
2015-11-19 16:47:59 INFO juju.cmd cmd.go:129 best version:
1.24.7
2015-11-19 16:47:59 INFO juju.cmd.juju upgradejuju.go:214 started upgrade to 1.24.7
2015-11-19 16:47:59 INFO juju.cmd supercommand.go:435 command finished
Soon on machine-0.log (and thousands of apiserver messages):
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [4] unit-nrpe-19 {"RequestId"
2015-11-19 16:59:03 INFO juju.worker runner.go:261 start "networker"
2015-11-19 16:59:03 INFO juju.worker runner.go:261 start "authentication
2015-11-19 16:59:03 INFO juju.networker networker.go:163 networker is disabled - not starting on machine "machine-0"
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [8] machine-0 {"RequestId"
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [8] machine-0 {"RequestId"
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [8] machine-0 {"RequestId"
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [8] machine-0 {"RequestId"
2015-11-19 16:59:03 DEBUG juju.apiserver apiserver.go:267 <- [8] machine-0 {"RequestId"
2015-11-19 16:59:03 INFO juju.mongo open.go:125 dialled mongo successfully on address "192.168.
2015-11-19 16:59:03 DEBUG juju.worker.logger logger.go:45 reconfiguring logging from "<root>=DEBUG" to "<root>
2015-11-19 16:59:09 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:09 ERROR juju.worker runner.go:208 fatal "api": must restart: an agent upgrade is available
2015-11-19 16:59:09 ERROR juju.worker.
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 16:59:10 ERROR juju.worker runner.go:208 fatal "environ-
2015-11-19 16:59:10 ERROR juju.worker runner.go:208 fatal "firewaller": watcher has been stopped
2015-11-19 16:59:10 ERROR juju.worker runner.go:219 exited "ab165d76-
2015-11-19 16:59:11 ERROR juju.rpc server.go:573 error writing response: write tcp 192.168.0.79:49173: broken pipe
juju status becomes unresponsive. Running 'restart jujud-machine-0 results in about the the same errors as above:
On machine-0.log:
2015-11-19 17:05:05 DEBUG juju.apiserver apiserver.go:278 -> [59] machine-0 6.071358ms {"RequestId"
2015-11-19 17:05:05 DEBUG juju.worker.logger logger.go:45 reconfiguring logging from "<root>=DEBUG" to "<root>
2015-11-19 17:05:05 ERROR juju.worker runner.go:208 fatal "upgrader": must restart: an agent upgrade is available
2015-11-19 17:05:09 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:09 ERROR juju.worker runner.go:208 fatal "api": must restart: an agent upgrade is available
2015-11-19 17:05:09 ERROR juju.worker.
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.api.watcher watcher.go:84 error trying to stop watcher: connection is shut down
2015-11-19 17:05:10 ERROR juju.worker runner.go:208 fatal "firewaller": watcher has been stopped
2015-11-19 17:05:10 ERROR juju.worker runner.go:208 fatal "environ-
2015-11-19 17:05:10 ERROR juju.worker runner.go:219 exited "ab165d76-
Between restarting jujud and the errors is a window of about 5 seconds when I can connect to the state server.
Changed in juju-core: | |
status: | New → Triaged |
importance: | Undecided → High |
Trying to use that window to reset the upgrade does not help:
johan@maas:~$ juju upgrade-juju --reset- previous- upgrade --debug trusty- amd64 gc] 0.50:17070] 0.50:17070] 192.168. 0.50:17070/ environment/ ab165d76- 5703-4ec2- 806f-fad6691ff3 39/api" 192.168. 0.50:17070/ environment/ ab165d76- 5703-4ec2- 806f-fad6691ff3 39/api" , will retry: websocket.Dial wss://192. 168.0.50: 17070/environme nt/ab165d76- 5703-4ec2- 806f-fad6691ff3 39/api: dial tcp 192.168.0.50:17070: connection refused 192.168. 0.50:17070/ environment/ ab165d76- 5703-4ec2- 806f-fad6691ff3 39/api" .go:28 opening environment "maas". 192.168. 0.50:17070/ environment/ ab165d76- 5703-4ec2- 806f-fad6691ff3 39/api" ances returned: [/MAAS/ api/1.0/ nodes/node- b0c658c8- 7f8e-11e4- 9d45-0021702cb7 de/] Info: serverUUID= "ab165d76- 5703-4ec2- 806f-fad6691ff3 39" 0-precise- amd64 0-win81- amd64
2015-11-19 17:28:18 INFO juju.cmd supercommand.go:37 running juju [1.25.0-
2015-11-19 17:28:18 DEBUG juju.api api.go:154 trying cached API connection settings - endpoints [192.168.
2015-11-19 17:28:18 INFO juju.api api.go:266 connecting to API addresses: [192.168.
2015-11-19 17:28:18 INFO juju.api apiclient.go:262 dialing "wss://
2015-11-19 17:28:18 DEBUG juju.api apiclient.go:268 error dialing "wss://
2015-11-19 17:28:20 INFO juju.api apiclient.go:262 dialing "wss://
2015-11-19 17:28:20 DEBUG juju.provider.maas environprovider
2015-11-19 17:28:20 INFO juju.api apiclient.go:194 connection established to "wss://
2015-11-19 17:28:20 DEBUG juju.environs utils.go:94 StateServerInst
2015-11-19 17:28:20 DEBUG juju.api api.go:476 API hostnames unchanged - not resolving
2015-11-19 17:28:20 DEBUG juju.api api.go:506 cacheChangedAPI
2015-11-19 17:28:24 DEBUG juju.cmd.juju upgradejuju.go:363 found a more recent stable version 1.25.0
2015-11-19 17:28:24 INFO juju.cmd cmd.go:129 available tools:
1.15.
<snip>
1.25.
2015-11-19 17:28:24 INFO juju.cmd cmd.go:129 best version:
1.25.0
WARNING! using --reset- previous- upgrade when an upgrade is in progress
will cause the upgrade to fail. Only use this option to clear an
incomplete upgrade where the root cause has been resolved.
Continue [y/N]? y
2015-11-19 17:28:25 ERROR juju.cmd supercommand.go:429 some agents have not upgraded to the current environment version 1.24.7: machine-0 <snip>