Cannot achieve HA
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
juju-core |
Fix Released
|
Critical
|
Andrew Wilkins |
Bug Description
Both functional-
In both tests, ensure-ha never reaches HA. This might be mongo related because the backup-restore test also failed, though the errors look differently. The error specifically is about status fails, juju never go to to HA. I suspect the juju client is raising a real error because the state-server has become unavailable in a way that status doesn't know the user should try again. In general, status cannot raise an error while the state server transitions to HA because we know transitioning to HA is normal.
I have retested on AWS and HP. The error is the same. I am attaching the latest log. There isn't really anything to learn here because juju is not providing details of what status failed. I will try to capture a log from machine 0.
Changed in juju-core: | |
assignee: | nobody → Menno Smits (menno.smits) |
status: | Triaged → In Progress |
Changed in juju-core: | |
status: | Fix Committed → Fix Released |
I can reproduce this easily on EC2. You just need to do:
juju bootstrap
juju deploy ubuntu # this might not be necessary but it's what the test does
juju ensure-availability
At some point "juju status" stops working with "WARNING discarding API open error: auth fails" and if you look at the machine agent logs for the new state servers created by ensure-availability they fail right after startup like this:
2014-08-11 23:17:24 INFO juju.cmd.jujud machine.go:164 machine agent machine-2 start (1.21-alpha1. 1-trusty- amd64 [gc]) ip-10-197- 159-62. us-west- 2.compute. internal: 17070/" ip-10-197- 159-62. us-west- 2.compute. internal: 17070/" ip-10-197- 159-62. us-west- 2.compute. internal: 17070/" ip-10-197- 159-62. us-west- 2.compute. internal: 17070/" ip-10-197- 159-62. us-west- 2.compute. internal: 17070/" ip-10-197- 159-62. us-west- 2.compute. internal: 17070/"
2014-08-11 23:17:24 INFO juju.network network.go:97 setting prefer-ipv6 to false
2014-08-11 23:17:24 INFO juju.worker runner.go:261 start "api"
2014-08-11 23:17:24 INFO juju.worker runner.go:261 start "statestarter"
2014-08-11 23:17:24 INFO juju.worker runner.go:261 start "termination"
2014-08-11 23:17:24 INFO juju.state.api apiclient.go:252 dialing "wss://
2014-08-11 23:17:25 INFO juju.state.api apiclient.go:175 connection established to "wss://
2014-08-11 23:17:25 INFO juju.state.api apiclient.go:252 dialing "wss://
2014-08-11 23:17:26 INFO juju.state.api apiclient.go:175 connection established to "wss://
2014-08-11 23:17:26 ERROR juju.worker runner.go:219 exited "api": cannot set password of machine 2: auth fails
2014-08-11 23:17:26 INFO juju.worker runner.go:253 restarting "api" in 3s
2014-08-11 23:17:29 INFO juju.worker runner.go:261 start "api"
2014-08-11 23:17:29 INFO juju.state.api apiclient.go:252 dialing "wss://
2014-08-11 23:17:29 INFO juju.state.api apiclient.go:175 connection established to "wss://
2014-08-11 23:17:29 ERROR juju.worker runner.go:219 exited "api": cannot get machine 2: auth fails
2014-08-11 23:17:29 INFO juju.worker runner.go:253 restarting "api" in 3s
(continues on and on)
Investigating further...