ec2 changes? rising failure rate in ec2 health checks
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
juju-core |
Fix Released
|
Medium
|
Unassigned |
Bug Description
CI's hourly health checks are seeing a 50% failure rate to bootstrap. We need to understand why this is happening and maybe we need to make changes to keep up with ec2. I am marking this bug critical for the period we do not know the cause. I hope we can lower the priority afterward.
2014-07-20 15:45:53 INFO juju.environs.
Launching instance
2014-07-20 15:45:56 INFO juju.utils http.go:59 hostname SSL verification enabled
2014-07-20 15:45:56 INFO juju.utils http.go:59 hostname SSL verification enabled
2014-07-20 15:45:56 INFO juju.utils http.go:59 hostname SSL verification enabled
2014-07-20 15:45:58 INFO juju.provider.ec2 ec2.go:643 started instance "i-ad695887" in "us-east-1a"
- i-ad695887
Waiting for address
2014-07-20 15:45:59 ERROR juju.provider.
Stopping instance...
Bootstrap failed, destroying environment
2014-07-20 15:45:59 INFO juju.provider.
2014-07-20 15:46:00 ERROR juju.cmd supercommand.go:323 refreshing addresses: The service is unavailable. Please try again shortly. (Unavailable)
<class 'subprocess.
Build step 'Execute shell' marked build as failure
no longer affects: | juju-core/1.20 |
Changed in juju-core: | |
status: | Triaged → Fix Released |
The failures a fast "Took 12 sec on master". Juju isn't waiting for an IP address. There isn't enough time to intervene in the aws console to add termination protection. I will try some manual runs with --debug to capture more information.