[library] Nova-compute services are in 'down' state after deployment: Timed out waiting for a reply to message
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Confirmed
|
High
|
Dmitry Ilyin |
Bug Description
api: '1.0'
astute_sha: bc60b7d027ab244
auth_required: true
build_id: 2014-08-27_10-46-57
build_number: '481'
feature_groups:
- mirantis
fuellib_sha: 6f478caa9111e42
fuelmain_sha: 74a97d500bb2fe9
nailgun_sha: 04e3f9d9ad3140c
ostf_sha: 4dcd99cc4bfa19f
production: docker
release: '5.1'
Steps to reproduce:
1. Deploy new environment:
* Ubuntu, HA, NeutronGre, Ceph for Glance, Ceilometer
* 2 Controller+Ceph, 3 Controller+Mongo, 4 Compute+Ceph, 6 Compute+Cinder, 2 Cinder+Ceph nodes
2. Deployment is successful. Start OSTF.
Expected result:
- health checks passed
Actual:
- most of checks failed (some of sanity checks failed due to '504' error from haproxy(nova/glance api), compute checks failed to launch instance because all computes services are down)
Here is the part of nova-compute logs(node-24):
http://
Example of 504 error:
http://
and HaProxy stats (errors):
http://
Also, I discovered, that RabbitMQ has lots of messagess in queries from nova-conductor and neutron:
http://
and there are errors in its logs:
Changed in fuel: | |
status: | New → Confirmed |
Changed in fuel: | |
assignee: | nobody → Fuel Library Team (fuel-library) |
I can't attach snapshot with logs (>500 MBytes), please download it here:
https:/ /yadi.sk/ d/RA09LbhxaZwcM