Nova services break after rabbitmq slave restart
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
Undecided
|
Unassigned |
Bug Description
I'm using a highly-customized ISO based on 5.1 release. Basically, after a couple of hard resets of one specific rabbit slave nova services (nova-compute and nova-network) residing on compute node stop working.
Environment:
HA/nova-
Steps to reproduce:
1. Hard power off rabbit slave.
2. Wait and check that cluster has recovered and OpenStack (OS) is operational (try to spawn an instance).
3. Power on rabbit slave.
4. Wait for it to come online and check that OpenStack is operational.
5. Power off that slave again.
6. Wait for cluster to recover and check that it's operational.
7. Power on that slave back.
8. Wait for it to come online.
Nova services can break either on steps 6 or 8.
Update 1: nova-compute (and nova-network) become unresponsive. When trying to restart, it's process eventually dies. Log: http://
Update 2: after a full rabbit restart (crm resource stop, wait, crm resource start) and restart of dead nova-compute/
description: | updated |
description: | updated |
Changed in fuel: | |
status: | Incomplete → Invalid |
Dmitry, thanks for issue. Could you please add diagnostic snapshot?