compute node heartbeat out of sync causing scheduler to fail in devstack: VMs fail to spawn
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ceilometer | Won't Fix | Medium | Unassigned |
OpenStack Compute (nova) | Fix Released | Critical | Unassigned |
Bug Description
I have now seen this several times in devstack-gate.
Occasionally instances go into the ERROR state. The logs show the failure is in nova-scheduler: the compute node's last heartbeat is too old, so the scheduler treats the node as down and does not schedule on it.
From logfile:
DB_Driver.is_up last_heartbeat = 2013-09-07 00:04:47.516557 elapsed = 87.363841 is_up /opt/stack/
(devstack-
Filter ComputeFilter returned 0 host(s) get_filtered_
[instance: 415fb048-
20
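To illustrate why an elapsed time of about 87 seconds leaves the filter with 0 hosts, here is a minimal sketch of a heartbeat liveness check. It is not nova's actual code: the 60-second threshold and the names `is_up` and `SERVICE_DOWN_TIME` are assumptions made for illustration. Only the heartbeat timestamp and the elapsed value are taken from the log excerpt above.

```python
from datetime import datetime, timedelta

# Assumed threshold in seconds (hypothetical; the real value comes from
# the scheduler's configuration and may differ).
SERVICE_DOWN_TIME = 60


def is_up(last_heartbeat, now, service_down_time=SERVICE_DOWN_TIME):
    """Return (up, elapsed_seconds) for a service's last heartbeat."""
    elapsed = (now - last_heartbeat).total_seconds()
    # abs() means a heartbeat that appears to be in the future (clock skew)
    # is also judged against the same threshold.
    return abs(elapsed) <= service_down_time, elapsed


# Values from the log line: last_heartbeat and elapsed.
last_heartbeat = datetime(2013, 9, 7, 0, 4, 47, 516557)
now = last_heartbeat + timedelta(seconds=87.363841)

up, elapsed = is_up(last_heartbeat, now)
print("is_up:", up, "elapsed:", elapsed)
```

Under these assumptions the compute service is reported as down, so a filter that only passes live hosts would return none, which matches the "returned 0 host(s)" line.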
tags: | added: testing |
Changed in nova: | |
status: | New → Confirmed |
Changed in nova: | |
importance: | Undecided → Critical |
Changed in nova: | |
milestone: | none → havana-rc1 |
Changed in ceilometer: | |
status: | New → Triaged |
importance: | Undecided → Medium |
Changed in nova: | |
status: | Fix Committed → Fix Released |
Changed in nova: | |
milestone: | havana-rc1 → 2013.2 |
More examples of this bug can be found by searching logstash.openstack.org for '@tags: "logs/screen-n-sch.txt" AND @message:"Setting instance to ERROR state"'