a bad AMI can hang an entire compute node
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
nova (Ubuntu) |
Fix Released
|
Critical
|
Unassigned |
Bug Description
Using the attached image (and others) causes the entire compute node to hang between the booting of the image and the configuration of networking. The running image has a console ring buffer output file (however problematic--often it looks like it never got a proper root filesystem somehow--lots of "NO PTY" errors), but is unpingable.
The only way to terminate these instances is to restart nova-compute so that it will collect amqp messages again, and then send the terminate request. This seems suspiciously like the compute code is blocking in a libvirt call of some sort.
The cluster used booted an older Oneiric image with no problems whatsoever.
This effectively can DoS an entire openstack installation through nothing more than running instances.
Attached is the amd64 image from http://
tags: | added: canonistack |
Changed in nova (Ubuntu): | |
importance: | Undecided → Critical |
Changed in nova (Ubuntu): | |
status: | Confirmed → Triaged |
Changed in nova (Ubuntu): | |
status: | Triaged → Fix Committed |
Changed in nova (Ubuntu): | |
status: | Fix Committed → Fix Released |
This does not cause libvirtd to hang, by the way. "sudo virsh list" does fine, and I'm able to kill instances manually with virsh destroy.