KVM + Glance + OS API 1.1 -- Image boot/networking problems
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Compute (nova) | Invalid | Undecided | Unassigned |
Bug Description
I'm having an odd issue: after I upload an image with glance, I can create an instance that responds to ping and SSH, but whenever I try to create a second instance I get the following error:
##########
2011-04-05 17:15:49,070 INFO nova.virt.
2011-04-05 17:15:49,142 DEBUG nova.utils [-] Attempting to grab semaphore "00000006" for method "call_if_
2011-04-05 17:15:49,143 DEBUG nova.utils [-] Running cmd (subprocess): qemu-img create -f qcow2 -o cluster_
2011-04-05 17:15:49,193 DEBUG nova.utils [-] Attempting to grab semaphore "local_20" for method "call_if_
2011-04-05 17:15:49,194 DEBUG nova.utils [-] Running cmd (subprocess): qemu-img create -f qcow2 -o cluster_
2011-04-05 17:15:49,245 INFO nova.virt.
2011-04-05 17:15:49,256 DEBUG nova.utils [-] Running cmd (subprocess): sudo qemu-nbd -c /dev/nbd15 /var/lib/
2011-04-05 17:15:50,287 DEBUG nova.utils [-] Running cmd (subprocess): sudo kpartx -a /dev/nbd15 from (pid=7537) execute /usr/lib/
2011-04-05 17:15:50,302 DEBUG nova.utils [-] Running cmd (subprocess): sudo qemu-nbd -d /dev/nbd15 from (pid=7537) execute /usr/lib/
2011-04-05 17:15:50,317 WARNING nova.virt.
2011-04-05 17:15:52,252 DEBUG nova.virt.
2011-04-05 17:15:52,494 DEBUG nova.virt.
##########
The second instance boots and gets an IP, but I cannot ping or SSH to it. When I connect to the console via VNC, I get "Booting from Hard Disk... Boot failed: not a bootable disk -- No bootable device."
Now the first instance I had working is also hosed. From here on out, I cannot create any instances that actually boot, even though 'novatools' says they are up and have IPs:
root@colo07:
+----+-
| ID | Name | Status | Public IP | Private IP |
+----+-
| 46 | Tester01 | ACTIVE | | 184.106.53.9 |
| 47 | Tester02 | ACTIVE | | 184.106.53.10 |
| 48 | Tester03 | ACTIVE | | 184.106.53.11 |
| 49 | Tester04 | ACTIVE | | 184.106.53.12 |
| 55 | jason-test | ACTIVE | | 184.106.53.13 |
| 56 | jason-test-
| 57 | Tester05 | ACTIVE | | 184.106.53.15 |
+----+-
One thing that I saw was " /dev/mapper/
Any idea what could be causing this behavior?
Cheers
Changed in nova: status: New → Invalid
Hey Wayne, there is a flag in Nova, "max_nbd_devices", which is set to 16 by default. Curiously, the nbd15 device in your log is the 16th and last device under that default, since numbering starts at nbd0.
I can't give you a fix because I don't know, well, a lot... :) but I can throw out some suggestions/questions:
1) How many VMs / disks do you have running right now? (it looks like only about 7?)
2) qemu-nbd, while in use, should put a pid file in /sys/block/nbdX/, so can you see if all of your /sys/block/nbdX/pid files exist? Something like `find /sys/block/nbd*/ -maxdepth 1 | grep pid` might work.
3) One workaround, while this is looked at, is to increase the number of NBD devices on your system. The only way I know of is to run `modprobe -r nbd` and then `modprobe nbd nbds_max=32` (or some higher number). NOTE: I have absolutely no idea what this will do to your existing disks. You'd also have to increase the --max_nbd_devices flag to 32 in /etc/nova.conf.
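If it helps, the check in suggestion 2 could be scripted roughly like this. It is only a sketch: it assumes that a connected nbd device exposes a `pid` file under `/sys/block/nbdX/` as described above, and the function name `nbd_usage` and its `sys_block` parameter are made up for illustration (they are not part of Nova).

```python
import glob
import os
import re


def nbd_usage(sys_block="/sys/block"):
    """Return (in_use, free) lists of nbd device names found under sys_block.

    Assumption: a device counts as in use when <sys_block>/nbdX/pid exists.
    """
    in_use, free = [], []
    # Keep only whole devices (nbd0, nbd1, ...) and sort them numerically
    # so that nbd2 is listed before nbd10.
    devices = sorted(
        (d for d in glob.glob(os.path.join(sys_block, "nbd*"))
         if re.fullmatch(r"nbd\d+", os.path.basename(d))),
        key=lambda d: int(os.path.basename(d)[3:]),
    )
    for dev in devices:
        name = os.path.basename(dev)
        if os.path.exists(os.path.join(dev, "pid")):
            in_use.append(name)
        else:
            free.append(name)
    return in_use, free


if __name__ == "__main__":
    used, unused = nbd_usage()
    print("in use:", used)
    print("free:  ", unused)
```

If every device from nbd0 to nbd15 shows up as in use while only about 7 instances are running, that would point at devices not being released rather than at the limit simply being too low.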