bm node instance provisioning delay with 45 nodes
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Compute (nova) |
Won't Fix
|
Medium
|
Unassigned |
Bug Description
Nova - 1:2013.1
While doing scale test, within a batch of 42 cartridges, few of them get provisioned in ~7 mins and few takes 35 mins.
There is totally 10 instances which spans between 30-36 minutes of provisioning time.
nova-compute log snippet of the node which is taking 36 minutes [the highest time in the batch].
In the first minute, the Claim is successful.
The next 30 minutes is wait time with the message "During sync_power_state the instance has a pending task. Skip" and eventually the provisioning gets completed.
Per blueprint, https:/
the current is nova-baremetal-
while the desired approach is ramdisk fdisks the local disks, pulls specified image from glance, writes to local disk, and reboot into it
Is the current approach causing this performance bottle neck? Is there any parameter which can be tuned to better the performance?
Line 388: 2013-09-06 12:09:12.843 AUDIT nova.compute.
Line 801: 2013-09-06 12:10:15.230 AUDIT nova.compute.claims [req-894f8127-
Line 802: 2013-09-06 12:10:15.232 AUDIT nova.compute.claims [req-894f8127-
Line 803: 2013-09-06 12:10:15.233 AUDIT nova.compute.claims [req-894f8127-
Line 804: 2013-09-06 12:10:15.235 AUDIT nova.compute.claims [req-894f8127-
Line 805: 2013-09-06 12:10:15.236 AUDIT nova.compute.claims [req-894f8127-
Line 806: 2013-09-06 12:10:15.238 AUDIT nova.compute.claims [req-894f8127-
Line 807: 2013-09-06 12:10:15.240 AUDIT nova.compute.claims [req-894f8127-
Line 808: 2013-09-06 12:10:15.241 AUDIT nova.compute.claims [req-894f8127-
Line 1047: 2013-09-06 12:12:14.514 28732 INFO nova.compute.
Line 1354: 2013-09-06 12:22:23.266 28732 INFO nova.compute.
Line 1405: 2013-09-06 12:32:30.858 28732 INFO nova.compute.
Line 1486: 2013-09-06 12:42:38.456 28732 INFO nova.compute.
Line 1490: 2013-09-06 12:43:58.078 28732 INFO nova.virt.
Line 1493: 2013-09-06 12:44:41.641 28732 INFO nova.virt.
tags: | added: baremetal |
Changed in nova: | |
status: | New → Triaged |
importance: | Undecided → Medium |
Changed in nova: | |
status: | Triaged → Won't Fix |
From an irc conversation, the images size are about 3GB in a 1Gb network with an only pxe server. In the best case scenario, the images deployment time is ~18 minutes (just the network distribution of the image) + overhead of the file injection per image. I guess the file injection is done sequentially, so that's why the images halt in a syn_power state.
So no bug, but for sure something to improve. (Remove of file injection on the way in ironic, improve image distribution via bittorrent, multicast...)