baremetal PXE timeout interrupts active deploys
Bug #1208638 reported by
Robert Collins
This bug affects 2 people
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ironic |
Invalid
|
Medium
|
Yuriy Zveryanskyy | ||
OpenStack Compute (nova) |
Won't Fix
|
Medium
|
Unassigned |
Bug Description
When the DD of an image takes an unexpectedly long time (e.g. due to network congestion), the PXE deploy timeout may interrupt the deploy by powering off the node, which then causes it to be rescheduled and exacerbates the problem.
If we monitor dd and check it is making progress, we could use this as a heartbeat to prevent inappropriate interrupts - and have the timeout look for a period of no progress (vs just absolute time).
To post a comment you must log in.
Ironic will inherit this issue with the PXE driver, so adding task there.