iscsi deploys are sensitive to timeskew

Bug #1331862 reported by Robert Collins
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Ironic
Triaged
Medium
Unassigned
tripleo
Incomplete
High
Unassigned

Bug Description

With the PXE driver, if the deployed-to node and the conductor are too far apart clock wise, errors that may be quite inscrutable will be thrown.

We should:
a) in tripleo ensure ntp is available on the conductor
b) have the deploy ramdisk try (with a timeout) to sync time with the conductor
c) document this issue for other deployment approaches

Tags: pxe
Changed in tripleo:
status: New → Triaged
importance: Undecided → High
Changed in ironic:
status: New → Triaged
importance: Undecided → High
Revision history for this message
Dmitry Tantsur (divius) wrote :

Hi! Just for future reference: could you give some example of errors that can be thrown?

tags: added: pxe
aeva black (tenbrae)
Changed in ironic:
importance: High → Medium
Revision history for this message
Steven Hardy (shardy) wrote : potentially eol bug

This bug was reported against an old version of TripleO, and may no longer be valid.

Since it was reported before the start of the liberty cycle (and our oldest stable
branch is stable/liberty), I'm marking this incomplete.

Please reopen this (change the status from incomplete) if the bug is still valid
on a current supported (stable/liberty, stable/mitaka or trunk) version of TripleO,
thanks!

Changed in tripleo:
status: Triaged → Incomplete
Revision history for this message
Pavlo Shchelokovskyy (pshchelo) wrote :

While not strictly related to iscsi deploy and albeit probably being obvious, I'd like to add that the whole Ironic service is quite sensitive re time skew between ironic-conductor and ironic-api nodes.

This is due to ironic-conductor nodes posting DB keepalives in their local time (not the DB time), so api service might decide that given conductor is out if the time skew is big enough (with default settings, >50sec).

The resulting error manifests as "No valid conductor host found for driver..." etc when e.g. trying to create a node.

Revision history for this message
Jay Faulkner (jason-oldos) wrote :

Marking this as a duplicate of our longstanding IPA bug to make the system clock accurate. I'll add tripleo as effected by that bug.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.