stable/liberty CI: all jobs failing due to nodes stuck in wait call-back
Bug #1550772 reported by James Slagle
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | Fix Released | Critical | James Slagle |
Bug Description
It seems all stable/liberty CI jobs are currently failing. I took a look at a few of the failures, and they all seem stuck during the node deployment of the Overcloud. The nodes are started by Ironic, but the disk deployment is never started and all the nodes are stuck in the "wait call-back" state.
Example failure:
http://
I updated my local environment to the latest stable/liberty repos, and I was able to reproduce the same issue. I suspect a regression in either ironic-
Changed in tripleo:
status: In Progress → Fix Released
I can't see any way to debug what might be preventing the nodes from reaching back to Ironic to start the disk deployment. There's no way to see what is on the VM console, you apparently can't log the console to a file at this stage, and there are no logs saved anywhere else afaict.
I'm trying a few earlier Delorean repos to see if I can pinpoint where a regression might have come in. It seems our last successful job on stable/liberty was around 8:00 on 2/25.
This repo was broken for me: https://trunk.rdoproject.org/centos7-liberty/56/ef/56effa1f8d8bb2545669019dbb159703c3e54bde_5e110e28
Trying some earlier repos and will report back.
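For anyone repeating this, the search over earlier repos can be done as a bisection rather than one repo at a time. The following is only a rough sketch, not what was actually run here: `first_bad` and `is_good` are hypothetical names, the repo labels are placeholders, and it assumes the repos are ordered oldest to newest, the oldest is known good, the newest is known bad, and the breakage persists once introduced.

```python
from typing import Callable, Sequence


def first_bad(repos: Sequence[str], is_good: Callable[[str], bool]) -> int:
    """Return the index of the first broken repo.

    Assumes repos[0] is known good and repos[-1] is known bad, so neither
    endpoint is re-tested. is_good is a hypothetical callback that deploys
    against one repo and reports whether the nodes got past "wait call-back".
    """
    if len(repos) < 2:
        raise ValueError("need at least one known-good and one known-bad repo")
    lo, hi = 0, len(repos) - 1  # invariant: repos[lo] good, repos[hi] bad
    while hi - lo > 1:
        mid = (lo + hi) // 2
        if is_good(repos[mid]):
            lo = mid
        else:
            hi = mid
    return hi


if __name__ == "__main__":
    # Placeholder labels, not real Delorean repo hashes.
    candidates = [f"repo-{i}" for i in range(8)]
    # Stand-in for a real deployment test: pretend repo-5 introduced the break.
    broke_at = first_bad(candidates, lambda r: int(r.split("-")[1]) < 5)
    print(candidates[broke_at])  # prints "repo-5"
```

With N candidate repos this needs roughly log2(N) test deployments, which matters when each attempt is a full Overcloud deploy.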