Comment 15 for bug 1768870

Andres Rodriguez (andreserl) wrote : Re: node failed commissioning - HTTP Error 400: {'boot_interface': ["Must be one of the node's interfaces."]}

So I've dug through the dump and found a few interesting things:

1. A 'Failed Commissioning' machine (landscapeamqp-2) failed because one script took longer to run than expected. Judging from the rsyslog, it is not obvious why that was the case: the rsyslog output is truncated at a certain point, so it doesn't show the whole commissioning process the machine would normally have gone through to get to where it did. To me this implies that there could have been some sort of network issue.

2. Other machines in 'Commissioning' state have NO interfaces. Since these machines come from a Pod, each of them should have had one interface attached, but they didn't. Given the error in comment #3, I think what could be happening is that there are VMs with duplicated MAC addresses across the different pods.

As such, Jason:

1. The first point would imply a network issue, so I would recommend you explore the possibility that there was some network breakage at some point.
2. We need logs from libvirt. Could you please start gathering /var/log/libvirt/?
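
In the meantime, one way to check the duplicated-MAC theory would be to compare the domain XML from each pod host (e.g. the output of `virsh dumpxml <domain>`). The following is only a rough sketch: the function names and the `domains_by_pod` input (pod name mapped to a list of domain XML strings) are made up for illustration and are not part of MAAS or libvirt.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def macs_in_domain(domain_xml):
    """Return (vm_name, [mac, ...]) parsed from one libvirt domain XML string."""
    root = ET.fromstring(domain_xml)
    name = root.findtext("name", default="<unnamed>")
    macs = [
        mac.get("address").lower()
        for mac in root.findall("./devices/interface/mac")
        if mac.get("address")
    ]
    return name, macs


def find_duplicate_macs(domains_by_pod):
    """Map each MAC used by more than one VM to the (pod, vm) pairs using it.

    domains_by_pod is a hypothetical input: {pod_name: [domain_xml, ...]}.
    """
    seen = defaultdict(list)
    for pod, xml_docs in domains_by_pod.items():
        for doc in xml_docs:
            name, macs = macs_in_domain(doc)
            for mac in macs:
                seen[mac].append((pod, name))
    return {mac: owners for mac, owners in seen.items() if len(owners) > 1}
```

Any MAC that shows up in the result with more than one owner would be worth a closer look, since it would be consistent with the machines ending up without an interface.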