[2.1b2] failure to deploy all systems that lasted until reboot of maas container
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MAAS | Invalid | Undecided | Unassigned | | |
Bug Description
We started seeing deployment failures on our MAAS container: systems would show a PXE installation request, but it did not look like the kernel was ever booted. This was with 2.0 beta1.
There were no rsyslog entries after the failures were observed, and the event logs would show the PXE installation request followed by a timeout:
Node changed status - From 'Deploying' to 'Failed deployment' Fri, 07 Oct. 2016 13:20:36
Marking node failed - Machine operation 'Deploying' timed out after 40 minutes. Fri, 07 Oct. 2016 13:20:36
PXE Request - installation Fri, 07 Oct. 2016 12:39:06
PXE Request - installation Fri, 07 Oct. 2016 12:39:06
Node powered on Fri, 07 Oct. 2016 12:38:40
Powering node on
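To sanity-check the timing in the event log above, here is a rough sketch that computes the gap between the logged events. The helper name `minutes_between` and the format string are illustrative assumptions based only on the timestamps as they appear in this report, not anything MAAS provides.

```python
from datetime import datetime

# Assumed layout of the event-log timestamps shown in this report,
# e.g. "Fri, 07 Oct. 2016 13:20:36" (parsing relies on English day/month names).
EVENT_TIME_FORMAT = "%a, %d %b. %Y %H:%M:%S"


def minutes_between(start: str, end: str) -> float:
    """Return the elapsed minutes between two event-log timestamps."""
    started = datetime.strptime(start, EVENT_TIME_FORMAT)
    ended = datetime.strptime(end, EVENT_TIME_FORMAT)
    return (ended - started).total_seconds() / 60


# Timestamps copied from the event log in this report.
powered_on = "Fri, 07 Oct. 2016 12:38:40"
last_pxe = "Fri, 07 Oct. 2016 12:39:06"
marked_failed = "Fri, 07 Oct. 2016 13:20:36"

print(f"power on -> failed: {minutes_between(powered_on, marked_failed):.1f} min")
print(f"last PXE -> failed: {minutes_between(last_pxe, marked_failed):.1f} min")
```

Both gaps come out slightly over 40 minutes, which is in line with the reported 40 minute 'Deploying' timeout and with nothing else being logged after the PXE requests.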
This may be due to bug 1631403, which we saw simultaneously, but I am opening this bug in case they are separate issues.
After upgrading to Beta2 the deployment issues continued. Then, after rebooting the container, the issue went away.
$ dpkg -l '*maas*'|cat
Desired=
| Status=
|/ Err?=(none)
||/ Name Version Architecture Description
+++-===
ii maas 2.1.0~beta2+
ii maas-cli 2.1.0~beta2+
un maas-cluster-
ii maas-common 2.1.0~beta2+
ii maas-dhcp 2.1.0~beta2+
ii maas-dns 2.1.0~beta2+
ii maas-proxy 2.1.0~beta2+
ii maas-rack-
ii maas-region-api 2.1.0~beta2+
ii maas-region-
un maas-region-
un python-django-maas <none> <none> (no description available)
un python-maas-client <none> <none> (no description available)
un python-
ii python3-django-maas 2.1.0~beta2+
ii python3-maas-client 2.1.0~beta2+
ii python3-
Changed in maas: | |
milestone: | 2.1.0 → 2.1.1 |
Changed in maas: | |
milestone: | 2.1.1 → 2.1.2 |
Changed in maas: | |
milestone: | 2.1.2 → 2.1.3 |
I wonder if this is related to a network issue that was solved by rebooting the container (maybe related to iSCSI).