OVB on centos8 fails because of networking failures
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
tripleo | Fix Released | Critical | wes hayutin | |
Bug Description
When running OVB on CentOS 8, the overcloud deployment fails during network configuration, at "TASK [Ensure network service is enabled]".
https:/
Sagi (Sergey) Shnaidman (sshnaidm) wrote : | #1 |
Luke Short (ekultails) wrote : | #2 |
I've been doing some troubleshooting in this bug, too: https:/
Luke Short (ekultails) wrote : | #3 |
I think this boils down to EL 8 not supporting the legacy eth* interface naming. We can see from here that something tries to bring up the ens3 device, but for some strange reason it complains about eth0 when doing so. I do not see eth0 in the ens3 configuration (or vice versa).
http://
https:/
https:/
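A minimal sketch of how the mismatch described above could be checked on a node: it lists ifcfg files whose DEVICE does not match any interface that is actually present. The function names `ifcfg_device` and `find_stale_ifcfg` are hypothetical and not part of any TripleO tool; the sketch assumes plain `KEY=value` ifcfg files.

```python
from pathlib import Path


def ifcfg_device(path: Path) -> str:
    """Return the DEVICE= value of an ifcfg file, else the filename suffix."""
    for line in path.read_text().splitlines():
        key, sep, value = line.strip().partition("=")
        if sep and key == "DEVICE":
            # Values may be quoted, e.g. DEVICE="ens3"
            return value.strip("\"'")
    return path.name[len("ifcfg-"):]


def find_stale_ifcfg(scripts_dir, present):
    """List ifcfg files that name a device not found in `present`."""
    stale = []
    for path in sorted(Path(scripts_dir).glob("ifcfg-*")):
        device = ifcfg_device(path)
        # The loopback profile is skipped; it is not a NIC profile.
        if device != "lo" and device not in present:
            stale.append(path.name)
    return stale
```

On a real node this could be called as `find_stale_ifcfg("/etc/sysconfig/network-scripts", os.listdir("/sys/class/net"))`, assuming the usual EL locations; a leftover ifcfg-ens3 on a host that only has eth0 would then show up in the result.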
Alan Pevec (apevec) wrote : | #4 |
> EL 8 not supporting the legacy eth* interface naming
How did it work in rhel8 jobs?
We've added net.ifnames=0 in c8 images to match what rhel8 images had:
https:/
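To verify that an image really boots with net.ifnames=0, the kernel command line can be inspected. The following is a rough sketch; `legacy_names_enabled` is a hypothetical helper, and it assumes that when the argument appears more than once the last occurrence is the one that counts.

```python
def legacy_names_enabled(cmdline: str) -> bool:
    """True if the kernel command line asks for legacy eth* naming."""
    value = None
    for arg in cmdline.split():
        if arg.startswith("net.ifnames="):
            # Assumption: a later occurrence overrides an earlier one.
            value = arg.split("=", 1)[1]
    return value == "0"
```

On a running node the input would typically come from `/proc/cmdline`, for example `legacy_names_enabled(open("/proc/cmdline").read())`.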
wes hayutin (weshayutin) wrote : | #5 |
GRUB_CMDLINE_
Mar 05 16:58:01 localhost kernel: Command line: BOOT_IMAGE=
device (eth0): state change: unmanaged -> unavailable (reason 'managed', sys-iface-state: 'external')
Mar 05 16:58:08 overcloud-
Mar 05 16:58:08 overcloud-
Mar 05 16:58:08 overcloud-
Mar 05 16:58:08 overcloud-
Mar 05 16:58:08 overcloud-
info> [1583427488.7922] policy: set 'System eth0' (eth0) as default for IPv4 routing and DNS
Mar 05 16:58:08 overcloud-
Mar 05 16:58:08 overcloud-
reason="No suitable device found for this connection (device eth0 not available because profile is not compatible with device (mismatching interface name))."
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-
Mar 05 16:58:39 overcloud-co...
wes hayutin (weshayutin) wrote : | #6 |
I see the following errors when br-ex is added to the NIC.
## CENTOS-8 ##
Mar 05 16:58:08 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
Mar 05 17:12:00 overcloud-
## RHEL 8 ##
Feb 25 02:15:45 overcloud-
Feb 25 02:35:50 overcloud-
Feb 25 02:35:50 overcloud-
Feb 25 02:35:50 overcloud-
Feb 25 02:35:50 overcloud-
Feb 25 02:35:50 overcloud-
Feb 25 02:35:50 overcloud-
wes hayutin (weshayutin) wrote : | #7 |
that warning may be related to tripleo-
wes hayutin (weshayutin) wrote : | #8 |
centos-8 openvswitch-
rhel-8 openvswitch2.
not sure if that makes a diff..
This is pretty noisy though
https:/
tags: | added: promotion-blocker |
yatin (yatinkarel) wrote : | #9 |
Some updates: the cause turned out to be a garbage ifcfg-ens3 file in network-scripts on the overcloud nodes. To confirm this, for testing we cleaned it up before running NetworkConfig https:/
Green runs:-
https:/
https:/
Fix for the ens3 issue:-
ens3 can be cleaned up in the base image used to prepare overcloud-full or in some disk image element.
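A sketch of what such a cleanup step might look like when run against an image root. This is only an illustration and not the change that was merged; `remove_stale_ifcfg` and its `root` parameter are hypothetical, and the path assumes the standard EL network-scripts location.

```python
from pathlib import Path


def remove_stale_ifcfg(root, device="ens3"):
    """Delete ifcfg-<device> under the given image root if it exists.

    Returns True when a file was removed, False when there was nothing to do.
    """
    path = Path(root) / "etc/sysconfig/network-scripts" / f"ifcfg-{device}"
    if path.is_file():
        path.unlink()
        return True
    return False
```

Because the function does nothing when the file is absent, it is safe to run against images that never shipped the stale file.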
Though we have seen two green runs for c8 OVB, a couple of random issues were noticed during multiple runs:-
1) Mar 07 17:13:27 overcloud-
Mar 07 17:13:27 overcloud-
Mar 07 17:13:27 overcloud-
Logs:- https:/
2) Baremetal nodes go to deploy_failed state randomly:-
Logs:-
https:/
3) Tempest failures:-
Logs:- https:/
4) Baremetal nodes provision failed due to qemu scientific notation bug:- https:/
This is already fixed in oslo.utils, but the required patch is not yet available in current-tripleo. /me didn't get why it failed randomly with the same overcloud-full image.
5) overcloud deployment just gets stuck
For this, logs didn't get collected, but I noticed it in another undercloud job too; there the VM console log was collected, and it turned out to be a kernel panic http://
I think all these random issues need to be fixed/diagnosed separately, as they need different expertise.
yatin (yatinkarel) wrote : | #10 |
<< Fix for the ens3 issue:-
<< ens3 can be cleaned up in the base image used to prepare overcloud-full or in some disk image element.
Reported bug against CentOS https:/
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-image-elements (master) | #11 |
Fix proposed to branch: master
Review: https:/
Changed in tripleo: | |
assignee: | nobody → Alex Schultz (alex-schultz) |
status: | Triaged → In Progress |
Changed in tripleo: | |
assignee: | Alex Schultz (alex-schultz) → wes hayutin (weshayutin) |
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-puppet-elements (master) | #12 |
Fix proposed to branch: master
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-image-elements (master) | #13 |
Change abandoned by Alex Schultz (<email address hidden>) on branch: master
Review: https:/
Reason: This doesn't actually execute because it's a dependency on the os-net-config element but we don't use that element. I'm actually kinda concerned that this element isn't executed at all. Anyway https:/
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-common (master) | #14 |
Related fix proposed to branch: master
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-image-elements (master) | #15 |
Reviewed: https:/
Committed: https:/
Submitter: Zuul
Branch: master
commit d641e019b90f931
Author: Alex Schultz <email address hidden>
Date: Wed Mar 11 10:00:44 2020 -0600
Cleanup stale interface if exists
There is a bug in the CentOS 8 image where the ens3 interface file
exists. We should clean that up if it exists to prevent issues when
booting. We manage the interfaces later with os-net-config so we don't
want them to exist in the image.
Change-Id: I95d851f194a524
Closes-Bug: #1866202
Changed in tripleo: | |
status: | In Progress → Fix Released |
OpenStack Infra (hudson-openstack) wrote : Related fix merged to tripleo-common (master) | #16 |
Reviewed: https:/
Committed: https:/
Submitter: Zuul
Branch: master
commit 2589b8f22ad2d26
Author: Alex Schultz <email address hidden>
Date: Wed Mar 11 15:20:22 2020 -0600
Add interface-names to centos8 images
Back in Bug #1841441, we disabled the net.ifnames because of the
RHEL7->RHEL8 changes to interface names. Now that we have centos8, we
need to ensure this action is also run on those images.
Depends-On: https:/
Change-Id: Ice40fec0eacefd
Related-Bug: #1866202
Related-Bug: #1841441
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-image-elements (stable/train) | #17 |
Fix proposed to branch: stable/train
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-common (stable/train) | #18 |
Related fix proposed to branch: stable/train
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-image-elements (stable/train) | #19 |
Reviewed: https:/
Committed: https:/
Submitter: Zuul
Branch: stable/train
commit 8c91b4651e9dff6
Author: Alex Schultz <email address hidden>
Date: Wed Mar 11 10:00:44 2020 -0600
Cleanup stale interface if exists
There is a bug in the CentOS 8 image where the ens3 interface file
exists. We should clean that up if it exists to prevent issues when
booting. We manage the interfaces later with os-net-config so we don't
want them to exist in the image.
Change-Id: I95d851f194a524
Closes-Bug: #1866202
(cherry picked from commit d641e019b90f931
tags: | added: in-stable-train |
OpenStack Infra (hudson-openstack) wrote : Related fix merged to tripleo-common (stable/train) | #20 |
Reviewed: https:/
Committed: https:/
Submitter: Zuul
Branch: stable/train
commit e64cccf5e6cb6bd
Author: Alex Schultz <email address hidden>
Date: Wed Mar 11 15:20:22 2020 -0600
Add interface-names to centos8 images
Back in Bug #1841441, we disabled the net.ifnames because of the
RHEL7->RHEL8 changes to interface names. Now that we have centos8, we
need to ensure this action is also run on those images.
Depends-On: https:/
Change-Id: Ice40fec0eacefd
Related-Bug: #1866202
Related-Bug: #1841441
(cherry picked from commit 2589b8f22ad2d26
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-puppet-elements (master) | #21 |
Change abandoned by Alex Schultz (<email address hidden>) on branch: master
Review: https:/
Reason: https:/
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-image-elements 11.0.2 | #22 |
This issue was fixed in the openstack/
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-quickstart-extras (master) | #23 |
Related fix proposed to branch: master
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-quickstart-extras (master) | #24 |
Change abandoned by Sagi Shnaidman (<email address hidden>) on branch: master
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-image-elements 10.6.2 | #25 |
This issue was fixed in the openstack/
OpenStack Infra (hudson-openstack) wrote : Related fix merged to tripleo-quickstart-extras (master) | #26 |
Reviewed: https:/
Committed: https:/
Submitter: "Zuul (22348)"
Branch: master
commit 15b7b6ce5d244d8
Author: yatinkarel <email address hidden>
Date: Fri May 15 15:18:48 2020 +0530
Install atop in CentOS8 from RDO CentOS8 repo
Currently atop in CentOS8 jobs is installed from
Epel 7 repo which is wrong. Since atop is available
in RDO build deps repo let's use that when running
on CentOS8[1].
Also seen kernel panic related to atop as described
in related bug.
[1] https:/
Related-Bug: #1866202
Change-Id: I4f605615fb1bdc
OpenStack Infra (hudson-openstack) wrote : Related fix proposed to tripleo-quickstart-extras (master) | #27 |
Related fix proposed to branch: master
Review: https:/
OpenStack Infra (hudson-openstack) wrote : Related fix merged to tripleo-quickstart-extras (master) | #28 |
Reviewed: https:/
Committed: https:/
Submitter: "Zuul (22348)"
Branch: master
commit 89b425b81b23a50
Author: yatinkarel <email address hidden>
Date: Tue Jun 29 10:18:33 2021 +0530
Move atop installation after repo setup
[1] switched atop installation from RDO repos but
repos are getting setup later, so let's move atop
installation after repo setup.
Related-Bug: #1866202
Change-Id: I378e58eacd17d9
patch that temporarily runs ovb on centos8: https://review.rdoproject.org/r/#/c/25666/
new log: https://logserver.rdoproject.org/66/25666/7/check/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master/4fc1241/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz