Bootstrap timeout for multiracks

Bug #1606914 reported by Dmitry Belyaninov
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Committed
High
Alex Schultz
Mitaka
Fix Released
High
Alex Schultz
Revision history for this message
Dmitry Belyaninov (dbelyaninov) wrote :
Revision history for this message
Dmitry Belyaninov (dbelyaninov) wrote :
Revision history for this message
Alexander Kislitsky (akislitsky) wrote :

Bootstrap of admin node is failed. Seems that it is product-ci server issue.

Revision history for this message
Alexander Kislitsky (akislitsky) wrote :
tags: added: swarm-blocker
Changed in fuel:
milestone: none → 9.1
assignee: nobody → Fuel Sustaining (fuel-sustaining-team)
importance: Undecided → High
status: New → Confirmed
Revision history for this message
Alexander Kislitsky (akislitsky) wrote :

On the problem environment we have only 3 bootstraped nodes. In virsh we can see all 5 nodes as active. After connecting to the problem node we can see, that configuration of network interfaces is failed: see attachment.

Revision history for this message
Alexander Kislitsky (akislitsky) wrote :
Revision history for this message
Alexander Kislitsky (akislitsky) wrote :

Seems that we have no network interfaces for the nodes 4, 5

virsh # net-list --all
 Name State Autostart Persistent
----------------------------------------------------------
 10.0.system_test.ubuntu.ha_neutron.104.104_admin active yes yes
 10.0.system_test.ubuntu.ha_neutron.104.104_management active yes yes
 10.0.system_test.ubuntu.ha_neutron.104.104_private active yes yes
 10.0.system_test.ubuntu.ha_neutron.104.104_public active yes yes
 10.0.system_test.ubuntu.ha_neutron.104.104_storage active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_admin active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_admin2 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_admin3 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_management active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_management2 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_management3 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_private active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_private2 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_private3 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_public active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_public2 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_public3 active yes yes
 9.x.system_test.ubuntu.multiracks_2.9_storage active yes yes
 9.x.system_test.ubuntu.repeatable_image_based.9_admin active yes yes
 9.x.system_test.ubuntu.repeatable_image_based.9_management active yes yes
 9.x.system_test.ubuntu.repeatable_image_based.9_private active yes yes
 9.x.system_test.ubuntu.repeatable_image_based.9_public active yes yes
 9.x.system_test.ubuntu.repeatable_image_based.9_storage active yes yes

Revision history for this message
Alexander Kislitsky (akislitsky) wrote :

But we have networks in the configuration for vm:
virsh dumpxml 9.x.system_test.ubuntu.multiracks_2.9_slave-05

Moreover configurations for successfully bootstarped and problem nodes are identical.

Revision history for this message
Alexander Kislitsky (akislitsky) wrote :

For fix this bug we need PXE expert. Bug is stable reproduced by job https://product-ci.infra.mirantis.net/job/9.x.system_test.ubuntu.thread_7/

Changed in fuel:
assignee: Fuel Sustaining (fuel-sustaining-team) → Georgy Kibardin (gkibardin)
status: Confirmed → In Progress
Revision history for this message
Georgy Kibardin (gkibardin) wrote :

Guys, were there any changes recently on srv85? Like reinstalling it from scratch, or just libvirt or fuel-devops update? It, actually, was green quite recently.

Also, please ensure that syslinux is installed on the slave where the test is being run.

Changed in fuel:
status: In Progress → Incomplete
Revision history for this message
Alex Schultz (alex-schultz) wrote :

The success/failure split is when we actually started applying the updates to swarm I believe. Previously the swarm runs weren't getting updated packages. https://review.fuel-infra.org/#/c/23680/ so it's definitely a regression

Revision history for this message
Alex Schultz (alex-schultz) wrote :

I tracked this down, so I think it's related to https://review.openstack.org/#/c/343317/. The issue with the environment is that the cobbler dnsmasq is not listening on the additional address for the other network. I manually updated cobbler to include the second rack network and the nodes bootstrapped.

Changed in fuel:
status: Incomplete → Confirmed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (stable/mitaka)

Fix proposed to branch: stable/mitaka
Review: https://review.openstack.org/349721

no longer affects: fuel/newton
Changed in fuel:
assignee: Georgy Kibardin (gkibardin) → Alex Schultz (alex-schultz)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/349735

Changed in fuel:
status: Confirmed → In Progress
Revision history for this message
Alex Schultz (alex-schultz) wrote :

ETA: 8/3

Dmitry Pyzhov (dpyzhov)
tags: added: 9.1-proposed
Revision history for this message
Dmitry Belyaninov (dbelyaninov) wrote :

Appropriate jobs are passed on snapshot #136.
Mova to FixReleased state

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/349735
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=acbb2ee849bd617ab5eac9164be3fabaf0315260
Submitter: Jenkins
Branch: master

commit acbb2ee849bd617ab5eac9164be3fabaf0315260
Author: Alex Schultz <email address hidden>
Date: Mon Aug 1 16:05:46 2016 -0600

    Update dnsmasq for additional networks

    This change includes an updated listen-address configuration for the
    additional admin networks that can be configured. This change looks
    through the puppet facts to try and identify which interface is
    configured for the extra admin network and uses the ip address that
    exists in the admin network cidr as a listen-address for dnsmasq

    Change-Id: I3322b4e945d3bd016a2c1977dc89c2227735b2e8
    Closes-Bug: #1606914

Changed in fuel:
status: In Progress → Fix Committed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/fuel-library 10.0.0rc1

This issue was fixed in the openstack/fuel-library 10.0.0rc1 release candidate.

Revision history for this message
Dmitry Belyaninov (dbelyaninov) wrote :
Revision history for this message
Dmitry Pyzhov (dpyzhov) wrote :

@Dmitry, it is a bad practice to reopen issue with merged, tested and released fix when you don't have strong evidence that you faced this very issue again. Please create new report.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/fuel-library 10.0.0

This issue was fixed in the openstack/fuel-library 10.0.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.