system tests booting of 5 slave nodes somtimes fails

Bug #1317213 reported by Vladimir Kuklin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Invalid
High
Fuel QA Team

Bug Description

{"build_id": "2014-05-07_01-00-26", "mirantis": "yes", "build_number": "184", "ostf_sha": "fe718434f88f2ab167779770828a195f06eb29f8", "nailgun_sha": "6b5e2797dae6a3b803b3ac7102ae3d9164a013cf", "production": "docker", "api": "1.0", "fuelmain_sha": "cb99264a1d5c5cd949f95a835f86ea1eb00c766f", "astute_sha": "9c83d3ecec69df03cd94620e2df92249ba4ec786", "release": "5.0", "fuellib_sha": "616a164132a6a195b756236923bc2f4345bc93f0"}

try to run HA test with 5 slave nodes. One of the nodes cannot boot over PXE.
Also, all the container services are configured and work fine. All the other nodes boot just fine. It looks like cobbler/dnsmasq/tftpd server cannot handle 5 simultaneous connections sometimes.

This may be due to highload issues

Tags: system-tests
description: updated
tags: added: system-tests
Revision history for this message
Vladimir Kuklin (vkuklin) wrote :

after snapshot revert it seems that this some issue with dhcrelay performance or iPXE emulation of slaves. if I insert sleep for 1 second between slaves starts the problem goes away

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-main (master)

Fix proposed to branch: master
Review: https://review.openstack.org/92657

Changed in fuel:
assignee: nobody → Vladimir Kuklin (vkuklin)
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-main (master)

Reviewed: https://review.openstack.org/92657
Committed: https://git.openstack.org/cgit/stackforge/fuel-main/commit/?id=81348bd8c8f7cba312ff76872ec1ec464967acd2
Submitter: Jenkins
Branch: master

commit 81348bd8c8f7cba312ff76872ec1ec464967acd2
Author: Vladimir Kuklin <email address hidden>
Date: Wed May 7 22:45:11 2014 +0400

    Add 2-sec sleep between slave nodes start

    There are some issues either with iPXE or
    dhcrelay which lead to the problem with
    PXE booting of virtual slaves. This
    workaround makes bootstrapping success
    more likely

    Change-Id: I85584f61cd06a7c5cd1f73e6638c9475d0f86ac2
    Partial-Bug: #1317213

Revision history for this message
Mike Scherbakov (mihgen) wrote :

Vladimir is visiting OpenStack design summit, so deferring this to 5.1 to verify.

Changed in fuel:
status: In Progress → Confirmed
milestone: 5.0 → 5.1
assignee: Vladimir Kuklin (vkuklin) → Fuel QA Team (fuel-qa)
Changed in fuel:
status: Confirmed → Triaged
Revision history for this message
Nastya Urlapova (aurlapova) wrote :
Changed in fuel:
status: Triaged → Invalid
Mike Scherbakov (mihgen)
Changed in fuel:
milestone: 5.1 → 5.0
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-main (stable/4.1)

Fix proposed to branch: stable/4.1
Review: https://review.openstack.org/99169

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-main (stable/4.1)

Reviewed: https://review.openstack.org/99169
Committed: https://git.openstack.org/cgit/stackforge/fuel-main/commit/?id=947f94c733eade81a6b79c98155ef027d883cc60
Submitter: Jenkins
Branch: stable/4.1

commit 947f94c733eade81a6b79c98155ef027d883cc60
Author: Vladimir Kuklin <email address hidden>
Date: Wed May 7 22:45:11 2014 +0400

    Add 2-sec sleep between slave nodes start

    There are some issues either with iPXE or
    dhcrelay which lead to the problem with
    PXE booting of virtual slaves. This
    workaround makes bootstrapping success
    more likely

    Change-Id: I85584f61cd06a7c5cd1f73e6638c9475d0f86ac2
    Partial-Bug: #1317213
    (cherry picked from commit 81348bd8c8f7cba312ff76872ec1ec464967acd2)

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.