[regression] Test deploy_ceph_net_tmpl failed with "Loop detected" error

Bug #1583629 reported by ElenaRossokhina
44
This bug affects 8 people
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Released
Critical
Aleksandr Didenko
Mitaka
Fix Released
Critical
Aleksandr Didenko

Bug Description

Detailed bug description:
Test deploy_ceph_net_tmpl failed with following message
Deployment has failed. Method task_deploy. Cluster[]: Loop detected! Path: Task[netconfig/1], Task[hosts/1], Task[primary-cluster/1], Task[virtual_ips/1], Task[netconfig/1].
Steps to reproduce:
run 'deploy_ceph_net_tmpl' group
Expected results:
pass
Actual result:
deployment fails due to loop detection

fuel-9.0-335-2016-05-19_08-00-59.iso
here is diagnostic snapshot https://drive.google.com/open?id=0B2ag_Bf-ShtTQkhCRVNFWFk1d2s

Revision history for this message
Vladimir Kuklin (vkuklin) wrote :

Regression was introduced by https://github.com/openstack/fuel-library/commit/7903d3d152503da746d2610bb54a4ab5e57ad6bf

role combination creates a loop. netconfig and netconfig-controller tasks should be exclusive

Changed in fuel:
assignee: nobody → Aleksandr Didenko (adidenko)
Dmitry Klenov (dklenov)
tags: added: area-library
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/318763

Changed in fuel:
status: Confirmed → In Progress
Revision history for this message
Oleksiy Butenko (obutenko) wrote : Re: Test deploy_ceph_net_tmpl failed with "Loop detected" error

This bug blocker for tempest ci

tags: added: blocker-for-qa
Revision history for this message
Timur Nurlygayanov (tnurlygayanov) wrote :

The deployment of all 4 popular OpenStack configuration are blocked because of the issue, priority changed to Critical.

tags: added: tempest
summary: - Test deploy_ceph_net_tmpl failed with "Loop detected" error
+ [regression] Test deploy_ceph_net_tmpl failed with "Loop detected" error
tags: added: swarm-blocker
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/318763
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=7c106ae13ee38bb5ad91f30ef3a7d58e7c8c2227
Submitter: Jenkins
Branch: master

commit 7c106ae13ee38bb5ad91f30ef3a7d58e7c8c2227
Author: Aleksandr Didenko <email address hidden>
Date: Thu May 19 18:16:11 2016 +0200

    Fix netconfig task loop for multiroles

    Add strict condition to yaql_exp for netconfig task to not run it
    on controllers.

    Change-Id: I6faf8ff7cf97353d9c6b79265d5f9fb16afb57b3
    Closes-bug: #1583629

Changed in fuel:
status: In Progress → Fix Committed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (stable/mitaka)

Fix proposed to branch: stable/mitaka
Review: https://review.openstack.org/319389

Revision history for this message
Sergey Novikov (snovikov) wrote :
Dmitry Pyzhov (dpyzhov)
no longer affects: fuel/newton
Revision history for this message
Aleksandr Didenko (adidenko) wrote :

Btw, maybe we should add multirole case to Fuel-CI? Considering that now we have complex task-based orchestration this could be a good addition to our CI coverage. Any role + controller would be sufficient. For instance (copy-paste from the test group that detected this regression):

            3. Add 3 controller + ceph nodes
            4. Add 2 compute + ceph nodes

Revision history for this message
Anastasia Kuznetsova (akuznetsova) wrote :

The same issue has occurred on ISO 9.0-mos 388 during following test case:
Scenario:
            1. Create cluster
            2. Add 3 node with controller role and mongo
            3. Add 1 nodes with compute role
            4. Set Ceilometer install option
            5. Deploy the cluster
            6. Verify Heat and Ceilometer services
            7. Run OSTF platform tests
https://product-ci.infra.mirantis.net/job/9.0.system_test.ubuntu.services_ha/115/testReport/(root)/deploy_heat_ha/deploy_heat_ha/

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (stable/mitaka)

Reviewed: https://review.openstack.org/319389
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=d850ed5808a783fc6cb447ea2144fb4798ddd75d
Submitter: Jenkins
Branch: stable/mitaka

commit d850ed5808a783fc6cb447ea2144fb4798ddd75d
Author: Aleksandr Didenko <email address hidden>
Date: Thu May 19 18:16:11 2016 +0200

    Fix netconfig task loop for multiroles

    Add strict condition to yaql_exp for netconfig task to not run it
    on controllers.

    Change-Id: I6faf8ff7cf97353d9c6b79265d5f9fb16afb57b3
    Closes-bug: #1583629
    (cherry picked from commit 7c106ae13ee38bb5ad91f30ef3a7d58e7c8c2227)

Revision history for this message
Kyrylo Romanenko (kromanenko) wrote :

This bug occurs also on environment without ceph:
ironic_deploy_swift
        Scenario:
            1. Create cluster
            2. Add 1 node with controller role
            3. Add 2 node with controller+ironic role
            4. Add 1 node with compute role
            5. Add 1 nodes with ironic role
            6. Deploy the cluster
...
AssertionError: Task 'deploy' has incorrect status. error != ready, 'Deployment has failed. Method task_deploy. Cluster[]: Loop detected! Path: Task[netconfig/1], Task[hosts/1], Task[primary-cluster/1], Task[virtual_ips/1], Task[netconfig/1].
Inspect Astute logs for the details'

Revision history for this message
Ivan Lozgachev (ilozgachev) wrote :

Has been reproduced on MOS 9.0 build 390 with configuration:

1 controller + mongo
2 compute
1 cinder

Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :
Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :
Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :

Possible, fix has not been on iso at the time swarm started

Revision history for this message
Sergey Novikov (snovikov) wrote :
Revision history for this message
Sergey Novikov (snovikov) wrote :
Revision history for this message
Artem Minasyan (aminasyan) wrote :
Revision history for this message
Kyrylo Romanenko (kromanenko) wrote :

Bug is still reproduced on today`s ISO 394 (2016-05-24).

Cluster:
1 Controller+Ironic
1 Compute

Error:
Deployment has failed. Method task_deploy. Cluster[]: Loop detected! Path: Task[netconfig/1], Task[hosts/1], Task[primary-cluster/1], Task[virtual_ips/1], Task[netconfig/1].

Revision history for this message
Alexander Gubanov (ogubanov) wrote :

Bug again reproduced, this is affected Tempest CI with merged fix https://review.openstack.org/#/c/318763/

[root@nailgun ~]# shotgun2 short-report
cat /etc/fuel_build_id:
 394
cat /etc/fuel_build_number:
 394
cat /etc/fuel_release:
 9.0
cat /etc/fuel_openstack_version:
 mitaka-9.0

details http://pastebin.com/GMnvukQt
Snapshot is attached

Changed in fuel:
status: Fix Committed → Confirmed
Revision history for this message
Aleksandr Didenko (adidenko) wrote :

Yep, I confirm the bug is still there. yaql_exp does not remove task from the graph, it adds it into the graph with 'type=>skipped'. So task won't be executed, but the loop in the graph is still there.

Revision history for this message
Artem Minasyan (aminasyan) wrote :

This bug reproduced on 394 ISO with enabled ceilometer , with 3 controllers and compute with cinder LVM.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/320530

Changed in fuel:
status: Confirmed → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (stable/mitaka)

Fix proposed to branch: stable/mitaka
Review: https://review.openstack.org/320807

Changed in fuel:
status: In Progress → Fix Committed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/320530
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=08b7a0b50aa654696fade182f10808354452b778
Submitter: Jenkins
Branch: master

commit 08b7a0b50aa654696fade182f10808354452b778
Author: Aleksandr Didenko <email address hidden>
Date: Tue May 24 15:19:15 2016 +0200

    Revert all network idempotency related fixes

    Revert "Fix netconfig for envs without controllers/vips"
    This reverts commit 0d009b7facea66a1b88c9a428c8d4250e4790bba.

    Revert "Fix netconfig task loop for multiroles"
    This reverts commit 7c106ae13ee38bb5ad91f30ef3a7d58e7c8c2227.

    Revert "Make network configuration idempotent"
    This reverts commit 7903d3d152503da746d2610bb54a4ab5e57ad6bf.

    Change-Id: I3bfd3f31f53fa7fa99414bea956802371de41340
    Closes-bug: #1583912
    Closes-bug: #1583629
    Closes-bug: #1584002

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (stable/mitaka)

Reviewed: https://review.openstack.org/320807
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=f5879c940f584f9f94e6fac7fc95b43c06d4a3ab
Submitter: Jenkins
Branch: stable/mitaka

commit f5879c940f584f9f94e6fac7fc95b43c06d4a3ab
Author: Aleksandr Didenko <email address hidden>
Date: Wed May 25 09:13:19 2016 +0200

    Revert all network idempotency related fixes

    Revert "Fix netconfig task loop for multiroles"
    This reverts commit d850ed5808a783fc6cb447ea2144fb4798ddd75d.

    Revert "Make network configuration idempotent"
    This reverts commit fdb541f57de7d8769441ce92c2526807d5694017.

    Change-Id: I472341efdd2e38cc355d64a46030ffcf17dfac96
    Closes-bug: #1583912
    Closes-bug: #1583629
    Closes-bug: #1584002

Revision history for this message
Jay Pipes (jaypipes) wrote :

Hi all, what's the status on this? Looks like the stable/mitaka revert was merged. Can this be marked Fix Committed?

Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :

Hi, Jay. During today swarm there were no reproduces. I am closing it as 'Fix Released'

Changed in fuel:
status: Fix Committed → Fix Released
Revision history for this message
Maksym Strukov (unbelll) wrote :

Reproduced manually:

Steps:
1. Install detach-rabbit plugin on master node (https://github.com/openstack/fuel-plugin-detach-rabbitmq.git)
2. Create next cluster - Ubuntu, all default values, separate_rabbit enabled, 3 controllers, 3 rabbitmq_nodes, 1 compute, 1 cinder
3. Deploy cluster

Actual:
Deployment has failed. Method task_deploy. Cluster[]: Loop detected! Path: Task[generate_keys/master], Task[copy_keys/1], Task[pre_deployment_end/virtual_sync_node], Task[deploy_start/4], Task[hiera/4], Task[setup_repositories/4], Task[fuel_pkgs/4], Task[globals/4], Task[rabbitmq-hiera-override/4], Task[logging/4], Task[tools/4], Task[netconfig/4], Task[firewall/4], Task[primary-cluster/4], Task[cluster/3], Task[virtual_ips/3], Task[netconfig/4].
Inspect Astute logs for the details

Env:
9.0-mos-452

Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :
Revision history for this message
Alexandr Kostrikov (akostrikov-mirantis) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Duplicates of this bug

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.