primary-ceph-mon timeout 180 sec is not enough

Bug #1614009 reported by Dmitry Teselkin
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Fuel for OpenStack
Fix Committed
High
Vladimir Kuklin

Bug Description

I've got more that 10 failures of custom bvt_2 job because of the same reason - primary-ceph-mon task failed because of timeout.

astute.log
---
2016-08-17 07:58:44 INFO [16970] Task[primary-ceph-mon/1]: Run on node: Node[1]
2016-08-17 07:58:44 DEBUG [16970] Waiting for puppet to finish deployment on node 1 (timeout = 180 sec)...
...
2016-08-17 08:01:50 DEBUG [16970] Node[1]: Node 1: task primary-ceph-mon, task status running
2016-08-17 08:01:50 WARNING [16970] Puppet agent 1 didn't respond within the allotted time
2016-08-17 08:01:50 DEBUG [16970] Task time summary: primary-ceph-mon with status failed on node 1 took 00:03:06
---

puppet.log
---
2016-08-17 07:59:08 +0000 Scope(Class[Osnailyfacter::Ceph::Mon]) (notice): MODULAR: ceph/mon.pp
...
2016-08-17 07:59:36 +0000 Puppet (debug): Executing '/usr/bin/apt-get -q -y -o DPkg::Options::=--force-confold -o APT::Get::AllowUnauthenticated=1 install ceph'
2016-08-17 08:02:09 +0000 /Stage[main]/Ceph/Package[ceph]/ensure (notice): ensure changed 'purged' to 'present'
2016-08-17 08:02:09 +0000 /Package[ceph] (debug): The container Class[Ceph] will propagate my refresh event
2016-08-17 08:02:09 +0000 /Package[ceph] (info): Evaluated in 153.82 seconds
...
2016-08-17 08:02:21 +0000 Puppet (debug): Processing report from node-1.test.domain.local with processor Puppet::Reports::Store
---

Timeout value 180 sec is too low for that task [1], and looks strange, because other ceph related tasks executed with timeout 300.

[1] https://github.com/openstack/fuel-library/blob/master/deployment/puppet/osnailyfacter/modular/ceph/tasks.yaml#L38

Tags: area-library
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to fuel-library (master)

Fix proposed to branch: master
Review: https://review.openstack.org/356324

Changed in fuel:
assignee: nobody → Dmitry Teselkin (teselkin-d)
status: New → In Progress
Revision history for this message
Dmitry Teselkin (teselkin-d) wrote :
Changed in fuel:
importance: Undecided → High
milestone: none → 10.0
tags: added: area-library
Changed in fuel:
assignee: Dmitry Teselkin (teselkin-d) → Vladimir Kuklin (vkuklin)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to fuel-library (master)

Reviewed: https://review.openstack.org/356324
Committed: https://git.openstack.org/cgit/openstack/fuel-library/commit/?id=d57a2752c895a115bc9bee21060902dbbcd7e194
Submitter: Jenkins
Branch: master

commit d57a2752c895a115bc9bee21060902dbbcd7e194
Author: Dmitry Teselkin <email address hidden>
Date: Wed Aug 17 12:05:07 2016 +0300

    Increase primary-ceph-mon timeout

    Change-Id: I3afcc2c06923ea316b021800fa228240ffbd9f47
    Closes-bug: #1614009

Changed in fuel:
status: In Progress → Fix Committed
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/fuel-library 10.0.0rc1

This issue was fixed in the openstack/fuel-library 10.0.0rc1 release candidate.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/fuel-library 10.0.0

This issue was fixed in the openstack/fuel-library 10.0.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.