multinode promotion jobs timing out at overcloud deploy

Bug #1816026 reported by Quique Llorente on 2019-02-15
This bug affects 1 person
Affects: tripleo
Importance: Critical
Assigned to: Quique Llorente

Bug Description

http://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset018-master/842c47d/job-output.txt.gz
2019-02-15 00:58:13.283460 | primary | Friday 15 February 2019 00:58:13 +0000 (0:00:00.068) 0:00:31.024 *******
2019-02-15 00:58:13.307743 | primary | skipping: [undercloud]
2019-02-15 00:58:13.331631 | primary |
2019-02-15 00:58:13.331818 | primary | TASK [overcloud-deploy : Deploy the overcloud] *********************************
2019-02-15 00:58:13.353011 | primary | Friday 15 February 2019 00:58:13 +0000 (0:00:00.069) 0:00:31.093 *******
2019-02-15 03:19:01.213841 | RUN END RESULT_TIMED_OUT: [untrusted : git.openstack.org/openstack-infra/tripleo-ci/playbooks/tripleo-ci/run-v3.yaml@master]
2019-02-15 03:19:01.214355 | POST-RUN START: [trusted : review.rdoproject.org/config/playbooks/tripleo-ci-periodic-base/post.yaml@master]
2019-02-15 03:19:03.457998 |
2019-02-15 03:19:03.458239 | PLAY [primary:tripleo-ovb-centos-7]
2019-02-15 03:19:03.575181 |
2019-02-15 03:19:03.575434 | TASK [Set zuul-log-path fact]
2019-02-15 03:19:03.641573 | primary | ok
2019-02-15 03:19:03.696398 |
2019-02-15 03:19:03.696643 | TASK [set-zuul-log-path-fact : Set log path for a change]
2019-02-15 03:19:03.778682 | primary | skipping: Conditional result was False
2019-02-15 03:19:03.818512 |
2019-02-15 03:19:03.818883 | TASK [set-zuul-log-path-fact : Set log path for a ref update]
2019-02-15 03:19:03.894968 | primary | skipping: Conditional result was False
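
To see how long the deploy task sat before Zuul gave up, the two timestamps in the log above can be subtracted. This is only a small sketch; the constant and function names are made up for illustration, and the timestamps are copied from the "Deploy the overcloud" task start and the RESULT_TIMED_OUT line.

```python
from datetime import datetime, timedelta

# Timestamps copied from the job output above.
DEPLOY_TASK_START = "2019-02-15 00:58:13.353011"  # overcloud-deploy task begins
RUN_TIMED_OUT = "2019-02-15 03:19:01.213841"      # RUN END RESULT_TIMED_OUT

LOG_TIME_FORMAT = "%Y-%m-%d %H:%M:%S.%f"


def elapsed(start: str, end: str) -> timedelta:
    """Return the time between two job-output timestamps."""
    return (datetime.strptime(end, LOG_TIME_FORMAT)
            - datetime.strptime(start, LOG_TIME_FORMAT))


delta = elapsed(DEPLOY_TASK_START, RUN_TIMED_OUT)
print(delta)  # 2:20:47.860830
```

So the deploy task produced no further output for roughly 2 hours 20 minutes before the run was killed.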

chandan kumar (chkumar246) wrote :

fs37, 30, 19, 18, 17 and 16 all timed out.

chandan kumar (chkumar246) wrote :

From the mistral engine.log, it appears the following error caused the timeout:
https://logs.rdoproject.org/openstack-periodic/git.openstack.org/openstack-infra/tripleo-ci/master/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset016-master/622f9f3/logs/undercloud/var/log/containers/mistral/engine.log.txt.gz?level=ERROR

2019-02-15 00:48:11.534 8 ERROR mistral.engine.task_handler [req-5d25ed5f-19f3-427a-a3b1-6e732b19231a ab1aa24494f14abfa681a06c05a83e76 0561c56b74b84014972d46763e42034f - default default] Failed to run task [error=Can not evaluate YAQL expression [expression=$.running_config_download_workflows[0].id, error=list index out of range, data={}], wf=tripleo.deployment.v1.config_download_deploy, task=fail_deploy_in_progress]:
Traceback (most recent call last):
  File "/usr/lib/python2.7/site-packages/mistral/engine/task_handler.py", line 63, in run_task
    task.run()
  File "/usr/lib/python2.7/site-packages/osprofiler/profiler.py", line 160, in wrapper
    result = f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 453, in run
    self._run_new()
  File "/usr/lib/python2.7/site-packages/osprofiler/profiler.py", line 160, in wrapper
    result = f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 485, in _run_new
    self._schedule_actions()
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 558, in _schedule_actions
    input_dict = self._get_action_input()
  File "/usr/lib/python2.7/site-packages/osprofiler/profiler.py", line 160, in wrapper
    result = f(*args, **kwargs)
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 589, in _get_action_input
    input_dict = self._evaluate_expression(self.task_spec.get_input(), ctx)
  File "/usr/lib/python2.7/site-packages/mistral/engine/tasks.py", line 615, in _evaluate_expression
    ctx_view
  File "/usr/lib/python2.7/site-packages/mistral/expressions/__init__.py", line 100, in evaluate_recursively
    data[key] = _evaluate_item(data[key], context)
  File "/usr/lib/python2.7/site-packages/mistral/expressions/__init__.py", line 79, in _evaluate_item
    return evaluate(item, context)
  File "/usr/lib/python2.7/site-packages/mistral/expressions/__init__.py", line 71, in evaluate
    return evaluator.evaluate(expression, context)
  File "/usr/lib/python2.7/site-packages/mistral/expressions/yaql_expression.py", line 159, in evaluate
    cls).evaluate(trim_expr, data_context)
  File "/usr/lib/python2.7/site-packages/mistral/expressions/yaql_expression.py", line 113, in evaluate
    ", data=%s]" % (expression, str(e), data_context)
YaqlEvaluationException: Can not evaluate YAQL expression [expression=$.running_config_download_workflows[0].id, error=list index out of range, data={}]
: YaqlEvaluationException: Can not evaluate YAQL expression [expression=$.running_config_download_workflows[0].id, error=list index out of range, data={}]
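
The "list index out of range" text in the exception is what Python raises when `[0]` is applied to an empty list. The sketch below is a plain-Python analogue of the failing expression, not Mistral or YAQL code; `first_workflow_id` and `empty_ctx` are hypothetical names, and the assumption is that `running_config_download_workflows` evaluated to an empty list when the task ran.

```python
def first_workflow_id(ctx):
    """Plain-Python analogue of $.running_config_download_workflows[0].id."""
    return ctx["running_config_download_workflows"][0]["id"]


# Assumed state at failure time: no running config-download workflows.
empty_ctx = {"running_config_download_workflows": []}

try:
    first_workflow_id(empty_ctx)
    error = None
except IndexError as exc:
    # Same message that appears inside the YaqlEvaluationException above.
    error = str(exc)

print(error)  # list index out of range
```

In other words, the fail_deploy_in_progress task appears to have been reached even though there was no running workflow to report.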

chandan kumar (chkumar246) wrote :

It also affects fs010.

Changed in tripleo:
assignee: nobody → chandan kumar (chkumar246)
status: Triaged → In Progress

Fix proposed to branch: master
Review: https://review.openstack.org/637522

Changed in tripleo:
assignee: chandan kumar (chkumar246) → Quique Llorente (quiquell)
Changed in tripleo:
assignee: Quique Llorente (quiquell) → chandan kumar (chkumar246)
Changed in tripleo:
assignee: chandan kumar (chkumar246) → Quique Llorente (quiquell)

Change abandoned by Chandan Kumar (<email address hidden>) on branch: master
Review: https://review.openstack.org/637488
Reason: in favor of https://review.openstack.org/#/c/637546/

Reviewed: https://review.openstack.org/637522
Committed: https://git.openstack.org/cgit/openstack/tripleo-common/commit/?id=573f67df3d6216f62467eb526a15be3120f9a01a
Submitter: Zuul
Branch: master

commit 573f67df3d6216f62467eb526a15be3120f9a01a
Author: Quique Llorente <email address hidden>
Date: Mon Feb 18 11:53:19 2019 +0100

    Use len to compare list at mistral

    Looks like mistral has issues using empty list [] comparation to check
    if the list is empty or not, let's just use len.

    This is needed since https://review.openstack.org/#/c/637507/ is not
    merged.

    Change-Id: Ifd04d741bd92b2ada5f181ed87ed5b07c0e01d48
    Closes-Bug: #1816026
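
The idea in this commit, checking the length of the list rather than comparing it to `[]`, can be sketched in plain Python. This is an illustration only, not the workflow change itself; the function names are hypothetical.

```python
def deploy_in_progress(running_workflows):
    """Decide by length, rather than by comparing against an empty list."""
    return len(running_workflows) > 0


def first_workflow_id_or_none(running_workflows):
    """Index the list only after the length check has passed."""
    if not deploy_in_progress(running_workflows):
        return None
    return running_workflows[0]["id"]


print(first_workflow_id_or_none([]))               # None
print(first_workflow_id_or_none([{"id": "wf-1"}])) # wf-1
```

With a length check, the branch that reads `[0].id` is only taken when there is at least one element to read.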

Changed in tripleo:
status: In Progress → Fix Released

Reviewed: https://review.openstack.org/637546
Committed: https://git.openstack.org/cgit/openstack/tripleo-quickstart-extras/commit/?id=d1fd33e82f6c3eb4750561c6f50122d1026bad08
Submitter: Zuul
Branch: master

commit d1fd33e82f6c3eb4750561c6f50122d1026bad08
Author: Chandan Kumar <email address hidden>
Date: Mon Feb 18 19:28:49 2019 +0530

    Set container_binary to podman from stein releases onwards

    https://review.openstack.org/#/c/608452/ Switch the undercloud to
    deploy Podman by default starting from Stein and
    undercloud_container_cli var is setted under undercloud_deploy role
    and it cannot be used in another role which will set
    container_binary to docker but docker is not there in stein leading
    to fail running tempest from container.

    Use sudo while pulling tempest using podman

    Change-Id: I7ee75d71a7c34bdb85232a1378880ff16579c5db
    Related-Bug: #1816026
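
The rule described in this commit message (podman from Stein onwards, docker before that) can be sketched as below. This is a hypothetical Python illustration, not the role's actual code: the `RELEASES` list, the treatment of "master" as newest, and the function name are all assumptions.

```python
# OpenStack release names in chronological order (a partial list, for illustration).
RELEASES = ["queens", "rocky", "stein", "train"]


def container_binary(release: str) -> str:
    """Pick the container CLI for a release: podman from stein onwards."""
    if release == "master":
        # Assumption: master is always newer than any named release.
        return "podman"
    if RELEASES.index(release) >= RELEASES.index("stein"):
        return "podman"
    return "docker"


print(container_binary("rocky"))   # docker
print(container_binary("stein"))   # podman
print(container_binary("master"))  # podman
```

Deriving the value from the release name, rather than from a variable set inside another role, avoids the fallback to docker that the commit message describes.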

Changed in tripleo:
milestone: none → stein-3

This issue was fixed in the openstack/tripleo-common 10.5.0 release.
