periodic integration/component jobs failing "[Zuul] Log Stream did not terminate"

Bug #1890571 reported by Marios Andreou
This bug affects 1 person
Affects   Status         Importance   Assigned to   Milestone
tripleo   Fix Released   Critical     Unassigned

Bug Description

Jobs are failing with 'strange' logs in many places, for example the periodic *train* integration pipeline standalone job [1], the fs1 master baremetal component job [2], and the scen1 [3] and scen4 [4] cinder component jobs. The main failure is simply (e.g. [4]):

        * 2020-08-06 01:56:50.801636 | primary | TASK [undercloud-setup : Run the package installation script] ******************
        * 2020-08-06 02:59:45.524791 | [Zuul] Log Stream did not terminate

This particular case is strange because the deployment is successful [5], including a green tempest run [6]. Each case is slightly different. In the baremetal component fs1 job [2], the output in the undercloud install logs [7] includes blocks of "����".

At [8] the master cloudops component job fails with:

        * 2020-08-06 06:33:25.512079 | primary | TASK [build-test-packages : Fetch local rdoinfo copy] **************************
        * 2020-08-06 07:12:30.702593 | [Zuul] Log Stream did not terminate

And there are no more logs available [9]. Finally, at [10], the ussuri scen2 standalone cloudops component job fails with:

        * 2020-08-06 04:58:15.798502 | primary | TASK [undercloud-setup : Run the package installation script] ******************
        * 2020-08-06 05:58:40.793649 | [Zuul] Log Stream did not terminate

But the overcloud deployment is successful [11] with a green tempest run [12]. I don't think this is a cloud-specific problem. Many of the examples here are running in vexx, but this last one [12] is "cloud: rdo-cloud-tripleo" [13]. The same is true for the scen1/4 cinder component jobs referenced above [3][4].
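
The pattern in the excerpts above (a task line followed, roughly an hour later, by the "[Zuul] Log Stream did not terminate" message) can be checked mechanically across a job-output.txt file. The following is only a minimal sketch: the function name `find_stalled_streams` and the regular expression are illustrative assumptions based on the line format quoted in this report, not part of Zuul or tripleo-ci.

```python
import re
from datetime import datetime

# Message text as quoted in the job-output.txt excerpts in this report.
MARKER = "[Zuul] Log Stream did not terminate"

# Assumed line format, taken from the excerpts:
# "2020-08-06 01:56:50.801636 | primary | TASK [...]"
LINE_RE = re.compile(r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}\.\d+) \| (.*)")


def find_stalled_streams(lines):
    """Return (previous_line_text, gap_seconds) for each marker line.

    Hypothetical helper: pairs every marker line with the last
    timestamped line seen before it and measures the silence between them.
    """
    results = []
    prev = None  # (timestamp, text) of the last timestamped line
    for raw in lines:
        match = LINE_RE.search(raw)
        if not match:
            continue  # skip lines without the assumed timestamp prefix
        ts = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S.%f")
        text = match.group(2)
        if MARKER in text and prev is not None:
            results.append((prev[1], (ts - prev[0]).total_seconds()))
        prev = (ts, text)
    return results


# The two lines quoted from [4] in this report.
sample = [
    "2020-08-06 01:56:50.801636 | primary | TASK [undercloud-setup : "
    "Run the package installation script] ******************",
    "2020-08-06 02:59:45.524791 | [Zuul] Log Stream did not terminate",
]

for task, gap in find_stalled_streams(sample):
    print(f"{gap / 60:.1f} min of silence after: {task}")
```

Running the same function over the lines of a downloaded job-output.txt would show which task the console stream stalled on in each job, which may help when comparing the different failures listed here.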

[1] https://logserver.rdoproject.org/openstack-periodic-integration-stable2/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-train/77c7b75/job-output.txt
[2] https://logserver.rdoproject.org/openstack-component-baremetal/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-baremetal-master/d1081b0/job-output.txt
[3] https://logserver.rdoproject.org/openstack-component-cinder/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario001-standalone-cinder-master/2026b37/job-output.txt
[4] https://logserver.rdoproject.org/openstack-component-cinder/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario004-standalone-cinder-master/e1f5254/job-output.txt
[5] https://logserver.rdoproject.org/openstack-component-cinder/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario004-standalone-cinder-master/e1f5254/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz
[6] https://logserver.rdoproject.org/openstack-component-cinder/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario004-standalone-cinder-master/e1f5254/logs/undercloud/var/log/tempest/stestr_results.html.gz
[7] https://logserver.rdoproject.org/openstack-component-baremetal/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-baremetal-master/d1081b0/logs/undercloud/home/zuul/undercloud_install.log.txt.gz
[8] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-cloudops-master/ac41d8a/job-output.txt
[9] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-cloudops-master/ac41d8a/
[10] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario002-standalone-cloudops-ussuri/d3bbe97/job-output.txt
[11] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario002-standalone-cloudops-ussuri/d3bbe97/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz
[12] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario002-standalone-cloudops-ussuri/d3bbe97/logs/undercloud/var/log/tempest/stestr_results.html.gz
[13] https://logserver.rdoproject.org/openstack-component-cloudops/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario002-standalone-cloudops-ussuri/d3bbe97/zuul-info/inventory.yaml

tags: added: infra
description: updated
Marios Andreou (marios-b) wrote :

13:24 < jpena> chkumar|rover, marios|ruck: anything running on vexxhost is going to have trouble until they make networking stable again
13:24 < chkumar|rover> jpena: registry is also on vexxhost na?
13:24 < jpena> yep
13:24 < chkumar|rover> oh
13:25 < jpena> we moved all our infra to vexxhost (except lists.rdo and www.rdo, which are still WIP)
13:26 < jpena> it's been quite stable so far, until we hit today's issues
13:27 < marios|ruck> jpena: ack thanks but i don't think this one is restricted to vexx i have examples from rdo cloud too
13:27 < marios|ruck> chkumar|rover: jpena: there https://bugs.launchpad.net/tripleo/+bug/1890571
13:27 < jpena> marios|ruck: right, but they are interacting with zuul components running on vexxhost
13:27 < marios|ruck> jpena: i see
13:28 < marios|ruck> jpena: cos this is really random the logs are weird nothing makes sense and they are all different
13:28 < jpena> marios|ruck: yes. Every error message you could find is happening today :-/.

Marios Andreou (marios-b) wrote :

It seems to be gone now; at least I didn't come across the issue on Friday.

Moving this to Fix Released. Please move it back if you find more examples.

Changed in tripleo:
status: Triaged → Fix Released