Master periodic jobs are failing with "msg": "Error running container image prepare: failed"

Bug #1902190 reported by Sandeep Yadav
This bug affects 1 person
Affects: tripleo
Status: In Progress
Importance: Critical
Assigned to: Sagi (Sergey) Shnaidman

Bug Description

Description:-

All the master periodic jobs are failing while modifying container images.

https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-master/21a7770/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz

~~~
PLAY [External deployment step 1] **********************************************
2020-10-30 01:34:16.122192 | fa163ec6-0101-53e4-0bab-000000000102 | TASK | External deployment step 1
2020-10-30 01:34:16.137534 | fa163ec6-0101-53e4-0bab-000000000102 | OK | External deployment step 1 | undercloud | result={
    "changed": false,
    "msg": "Use --start-at-task 'External deployment step 1' to resume from this task"
}
2020-10-30 01:34:16.138464 | fa163ec6-0101-53e4-0bab-000000000102 | TIMING | External deployment step 1 | undercloud | 0:02:28.486354 | 0.02s
2020-10-30 01:34:16.181787 | fa163ec6-0101-53e4-0bab-000000000103 | TIMING | include_tasks | undercloud | 0:02:28.529674 | 0.01s
2020-10-30 01:34:16.214260 | 7cebd66f-bccd-4dc6-9d00-bb688c6ac72d | INCLUDED | /home/zuul/standalone-ansible-5yincrkd/external_deploy_steps_tasks_step1.yaml | undercloud
2020-10-30 01:34:16.244842 | fa163ec6-0101-53e4-0bab-000000000d01 | TASK | Container image prepare
2020-10-30 01:37:32.387611 | fa163ec6-0101-53e4-0bab-000000000d01 | FATAL | Container image prepare | undercloud | error={"changed": false, "error": "Modifying image 192.168.24.1:8787/tripleomaster/openstack-horizon:938acab2d960d1e9b3ede6e4d35983a8 failed", "msg": "Error running container image prepare: Modifying image 192.168.24.1:8787/tripleomaster/openstack-horizon:938acab2d960d1e9b3ede6e4d35983a8 failed", "params": {}, "success": false}
2020-10-30 01:37:32.390844 | fa163ec6-0101-53e4-0bab-000000000d01 | TIMING | tripleo_container_image_prepare : Container image prepare | undercloud | 0:05:44.738731 | 196.14s
~~~

https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-standalone-master/21a7770/logs/undercloud/var/log/tripleo-container-image-prepare.log.txt.gz
~~~
00000000003d | FATAL | Run yum_update.sh | localhost | error={"changed": true, "cmd": ["buildah", "run", "--volume", "/tmp/ansible.khxgug4e:/tmp/yum_update.sh", "--volume", "/etc/yum.repos.d:/etc/yum.repos.d", "--volume", "/etc/pki:/etc/pki", "--user", "root", "--net", "host", "openstack-horizon-working-container", "/tmp/yum_update.sh", "delorean-current"], "delta": "0:00:01.126458", "end": "2020-10-30 01:35:36.926783", "msg": "non-zero return code", "rc": 1, "start": "2020-10-30 01:35:35.800325", "stderr": "standard_init_linux.go:211: exec user process caused \"exec format error\"", "stderr_lines": ["standard_init_linux.go:211: exec user process caused \"exec format error\""], "stdout": "", "stdout_lines": []}
2020-10-30 01:35:36.956845 | fa163ec6-0101-5745-670c-00000000003d | TIMING | tripleo-modify-image : Run yum_update.sh | localhost | 0:00:03.961988 | 1.31s
2020-10-30 01:35:36.960747 | fa163ec6-0101-5745-670c-00000000003e | TASK | Run yum_update.sh (retry without yum cache)
2020-10-30 01:35:37.845219 | fa163ec6-0101-5745-670c-00000000003e | RETRY | Run yum_update.sh (retry without yum cache) | localhost | 2 retries

~~~
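For triage, the underlying error can be pulled out of a FATAL line like the ones above with a short script. This is only a minimal sketch: it assumes the pipe-separated layout seen in these excerpts (`... | FATAL | <task> | <host> | error={json}`), and the name `parse_fatal` is a hypothetical helper, not part of tripleo or Ansible.

```python
import json
import re

# Assumed layout, taken from the log excerpts in this report:
#   <prefix> | FATAL | <task name> | <host> | error={...JSON...}
FATAL_RE = re.compile(
    r"\|\s*FATAL\s*\|\s*(?P<task>[^|]+?)\s*\|\s*(?P<host>[^|]+?)\s*\|"
    r"\s*error=(?P<payload>\{.*\})\s*$"
)


def parse_fatal(line):
    """Return task, host, rc and the most specific message of a FATAL line.

    Returns None when the line is not a FATAL line or its payload is not
    valid JSON (hypothetical helper, for illustration only).
    """
    match = FATAL_RE.search(line)
    if match is None:
        return None
    try:
        payload = json.loads(match.group("payload"))
    except json.JSONDecodeError:
        return None
    # Prefer stderr (the real cause) over the generic error/msg fields.
    message = payload.get("stderr") or payload.get("error") or payload.get("msg")
    return {
        "task": match.group("task"),
        "host": match.group("host"),
        "rc": payload.get("rc"),
        "message": message,
    }


if __name__ == "__main__":
    # Abbreviated copy of the "Run yum_update.sh" failure quoted above.
    sample = (
        r'00000000003d | FATAL | Run yum_update.sh | localhost | '
        r'error={"changed": true, "rc": 1, "stderr": '
        r'"standard_init_linux.go:211: exec user process caused '
        r'\"exec format error\""}'
    )
    print(parse_fatal(sample))
```

Running it over tripleo-container-image-prepare.log surfaces the `exec format error` from `buildah run` directly, rather than the generic "Modifying image ... failed" message shown in the deploy log.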

Another job example:-

~~~
https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-scenario002-standalone-master/c425eec/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz
~~~

Tags: alert ci
Sandeep Yadav (sandeepyadav93) wrote :

Hi,

Integration line jobs passed in the next run.

The issue is under observation.

We also discussed internally whether to stop container updates in the integration line jobs.

Sandeep Yadav (sandeepyadav93) wrote :

Hello,

We hit this issue again in some periodic OVB jobs:-

https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-3ctlr_1comp-featureset001-master/a3164e6/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz
~~~
2020-11-01 18:59:40 | PLAY [External deployment step 1] **********************************************
2020-11-01 18:59:40 | 2020-11-01 18:59:40.686593 | fa163ec3-869c-32b7-71fb-00000000018e | TASK | External deployment step 1
2020-11-01 18:59:40 | 2020-11-01 18:59:40.716034 | fa163ec3-869c-32b7-71fb-00000000018e | OK | External deployment step 1 | undercloud | result={
2020-11-01 18:59:40 | "changed": false,
2020-11-01 18:59:40 | "msg": "Use --start-at-task 'External deployment step 1' to resume from this task"
2020-11-01 18:59:40 | }
2020-11-01 18:59:40 | 2020-11-01 18:59:40.717102 | fa163ec3-869c-32b7-71fb-00000000018e | TIMING | External deployment step 1 | undercloud | 0:04:24.694171 | 0.03s
2020-11-01 18:59:40 | 2020-11-01 18:59:40.784512 | fa163ec3-869c-32b7-71fb-00000000018f | TIMING | include_tasks | undercloud | 0:04:24.761563 | 0.02s
2020-11-01 18:59:40 | 2020-11-01 18:59:40.948540 | 13ed9b30-56b8-4a44-98dc-491e6f2e0bae | INCLUDED | /home/zuul/config-download/overcloud/external_deploy_steps_tasks_step1.yaml | undercloud
2020-11-01 18:59:40 | 2020-11-01 18:59:40.993370 | fa163ec3-869c-32b7-71fb-000000002ff6 | TASK | Container image prepare
2020-11-01 19:02:28 | 2020-11-01 19:02:28.169563 | fa163ec3-869c-32b7-71fb-000000002ff6 | FATAL | Container image prepare | undercloud | error={"changed": false, "error": "Modifying image 192.168.24.1:8787/tripleomaster/openstack-ovn-northd:ca33b3e19a95731d8e372ad76053fef6 failed", "msg": "Error running container image prepare: Modifying image 192.168.24.1:8787/tripleomaster/openstack-ovn-northd:ca33b3e19a95731d8e372ad76053fef6 failed", "params": {}, "success": false}
2020-11-01 19:02:28 | 2020-11-01 19:02:28.191212 | fa163ec3-869c-32b7-71fb-000000002ff6 | TIMING | tripleo_container_image_prepare : Container image prepare | undercloud | 0:07:12.168221 | 167.20s
~~~

https://logserver.rdoproject.org/openstack-periodic-integration-main/opendev.org/openstack/tripleo-ci/master/periodic-tripleo-ci-centos-8-ovb-1ctlr_1comp-featureset002-master/44c574e/logs/undercloud/home/zuul/overcloud_deploy.log.txt.gz

Changed in tripleo:
milestone: victoria-rc1 → wallaby-1
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-quickstart (master)

Fix proposed to branch: master
Review: https://review.opendev.org/761286

Changed in tripleo:
assignee: nobody → Sagi (Sergey) Shnaidman (sshnaidm)
status: New → In Progress
Changed in tripleo:
milestone: wallaby-1 → wallaby-2
Changed in tripleo:
milestone: wallaby-2 → wallaby-3
Changed in tripleo:
milestone: wallaby-3 → wallaby-rc1
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-ci (master)

Change abandoned by "Shnaidman Sagi (Sergey) <email address hidden>" on branch: master
Review: https://review.opendev.org/c/openstack/tripleo-ci/+/761285

Changed in tripleo:
milestone: wallaby-rc1 → xena-1
Changed in tripleo:
milestone: xena-1 → xena-2
Changed in tripleo:
milestone: xena-2 → xena-3
OpenStack Infra (hudson-openstack) wrote : Change abandoned on tripleo-quickstart (master)

Change abandoned by "Shnaidman Sagi (Sergey) <email address hidden>" on branch: master
Review: https://review.opendev.org/c/openstack/tripleo-quickstart/+/761286
