ironic-pxe in unhealthy status

Bug #1803153 reported by Jose Luis Franco
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
tripleo
Fix Released
Undecided
Jose Luis Franco

Bug Description

ironic-pxe container appers in unhealthy state after upgrading the undercloud from queens to rocky:
[stack@c01-h01-r620 ~]$ sudo docker ps
CONTAINER ID IMAGE COMMAND CREATED STATUS PORTS NAMES
a2a76eb540a9 172.31.0.1:8787/rhosp14/openstack-neutron-dhcp-agent:2018-11-06.1 "ip netns exec qdh..." About an hour ago Up About an hour neutron-dnsmasq-qdhcp-16331c4d-a176-4310-9c06-8ccfcae18b66
2dc6e66dab70 172.31.0.1:8787/rhosp14/openstack-nova-compute-ironic:2018-11-06.1 "kolla_start" 24 hours ago Up 24 hours (healthy) nova_compute
8cadae919e0e 172.31.0.1:8787/rhosp14/openstack-ironic-inspector:2018-11-06.1 "kolla_start" 24 hours ago Up 24 hours ironic_inspector_dnsmasq
d773b73fb89b 172.31.0.1:8787/rhosp14/openstack-ironic-inspector:2018-11-06.1 "kolla_start" 24 hours ago Up 24 hours ironic_inspector
f1f549a899f9 172.31.0.1:8787/rhosp14/openstack-ironic-pxe:2018-11-06.1 "kolla_start" 24 hours ago Up 24 hours ironic_pxe_http
29a5949d0389 172.31.0.1:8787/rhosp14/openstack-ironic-pxe:2018-11-06.1 "/bin/bash -c 'BIN..." 24 hours ago Up 24 hours (unhealthy)

Revision history for this message
Jose Luis Franco (jfrancoa) wrote :

When inspecting the Health section in the container we can observe:

            "Health": {
                "Status": "unhealthy",
                "FailingStreak": 2877,
                "Log": [
                    {
                        "Start": "2018-11-13T16:16:05.264857069Z",
                        "End": "2018-11-13T16:16:05.509162744Z",
                        "ExitCode": 1,
                        "Output": "grep: /run_command: No such file or directory\ncurl: (69) getpeername() failed with errno 107: Transport endpoint is not connected\n\n000 172.31.0.1:69 0.001 seconds\n"

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to tripleo-common (master)

Fix proposed to branch: master
Review: https://review.openstack.org/617728

Changed in tripleo:
status: Triaged → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to tripleo-common (master)

Reviewed: https://review.openstack.org/617728
Committed: https://git.openstack.org/cgit/openstack/tripleo-common/commit/?id=67ed5a0488671892f61528aed88df81c5bff9205
Submitter: Zuul
Branch: master

commit 67ed5a0488671892f61528aed88df81c5bff9205
Author: Jose Luis Franco Arza <email address hidden>
Date: Tue Nov 13 17:20:14 2018 +0100

    Fix ironic-pxe healthcheck.

    When the /run_command file is not found on the
    container using the ironic-pxe image, the health
    status displays unhealthy even though it's not
    true. The reason is because grep can't find
    the file and the error makes the whole healthcheck
    fail.
    This patch redirects the error output so the command
    doesn't fail, but the conditional is properly
    evaluated.

    Change-Id: Ia6141f5499f34cd10618378f448598f74aca6273
    Closes-Bug: #1803153

Changed in tripleo:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/tripleo-common 10.3.0

This issue was fixed in the openstack/tripleo-common 10.3.0 release.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.