[heat-admin@overcloud-controller-0 ~]$ sudo docker ps | grep unheal
fecc8432ac43 192.168.24.1:8787/tripleostein/centos-binary-swift-proxy-server:current-tripleo-rdo "dumb-init --singl..." 21 hours ago Up 21 hours (unhealthy) swift_proxy
dae072d3369a 192.168.24.1:8787/tripleostein/centos-binary-ovn-controller:current-tripleo-rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) ovn_controller
dcbb15acda19 192.168.24.1:8787/tripleostein/centos-binary-nova-scheduler:current-tripleo-rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) nova_scheduler
e9f316e1e000 192.168.24.1:8787/tripleostein/centos-binary-cinder-scheduler:current-tripleo-rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) cinder_scheduler
e32cfba7c46e 192.168.24.1:8787/tripleostein/centos-binary-heat-engine:current-tripleo-rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) heat_engine
de6b7bc41d6b 192.168.24.1:8787/tripleostein/centos-binary-nova-consoleauth:current-tripleo-rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) nova_consoleauth
I tested that this issue is present when tripleo is deployed with oooq from master and stein, and reverting this patch https://review.opendev.org/652647 fixes the issue.
The change that added quoting the healthcheck commands breaks healthchecks in tripleo.
For example, the nova_compute service is configured with a healthcheck that looks like this:
]
},
other services, that report healthy don't mind the additional quotes, ex:
]
},
[heat-admin@ overcloud- controller- 0 ~]$ sudo docker ps | grep unheal 24.1:8787/ tripleostein/ centos- binary- swift-proxy- server: current- tripleo- rdo "dumb-init --singl..." 21 hours ago Up 21 hours (unhealthy) swift_proxy 24.1:8787/ tripleostein/ centos- binary- ovn-controller: current- tripleo- rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) ovn_controller 24.1:8787/ tripleostein/ centos- binary- nova-scheduler: current- tripleo- rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) nova_scheduler 24.1:8787/ tripleostein/ centos- binary- cinder- scheduler: current- tripleo- rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) cinder_scheduler 24.1:8787/ tripleostein/ centos- binary- heat-engine: current- tripleo- rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) heat_engine 24.1:8787/ tripleostein/ centos- binary- nova-consoleaut h:current- tripleo- rdo "dumb-init --singl..." 25 hours ago Up 25 hours (unhealthy) nova_consoleauth
fecc8432ac43 192.168.
dae072d3369a 192.168.
dcbb15acda19 192.168.
e9f316e1e000 192.168.
e32cfba7c46e 192.168.
de6b7bc41d6b 192.168.
I tested that this issue is present when tripleo is deployed with oooq from master and stein, and reverting this patch https:/ /review. opendev. org/652647 fixes the issue.