When doing a overcloud deployment with ceph in master following error appears in deployment log [1]:
overcloud.AllNodesDeploySteps.WorkflowTasks_Step2_Execution:
resource_type: OS::Mistral::ExternalResource
physical_resource_id: e30e9ca1-9be4-4102-9a51-df5b5186485c
status: CREATE_FAILED
status_reason: |
resources.WorkflowTasks_Step2_Execution: ERROR
Looking at ceph-install log [2], following error is found when deploying ceph:
2017-11-29 02:34:58,363 p=7774 u=mistral | fatal: [192.168.24.7]: FAILED! => {"attempts": 5, "changed": true, "cmd": ["docker", "exec", "ceph-mon-upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071", "stat", "/var/run/ceph/ceph-mon.upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071.localdomain.asok"], "delta": "0:00:00.052434", "end": "2017-11-29 02:34:58.336669", "failed": true, "msg": "non-zero return code", "rc": 1, "start": "2017-11-29 02:34:58.284235", "stderr": "stat: cannot stat '/var/run/ceph/ceph-mon.upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071.localdomain.asok': No such file or directory", "stderr_lines": ["stat: cannot stat '/var/run/ceph/ceph-mon.upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071.localdomain.asok': No such file or directory"], "stdout": "", "stdout_lines": []}
However, apparently the ceph.mon container is up and running [3].
[1] https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset019-master/1509905/undercloud/home/jenkins/failed_deployment_list.log.txt.gz
[2] https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset019-master/1509905/undercloud/var/log/mistral/ceph-install-workflow.log.txt.gz
[3] https://logs.rdoproject.org/openstack-periodic/periodic-tripleo-ci-centos-7-multinode-1ctlr-featureset019-master/1509905/subnode-2/var/log/extra/docker/containers/ceph-mon-upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071/log/ceph/ceph-mon.upstream-centos-7-2-node-rdo-cloud-tripleo-53070-22071.log.txt.gz
I confirm this but, I have the same problem when I'm deploying in RDO Cloud. I remember gfidente having a fix.