create_admin_via_nova returns before the ssh key is installed on all nodes
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
tripleo | Fix Released | High | Giulio Fidente | |
Bug Description
When Mistral kicks off Ceph-Ansible, I am seeing errors like:
2017-09-29 15:38:10,768 p=19459 u=mistral | TASK [ceph-defaults : is ceph running already?] *******
2017-09-29 15:38:10,780 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
2017-09-29 15:38:11,180 p=19459 u=mistral | fatal: [192.168.24.56]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/
2017-09-29 15:38:11,181 p=19459 u=mistral | fatal: [192.168.24.71]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/
2017-09-29 15:38:11,188 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
This causes the deployment to fail because the hosts are reported as unreachable.
However, I am able to log in to the hosts that are reported with unreachable=1.
This has only become a problem since growing the overcloud deployment to 3 controllers, 3 ceph nodes, and 26 compute nodes (deployed at once).
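To illustrate the kind of guard that would avoid this race, here is a minimal sketch that polls every node until a non-interactive SSH login succeeds before the playbook is started. It is not the fix that was merged in tripleo; `ssh_key_works`, `wait_for_hosts`, and the `user` argument are hypothetical names used only for illustration, and the check assumes the `ssh` client is on the PATH and that the node's host key is already trusted (otherwise `BatchMode=yes` makes the login fail instead of prompting).

```python
import subprocess
import time


def ssh_key_works(host, user, connect_timeout=5):
    """Return True if a key-based, non-interactive SSH login succeeds.

    Hypothetical helper: `user` is whichever account the deployment
    installs the key for; it is not taken from the bug report.
    """
    cmd = [
        "ssh",
        "-o", "BatchMode=yes",  # never prompt; fail if the key is not accepted
        "-o", f"ConnectTimeout={connect_timeout}",
        f"{user}@{host}",
        "true",
    ]
    result = subprocess.run(
        cmd, stdout=subprocess.DEVNULL, stderr=subprocess.DEVNULL
    )
    return result.returncode == 0


def wait_for_hosts(hosts, check, timeout=300.0, interval=5.0,
                   sleep=time.sleep, clock=time.monotonic):
    """Poll until check(host) is True for every host or the timeout expires.

    Returns the set of hosts that still fail the check (empty on success).
    """
    pending = set(hosts)
    deadline = clock() + timeout
    while True:
        # Only re-check hosts that have not passed yet.
        pending = {host for host in pending if not check(host)}
        if not pending or clock() >= deadline:
            return pending
        sleep(interval)
```

A caller would run something like `wait_for_hosts(nodes, lambda h: ssh_key_works(h, user))` and only launch ceph-ansible when the returned set is empty. Taking the check as a parameter keeps the polling loop independent of how reachability is actually tested.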
description: | updated |
summary: |
- ceph-ansible starts before hosts are ready
+ create_admin_via_nova returns before the ssh key is installed on all nodes |
Changed in tripleo: | |
milestone: | none → queens-1 |
tags: | added: pike-backport-potential |
Workaround: in /usr/share/ceph-ansible/ansible.cfg, set retries = 5 in the [ssh_connection] section
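As a rough sketch of applying this workaround programmatically, the snippet below edits the text of an `ansible.cfg` with Python's `configparser`. It assumes the workaround refers to Ansible's SSH connection retry option, which to my knowledge is the `retries` key in the `[ssh_connection]` section; `set_ssh_retries` is a hypothetical helper name. Note that `configparser` drops comments when it writes the file back, so this is only suitable where losing them is acceptable.

```python
import configparser
import io


def set_ssh_retries(cfg_text, retries=5):
    """Return cfg_text with [ssh_connection] retries set to `retries`.

    Assumption: `retries` under [ssh_connection] is the option the
    workaround means. Comments in the original text are not preserved.
    """
    # interpolation=None so '%' in values such as control paths is left alone.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(cfg_text)
    if not parser.has_section("ssh_connection"):
        parser.add_section("ssh_connection")
    parser.set("ssh_connection", "retries", str(retries))
    out = io.StringIO()
    parser.write(out)
    return out.getvalue()
```

The function works on strings rather than on the file directly, so the result can be reviewed before anything under /usr/share is overwritten.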