Activity log for bug #1720793

Date Who What changed Old value New value Message
2017-10-02 13:27:02 John Fulton bug added bug
2017-10-02 13:30:40 John Fulton bug added subscriber Joe Talerico
2017-10-02 13:55:37 John Fulton description
Old value:
  When Mistral kicks off Ceph-Ansible, I am seeing issues like :
  2017-09-29 15:38:10,768 p=19459 u=mistral | TASK [ceph-defaults : is ceph running already?] ********************************
  2017-09-29 15:38:10,780 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
  2017-09-29 15:38:11,180 p=19459 u=mistral | fatal: [192.168.24.56]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/mistral/.ssh'.\r\nssh_exchange_identification: Connection closed by remote host\r\n", "unreachable": true}
  2017-09-29 15:38:11,181 p=19459 u=mistral | fatal: [192.168.24.71]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/mistral/.ssh'.\r\nssh_exchange_identification: Connection closed by remote host\r\n", "unreachable": true}
  2017-09-29 15:38:11,188 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
  Which causes the deployment to fail due to the host being unreachable. However, I am able to login to the hosts that mentions unreachable=1.
  For the full Ansible log (includes multiple deployments) : http://perf1.perf.lab.eng.bos.redhat.com/jtaleric/OpenStack/logs/092917-ceph-ansible-mistral.log
  This only has become a problem since growing the overcloud deployment to 3 controllers, 3 ceph nodes, and 26 compute nodes (deployed at once).
New value:
  When Mistral kicks off Ceph-Ansible, I am seeing issues like :
  2017-09-29 15:38:10,768 p=19459 u=mistral | TASK [ceph-defaults : is ceph running already?] ********************************
  2017-09-29 15:38:10,780 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
  2017-09-29 15:38:11,180 p=19459 u=mistral | fatal: [192.168.24.56]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/mistral/.ssh'.\r\nssh_exchange_identification: Connection closed by remote host\r\n", "unreachable": true}
  2017-09-29 15:38:11,181 p=19459 u=mistral | fatal: [192.168.24.71]: UNREACHABLE! => {"changed": false, "msg": "Failed to connect to the host via ssh: Could not create directory '/home/mistral/.ssh'.\r\nssh_exchange_identification: Connection closed by remote host\r\n", "unreachable": true}
  2017-09-29 15:38:11,188 p=19459 u=mistral | [DEPRECATION WARNING]: always_run is deprecated. Use check_mode = no instead..
  Which causes the deployment to fail due to the host being unreachable. However, I am able to login to the hosts that mentions unreachable=1.
  This only has become a problem since growing the overcloud deployment to 3 controllers, 3 ceph nodes, and 26 compute nodes (deployed at once).
2017-10-02 20:45:11 Giulio Fidente summary Old value: ceph-ansible starts before hosts are ready New value: create_admin_via_nova returns before the ssh key is installed on all nodes
2017-10-02 20:50:22 OpenStack Infra tripleo: status Triaged In Progress
2017-10-02 21:11:22 Emilien Macchi tripleo: milestone queens-1
2017-10-03 07:32:41 Giulio Fidente tags Old value: tripleo-common New value: pike-backport-potential tripleo-common
2017-10-14 10:23:08 OpenStack Infra tripleo: status In Progress Fix Released
2017-10-22 17:14:46 OpenStack Infra tags Old value: pike-backport-potential tripleo-common New value: in-stable-pike pike-backport-potential tripleo-common
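
The summary for this bug describes a race: a provisioning step returns before every node accepts the ssh key, so the Ansible run that follows reports hosts as UNREACHABLE. One common guard against that class of race is to poll each host until it answers before starting the playbook. The sketch below illustrates that idea only; it is not the fix that was released for this bug, and `wait_until_ready`, its parameters, and the probe function are hypothetical names introduced for illustration.

```python
import time
from typing import Callable, Iterable, List


def wait_until_ready(
    hosts: Iterable[str],
    probe: Callable[[str], bool],
    timeout: float = 300.0,
    interval: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> List[str]:
    """Poll until probe(host) is true for every host, or the timeout elapses.

    Returns the hosts that were still not ready (an empty list on success).
    `probe` is supplied by the caller; this helper does not open connections.
    """
    pending = list(hosts)
    deadline = clock() + timeout
    while pending:
        # Keep only the hosts whose probe still fails on this pass.
        pending = [host for host in pending if not probe(host)]
        if not pending or clock() >= deadline:
            break
        sleep(interval)
    return pending


if __name__ == "__main__":
    # Hypothetical probe for demonstration: each host succeeds on its 2nd try.
    seen = {}

    def demo_probe(host: str) -> bool:
        seen[host] = seen.get(host, 0) + 1
        return seen[host] >= 2

    not_ready = wait_until_ready(
        ["192.168.24.56", "192.168.24.71"], demo_probe, timeout=10, interval=0
    )
    print("hosts still unreachable:", not_ready)
```

In practice the probe would attempt a real non-interactive login as the deployment user, and the playbook would start only when the returned list is empty. Passing the probe in as a callable keeps the retry logic testable without a network.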