We have a TripleO Ussuri deployment that was deployed around May.
Until now the cloud has been operating fine and we have been able to expand it without issues.
It has 3 controllers, 34 computes, and external Ceph.
Yesterday we tried to add a few new compute nodes, but the deployment does not get past step 2
and fails with this error in the ansible log:
2021-01-26 23:22:41,647 p=972858 u=stack n=ansible | fatal: [controller-0]: FAILED! => changed=false msg: '[''mysql_init_bundle''] failed to start, check logs in /var/log/containers/stdouts/'
I don't really see anything in the mysql_init log.
The only thing I see is that it seems to be failing on
bash-4.4# puppet apply --verbose --detailed-exitcodes --summarize --color=false --modulepath /etc/puppet/modules:/opt/stack/puppet-modules:/usr/share/openstack-puppet/modules --tags file,file_line,concat,augeas,pacemaker::resource::bundle,pacemaker::property,pacemaker::resource::ocf,pacemaker::constraint::order,pacemaker::constraint::colocation,galera_ready,mysql_database,mysql_grant,mysql_user -e 'noop_resource('\''package'\''); include tripleo::profile::base::pacemaker;include tripleo::profile::pacemaker::database::mysql_bundle'
When I run it manually within the container, I think it does not return an exit code; it just hangs. (I don't know if I am running it correctly, though.)
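One way to tell a real hang apart from a slow run or a non-zero exit is to wrap the command in a time limit and print the exit status. The following is only a minimal sketch: the helper names describe_exit and run_limited are hypothetical, it assumes bash and GNU timeout are available inside the container, and the status meanings are those Puppet documents for --detailed-exitcodes.

```shell
#!/usr/bin/env bash
# Hypothetical helpers, not part of TripleO or Puppet.

# Map an exit status to a short description.
# 124 is what GNU timeout returns when the time limit is hit;
# 0/1/2/4/6 are the statuses Puppet documents for --detailed-exitcodes.
describe_exit() {
  case "$1" in
    0)   echo "no changes, no failures" ;;
    1)   echo "run failed" ;;
    2)   echo "changes applied, no failures" ;;
    4)   echo "some resources failed" ;;
    6)   echo "changes applied and some resources failed" ;;
    124) echo "timed out (command was still running)" ;;
    *)   echo "other exit status: $1" ;;
  esac
}

# Run any command with a time limit and report how it ended.
# Usage: run_limited SECONDS COMMAND [ARGS...]
run_limited() {
  local limit="$1"
  shift
  local rc=0
  timeout "$limit" "$@" || rc=$?
  echo "exit status $rc: $(describe_exit "$rc")"
  return "$rc"
}
```

For example, run_limited 600 followed by the puppet apply command and the same arguments as above would report "timed out" if the run is still going after ten minutes, rather than leaving the shell blocked. The 600 second limit is an arbitrary choice.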
The overcloud command looks like this:
openstack overcloud deploy --templates ~/templates --stack-only \
  -e /home/stack/containers-prepare-parameter.yaml \
  -e /home/stack/templates/node-info.yaml \
  -r /home/stack/templates/roles_data.yaml \
  -n /home/stack/templates/network_data.yaml \
  -e /home/stack/templates/environments/network-environment-OVS.yaml \
  -e /home/stack/templates/environments/network-isolation.yaml \
  -e /home/stack/templates/environments/ceph-ansible/ceph-ansible-external.yaml \
  -e /home/stack/templates/ceph-config.yaml \
  -e /home/stack/templates/environments/docker-ha.yaml \
  -e /home/stack/templates/environments/ssl/enable-tls.yaml \
  -e /home/stack/templates/environments/ssl/inject-trust-anchor-hiera.yaml \
  -e /home/stack/templates/environments/ssl/inject-trust-anchor.yaml \
  -e /home/stack/templates/environments/ssl/tls-endpoints-public-dns.yaml \
  -e /home/stack/templates/environments/predictable-placement/custom-domain.yaml \
  -e /home/stack/templates/cloudname.yaml \
  -e /home/stack/templates/environments/manila-cephfsnative-config.yaml \
  -e /home/stack/templates/environments/ceph-ansible/ceph-mds.yaml \
  -e /home/stack/templates/manila-cephfsnative-config.yaml \
  -e /home/stack/templates/environments/enable-legacy-telemetry.yaml \
  -e /home/stack/templates/environments/neutron-ovs-dvr.yaml \
  -e /home/stack/templates/environments/services/octavia.yaml \
  -e /home/stack/templates/overcloud_dashboard_hardening.yaml \
  -e /home/stack/templates/novafixes.yaml \
  --timeout 1500
Also attaching the last lines from ansible.log