#queens-latest
hi ,
when replacing failed controller (controller-2 removed) we face the following
after new controller added to the ironic/nova etc and the failed one removed
1 - pcs save the the dead containers of the removed controller in the bundle resources ( rabbitmq/galera/haproxy/ -bundle )
Docker container set: rabbitmq-bundle [172.31.255.1:8787/cbis/centos-binary-rabbitmq:pcmklatest]
rabbitmq-bundle-0 (ocf::heartbeat:rabbitmq-cluster): Started overcloud-controller-0
rabbitmq-bundle-1 (ocf::heartbeat:rabbitmq-cluster): Started overcloud-controller-1
rabbitmq-bundle-2 (ocf::heartbeat:rabbitmq-cluster): Stopped
assuming those should"v removed .... ( *-2 was in the removed controller / could not remove them manually ... cib.xml / pcs resource bundle update ... )
2 - during update following errors trigger
overcloud.AllNodesDeploySteps.ControllerDeployment_Step2.3:
resource_type: OS::Heat::StructuredDeployment
physical_resource_id: ffc0b7c2-2bae-40cb-8815-9263ac810f15
status: CREATE_FAILED
status_reason: |
Error: resources[3]: Deployment to server failed: deploy_status_code : Deployment exited with non-zero status code: 2
deploy_stdout: |
...
"+ puppet apply --verbose --detailed-exitcodes --summarize --color=false --modulepath /etc/puppet/modules:/opt/stack/puppet-modules:/usr/share/openstack-puppet/modules --tags file,file_line,concat,augeas,tripleo::firewall::rule,pacemaker::resource::bundle,pacemaker::property,pacemaker::resource::ip,pacemaker::resource::ocf,pacemaker::constraint::order,pacemaker::constraint::colocation -e 'include ::tripleo::profile::base::pacemaker; include ::tripleo::profile::pacemaker::haproxy_bundle'",
" with Stdlib::Compat::Hash. There is further documentation for validate_legacy function in the README. at [\"/etc/puppet/modules/tripleo/manifests/firewall/rule.pp\", 134]:",
"Warning: Scope(Haproxy::Config[haproxy]): haproxy: The $merge_options parameter will default to true in the next major release. Please review the documentation regarding the implications."
]
}
to retry, use: --limit @/var/lib/heat-config/heat-config-ansible/5f0e11da-8d21-48bd-8167-df39259a5190_playbook.retry
PLAY RECAP *********************************************************************
localhost : ok=6 changed=2 unreachable=0 failed=1
(truncated, view all with --long)
deploy_stderr: |
any advice ?
One initial error (although not at step2) is expected. Did you follow https:/ /access. redhat. com/documentati on/en-us/ red_hat_ openstack_ platform/ 12/html- single/ director_ installation_ and_usage/ #sect-Replacing _Controller_ Nodes