Running an OpenSearch upgrade where the container will not change causes shard allocation to remain disabled
Affects | Status | Importance | Assigned to | Milestone | ||
---|---|---|---|---|---|---|
kolla-ansible | Status tracked in Caracal | |||||
Antelope |
Fix Committed
|
Medium
|
Unassigned | |||
Bobcat |
Fix Released
|
Medium
|
Unassigned | |||
Caracal |
Fix Released
|
Medium
|
Matt Crees | |||
Zed |
Fix Released
|
Medium
|
Unassigned |
Bug Description
Shard allocation is disabled at the start of the OpenSearch upgrade task: https:/
This is set as a transient setting, meaning it will be removed once the containers are restarted. However, if there is not change in the OpenSearch container it will not be restarted so the cluster is left in a broken state: unable to allocate shards.
This is hit, for example, if all services are being upgraded and another non-opensearch service fails. An operator will want to rerun the full upgrade again, assuming this is safe due to idempotency, so that the remaining services are still upgraded in order.
Fix proposed to branch: master /review. opendev. org/c/openstack /kolla- ansible/ +/905851
Review: https:/