StarlingX

Bug #1833609
Comment #7

Comment 7 for bug 1833609

Revision history for this message

Lin Shuicheng (shuicheng) wrote on 2019-06-27:

It is armada related issue. The issue is that there is only 3 job defined in ceph_rgw chart. After swact && lock/unlock standby controller, the pod created for job doesn't exist any more. So there is no pod for ceph_rgw. And armada needs at least 1 pod to pass resource wait. So it is stuck due to there is no pod.
Here is the armada log:
2019-06-24 10:51:58.074 36 DEBUG armada.handlers.wait [-] [chart=openstack-ceph-rgw]: Resolved `wait.resources` list: [{'type': 'job', 'required': False, 'labels': {'release_group': 'osh-openstack-ceph-rgw'}}, {'type': 'pod', 'labels': {'release_group': 'osh-openstack-ceph-rgw'}}] __init__ /usr/local/lib/python3.6/dist-packages/armada/handlers/wait.py:87^[[00m
required is False for job, but True for pod.
And in function _wait() in wait.py, here is the code logic:
if (not found_resources) and not self.required:
return None

Need fix the issue in armada, or find some workaround in STX.