openstack overcloud ceph deploy fails
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | New | Undecided | Unassigned |
Bug Description
First issue:
- `Error: container-init binary not found on the host: stat /usr/libexec/
- Installing `catatonit` seems to resolve this issue
Second issue:
- `error={"changed": true, "cmd": ["podman", "run", "--rm", "--net=host", "--ipc=host", "--volume", "/etc/ceph:
- ~~This one I don't know how to debug~~. This error occurred because no ceph storage node was defined
- There are no checks here to confirm that ceph storage and ceph controllers are present in `baremetal_
Third issue:
- There is no check that 3 ceph storage nodes are configured. Without them, the ceph deployment gets stuck at `Wait for expected number of osds to be running`
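
A pre-flight check along these lines could catch the missing nodes before the deployment starts. This is only a minimal sketch, not TripleO code. It assumes the baremetal definition has already been parsed into a list of role entries with `name` and `count` keys, and that the storage role is named `CephStorage`. The function name `check_role_counts` is hypothetical.

```python
def check_role_counts(roles, required):
    """Return error strings for roles that fall below their required node count.

    roles:    list of dicts with "name" and optional "count" keys (assumed layout).
    required: dict mapping role name -> minimum number of nodes.
    """
    # Sum node counts per role name. An entry without "count" is
    # treated as a single node (an assumption of this sketch).
    found = {}
    for role in roles:
        name = role.get("name")
        found[name] = found.get(name, 0) + int(role.get("count", 1))

    # Report every required role that is missing or under-provisioned.
    errors = []
    for name, minimum in required.items():
        have = found.get(name, 0)
        if have < minimum:
            errors.append(f"{name}: need at least {minimum} node(s), found {have}")
    return errors
```

Calling `check_role_counts(roles, {"CephStorage": 3})` returns an empty list when enough storage nodes are defined, so a playbook or wrapper script could fail early on any non-empty result instead of hanging while it waits for OSDs.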
Fourth issue:
- The playbook does not check whether a default route gateway is properly defined
- I have found that the deployment requires one to be configured
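
The missing gateway check could look roughly like the following. It is a sketch that parses the text of the Linux kernel routing table in `/proc/net/route` and lists the default gateways it finds. The function name `default_gateways` is hypothetical, and the address decoding assumes a little-endian host such as x86_64.

```python
import socket
import struct

RTF_UP = 0x0001       # route is usable
RTF_GATEWAY = 0x0002  # destination is reached through a gateway


def default_gateways(route_table):
    """Return the gateway IPs of all default routes in /proc/net/route text."""
    gateways = []
    for line in route_table.splitlines()[1:]:  # first line is the column header
        fields = line.split()
        if len(fields) < 8:
            continue  # skip blank or malformed lines
        dest, gateway, mask = fields[1], fields[2], fields[7]
        flags = int(fields[3], 16)
        # A default route has destination 0.0.0.0 and netmask 0.0.0.0.
        is_default = dest == "00000000" and mask == "00000000"
        if is_default and flags & RTF_UP and flags & RTF_GATEWAY:
            # Addresses are hex in host byte order; "<L" assumes little-endian.
            packed = struct.pack("<L", int(gateway, 16))
            gateways.append(socket.inet_ntoa(packed))
    return gateways
```

On a node being deployed, passing the contents of `/proc/net/route` to `default_gateways` and failing when the result is empty would surface the problem before the ceph deployment starts.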
Final issue:
- There is a ceph version mismatch between the `cephadm` built into the image and the containers, i.e. pacific vs. quincy
- Manually installing `centos-