validations ceph-health incorrectly reports 0 OSDs
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
tripleo | Fix Released | High | Giulio Fidente |
Bug Description
The Get ceph health task executes a shell command which can fail. If it fails, the validation reports that there are 0 OSDs, even if there are more than 0 OSDs.
2019-07-30 21:15:24,037 p=300 u=mistral | TASK [ceph : Get OSD stat percentage] *******
2019-07-30 21:15:24,037 p=300 u=mistral | task path: /usr/share/
2019-07-30 21:15:24,225 p=300 u=mistral | Tuesday 30 July 2019 21:15:24 +0000 (0:00:00.488) 0:28:24.400 **********
2019-07-30 21:15:24,809 p=300 u=mistral | changed: [undercloud -> 192.168.24.3] => {
"changed": true,
"cmd": "docker exec ceph-mon-
"delta": "0:00:00.268931",
"end": "2019-07-30 21:15:24.778989",
"rc": 0,
"start": "2019-07-30 21:15:24.510058"
}
STDERR:
2019-07-30 21:15:24.743 7ff70647f700 -1 Errors while parsing config file!
2019-07-30 21:15:24.743 7ff70647f700 -1 parse_file: cannot open /etc/ceph/
2019-07-30 21:15:24.743 7ff70647f700 -1 parse_file: cannot open /root/.
2019-07-30 21:15:24.743 7ff70647f700 -1 parse_file: cannot open ceph.conf: (2) No such file or directory
Error initializing cluster client: ObjectNotFound(
2019-07-30 21:15:24,996 p=300 u=mistral | TASK [ceph : Fail if there is an unacceptable percentage of in OSDs] ***********
2019-07-30 21:15:24,997 p=300 u=mistral | task path: /usr/share/
2019-07-30 21:15:25,028 p=300 u=mistral | Tuesday 30 July 2019 21:15:25 +0000 (0:00:00.803) 0:28:25.204 **********
2019-07-30 21:15:25,168 p=300 u=mistral | fatal: [undercloud -> 192.168.24.3]: FAILED! => {
"changed": false
}
MSG:
Only 0.0% of OSDs are in, but 66% are required
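The log above shows the command returning rc 0 while ceph printed a client initialization error on STDERR, after which the check reports 0.0%. A minimal sketch of a stricter check follows, in Python rather than the validation's own Ansible. It assumes the command's stdout is a single number (the percentage of "in" OSDs). The function names `parse_in_osd_percentage` and `check_in_osds` are hypothetical and not part of tripleo-validations. The idea is to treat missing or unparsable output as "could not determine" instead of silently treating it as 0.

```python
def parse_in_osd_percentage(stdout: str, stderr: str = "") -> float:
    """Return the percentage of 'in' OSDs parsed from stdout.

    Raises ValueError when the output is empty or not a valid percentage,
    so that a failed command is never mistaken for "0 OSDs are in".
    """
    text = stdout.strip()
    if not text:
        # Assumption: a failed ceph call leaves stdout empty even when rc is 0.
        raise ValueError(
            "could not determine OSD stats; stderr was: %r" % stderr.strip()
        )
    try:
        value = float(text)
    except ValueError:
        raise ValueError("unexpected OSD stat output: %r" % text) from None
    # This comparison also rejects NaN, which compares false to everything.
    if not 0.0 <= value <= 100.0:
        raise ValueError("percentage out of range: %r" % text)
    return value


def check_in_osds(stdout: str, stderr: str, required: float) -> str:
    """Compare the parsed percentage with the required threshold."""
    pct = parse_in_osd_percentage(stdout, stderr)
    if pct < required:
        return "Only %s%% of OSDs are in, but %s%% are required" % (pct, required)
    return "OK"
```

With this guard, an empty result accompanied by an error on STDERR surfaces as "could not determine OSD stats" rather than as a misleading 0.0% failure.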
Changed in tripleo:
importance: Undecided → High
Probably related to the failures on master at the moment:
https://logs.opendev.org/30/673730/2/check/tripleo-ci-centos-7-scenario001-standalone/e624803/logs/undercloud/home/zuul/standalone_deploy.log.txt.gz#_2019-08-06_09_00_24
Looks like the ceph-ansible run reports the OSDs as up:
https://logs.opendev.org/30/673730/2/check/tripleo-ci-centos-7-scenario001-standalone/e624803/logs/undercloud/home/zuul/undercloud-ansible-AT4Mmi/ceph-ansible/ceph_ansible_command.log.txt.gz