gnocchi pool have many more objects per pg than average
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph Monitor Charm |
Fix Released
|
Medium
|
James Page |
Bug Description
On a fresh deployment of queens on xenial, ceph reports status warning
and never comes to good health:
# ceph -s
cluster:
id: <uuid>
health: HEALTH_WARN
1 pools have many more objects per pg than average
services:
mon: 6 daemons, quorum juju-<hash>
mgr: juju-<hash>
osd: 180 osds: 180 up, 180 in
data:
pools: 21 pools, 2544 pgs
objects: 83723 objects, 1601 MB
usage: 193 GB used, 1309 TB / 1309 TB avail
pgs: 2544 active+clean
io:
client: 1091 B/s rd, 1 op/s rd, 0 op/s wr
# # ceph health detail
HEALTH_WARN 1 pools have many more objects per pg than average
MANY_OBJECTS_PER_PG 1 pools have many more objects per pg than average
pool gnocchi objects per pg (2606) is more than 81.4375 times cluster average (32)
This trigger alarms in nagios as it expects ceph to report good health status,
so expected behaviour on new deployment is that the status goes good, and if it
does leave that status, the thing would recover itself in a reasonable time.
Thanks!
José.
Changed in charm-ceph-mon: | |
status: | Confirmed → Triaged |
Changed in charm-ceph-mon: | |
status: | Fix Committed → Fix Released |
I also see this on a Bionic/Queens deployment:
$ sudo ceph -s 7e13-11e8- a8c7-00163eb368 65
cluster:
id: 80f2a4b2-
health: HEALTH_WARN
1 pools have many more objects per pg than average
services: 6-lxd-0, juju-182b52- 4-lxd-0, juju-182b52- 5-lxd-0 4-lxd-0( active) , standbys: juju-182b52- 6-lxd-0, juju-182b52-5-lxd-0
mon: 3 daemons, quorum juju-182b52-
mgr: juju-182b52-
osd: 12 osds: 12 up, 12 in
rgw: 1 daemon active
data: clean+scrubbing +deep
pools: 22 pools, 420 pgs
objects: 3502k objects, 421 GB
usage: 938 GB used, 10235 GB / 11174 GB avail
pgs: 419 active+clean
1 active+
io:
client: 6280 B/s rd, 17836 B/s wr, 7 op/s rd, 29 op/s wr