ceph-osd failed to start if filesystem is recovered at mount time
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Fuel for OpenStack |
Invalid
|
Low
|
MOS Ceph | ||
6.0.x |
Won't Fix
|
Low
|
MOS Ceph | ||
7.0.x |
Won't Fix
|
Low
|
MOS Ceph | ||
8.0.x |
Won't Fix
|
Low
|
MOS Ceph | ||
Mitaka |
Invalid
|
Low
|
MOS Ceph |
Bug Description
I have faced with this issue during working with CI test "thead_3" on CentOS, in particular with the "ceph_ha_restart" test.
When a node, where Ceph was installed, reboots/reset and filesystem used for ceph damages, then ceph-osd fails to start while this filesystem checks for errors during mounting.
Manual restart 'ceph' service makes ceph-osd alive.
Looks like the filesystem is still in 'ro' state when ceph-osd is starting.
It is hard to reproduce because of a short moment of race condition between ceph-osd starting and filesystem checking.
It can be investigated by turning a ceph filesystem into 'ro' state and restarting ceph service:
# service ceph stop
# mount -f -o remount,ro /dev/vda4 #/dev/vda4 - ceph partiton
or
# xfs_freeze -f /var/lib/
# service ceph start
# service ceph status
=== mon.node-1 ===
mon.node-1: running {"version"
=== osd.1 ===
osd.1: not running.
In the diagnostic snapshot, for 'node-1' ceph-osd was started on 2014-09-20 15:41:20, and filesystem check was started on 2014-09-20 15:41:21 (16:41:21 in the kernel.log):
================
2014-09-
2014-09-
2014-09-
================
Cluster configuration:
CentOS/HA, nova-network/flat, 3 controller+ceph; 2 compute+ceph; 1 ceph.
{"build_id": "2014-09-
tags: | added: ceph |
Changed in fuel: | |
milestone: | 5.1 → 6.0 |
Changed in fuel: | |
assignee: | nobody → Fuel Library Team (fuel-library) |
Changed in fuel: | |
importance: | Undecided → Medium |
status: | New → Triaged |
Changed in fuel: | |
importance: | Medium → Low |
Changed in fuel: | |
milestone: | 6.0 → 6.1 |
no longer affects: | fuel/6.1.x |
Changed in fuel: | |
status: | Triaged → Won't Fix |
Changed in fuel: | |
milestone: | 6.1 → 8.0 |
status: | Won't Fix → Triaged |
no longer affects: | fuel/8.0.x |
tags: | added: area-mos |
We no longer fix Low bugs in 8.0, closing as Won't Fix