os-prober blocks writes to raw partitions
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
ceph (Juju Charms Collection) |
Invalid
|
Undecided
|
Unassigned | ||
ceph (Ubuntu) |
Invalid
|
Medium
|
Unassigned | ||
os-prober (Ubuntu) |
Confirmed
|
Undecided
|
Unassigned |
Bug Description
This morning automatic package upgrade on our running system:
libsigc+
libgtk2.
openssh-
killed five OSD out of 15 on our ceph 0.80.6 cluster of 5 machines :
root@g2:
2014-10-22 07:41:36.783358 7f4d33d55700 -1 journal FileJournal:
2014-10-22 07:41:36.783617 7f4d33d55700 -1 journal FileJournal:
2014-10-22 07:41:36.800201 7f4d33d55700 -1 os/FileJournal.cc: In function 'void FileJournal:
2014-10-22 07:41:36.847389 7f4d33d55700 -1 *** Caught signal (Aborted) **
root@n7:
2014-10-22 07:42:18.169142 7f9b977df700 -1 journal FileJournal:
root@n7:
2014-10-22 07:42:17.509579 7f6efa27b700 -1 osd.9 15390 heartbeat_check: no reply from osd.13 since back 2014-10-22 07:41
2014-10-22 07:42:17.509593 7f6efa27b700 -1 osd.9 15390 heartbeat_check: no reply from osd.14 since back 2014-10-22 07:41
2014-10-22 07:42:17.945433 7f6ef6a74700 -1 journal FileJournal:
2014-10-22 07:42:17.960678 7f6ef6a74700 -1 os/FileJournal.cc: In function 'void FileJournal:
root@stri:
2014-10-22 00:42:01.140574 7fa929b8a700 -1 journal FileJournal:
2014-10-22 00:42:01.141439 7fa929b8a700 -1 journal FileJournal:
root@stri:
2014-10-22 00:41:54.828719 7f438eb45700 -1 osd.14 15388 heartbeat_check: no reply from osd.7 since back 2014-10-22 00:41:34.499777 front 2014-10-22 00:41:34.499777 (cutoff 2014-10-22 00:41:34.828717)
2014-10-22 00:41:55.241586 7f437217f700 0 -- 192.168.
2014-10-22 00:42:01.235014 7f438b33e700 -1 journal FileJournal:
2014-10-22 00:42:01.235032 7f438b33e700 -1 journal FileJournal:
The OSD all died just after a run of os-prober according to the logs:
Oct 22 07:41:36 g2 os-prober: debug: running /usr/lib/
os-prober likely did an operation on the journal partition causing the write errors on the OSD.
Changed in os-prober (Ubuntu): | |
status: | New → Confirmed |
I did a quick test and was not able to reproduce - but my test environment is virtual so that may be making a difference.