Ceph install fails by using removable devices
| Affects | Status | Importance | Assigned to | Milestone |
|---|---|---|---|---|
| OpenStack ceph charm | | Medium | Chris Holcombe | |
| ceph (Juju Charms Collection) | | Medium | Chris Holcombe | |
Bug Description
It appears that ceph-install detects and attempts to use removable devices (CD, SD card) as valid block storage:
unit-ceph-0: 2015-02-10 04:05:31 INFO mon-relation-
unit-ceph-0: 2015-02-10 04:05:31 ERROR juju.worker.uniter uniter.go:486 hook failed: exit status 1
This causes the landscape install to fail at 94%.
Storage devices on all boxes in the cluster:
/dev/sda - 136GB RAID0
/dev/sdb - 136GB RAID0
/dev/sdc - CD drive
/dev/sdd - SD card
This could be due to how they're being reported by the Debian installer (?). At install time the devices are reported as sdc and sdd respectively, but post-install they are sda and sdb. However, ceph does appear to correctly provision /dev/sdb for itself.
Attached is the log set from landscape.
| Drew (drew-6) wrote : | #1 |
| Adam Collard (adam-collard) wrote : | #2 |
| Drew (drew-6) wrote : | #3 |
For unit-ceph-2 I removed all existing partitions on /dev/sdb (which had the swap partition and an old MBR on it) and redeployed Landscape. This time ceph-disk-prepare correctly partitions /dev/sdb. However, it still errors out when trying to partition /dev/sdc (the CD drive), which causes the whole build to fail:
2015-02-10 20:59:24 INFO juju-log mon:60: Making dir /var/lib/charm/ceph root:root 555
2015-02-10 20:59:24 WARNING juju-log mon:60: Not a valid ipv6 address: 192.168.1.6
2015-02-10 20:59:24 WARNING juju-log mon:60: Not a valid ipv6 address: 192.168.1.7
2015-02-10 20:59:24 WARNING juju-log mon:60: Not a valid ipv6 address: 192.168.1.8
2015-02-10 20:59:24 INFO juju-log mon:60: Making dir /var/run/ceph root:root 755
2015-02-10 20:59:24 INFO juju-log mon:60: Making dir /var/lib/
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:24 INFO mon-relation-
2015-02-10 20:59:34 INFO juju-log mon:60: Looks like /dev/sda is in use, skipping.
2015-02-10 20:59:34 INFO juju-log mon:60: Path /dev/vda does not exist - bailing
Reading package lists... Done
Building dependency tree
Reading state information... Done
2015-02-10 20:59:38 INFO mon-relation-
2015-02-10 20:59:38 INFO mon-relation-
2015-02-10 20:59:38 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:40 INFO mon-relation-
2015-02-10 20:59:41 INFO mon-relation-
2015-02-10 20:59:42 INFO mon-relation-
2015-02-10 20:59:43 INFO mon-relation-
2015-02-10 20:59:...
| Andreas Hasenack (ahasenack) wrote : | #4 |
This is an issue with the ceph charm. It shouldn't treat the CD-ROM as a disk it can partition/format, and it probably also shouldn't get confused by an existing partitioning scheme on a real disk.
| information type: | Proprietary → Public |
| affects: | landscape → ceph (Juju Charms Collection) |
| Andreas Hasenack (ahasenack) wrote : | #5 |
Drew, this was your paste, right?
http://
In that case, the above is the lshw information for the machine in question.
| tags: | added: cloud-installer landscape |
| James Page (james-page) wrote : | #6 |
We need to bake some better intelligence into the code that currently checks whether a configured block device is usable for ceph; anything read-only should be ignored.
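A minimal sketch of the kind of check described above, assuming the Linux sysfs attributes `/sys/block/<dev>/ro` and `/sys/block/<dev>/removable` are available. The helper names `read_sysfs_flag` and `is_usable_for_ceph` and the `sysfs_root` parameter are hypothetical and are not taken from the charm's code:

```python
from pathlib import Path


def read_sysfs_flag(sysfs_root: str, device: str, attribute: str) -> bool:
    """Return True when <sysfs_root>/block/<device>/<attribute> contains '1'."""
    path = Path(sysfs_root) / "block" / device / attribute
    try:
        return path.read_text().strip() == "1"
    except OSError:
        # Missing or unreadable attribute: treat the flag as not set.
        return False


def is_usable_for_ceph(device: str, sysfs_root: str = "/sys") -> bool:
    """Hypothetical helper: reject read-only and removable block devices."""
    name = device.rsplit("/", 1)[-1]  # "/dev/sdc" -> "sdc"
    if not (Path(sysfs_root) / "block" / name).is_dir():
        # Not listed as a whole-disk block device (or does not exist).
        return False
    if read_sysfs_flag(sysfs_root, name, "ro"):
        return False
    if read_sysfs_flag(sysfs_root, name, "removable"):
        return False
    return True
```

With a check along these lines, a call such as `is_usable_for_ceph("/dev/sdc")` would be expected to return False for a drive that sysfs flags as removable. The `sysfs_root` argument exists only so the logic can be exercised against a fake directory tree.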
| Changed in ceph (Juju Charms Collection): | |
| status: | New → Triaged |
| importance: | Undecided → Medium |
| Chris Holcombe (xfactor973) wrote : | #7 |
The way I handled this in the ceph manager code I wrote was to check some udev attributes and rule out everything that didn't fit. I actually ran into the same problem with my early code, which was seeing CD-ROM drives, ramdisks, etc. as disks that were fit for use.
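A rough illustration of that udev-based approach, not the ceph manager code itself: the sketch below assumes the properties come from the KEY=VALUE output of `udevadm info --query=property --name=<device>`, and that `DEVTYPE`, `ID_CDROM`, `ID_TYPE` and `DEVNAME` are present where applicable. The function names and the list of name prefixes are hypothetical.

```python
def parse_udev_properties(text: str) -> dict:
    """Parse KEY=VALUE lines (as printed by udevadm) into a dict."""
    props = {}
    for line in text.splitlines():
        key, sep, value = line.partition("=")
        if sep:  # skip lines that contain no '='
            props[key.strip()] = value.strip()
    return props


def looks_like_real_disk(props: dict) -> bool:
    """Hypothetical filter: rule out everything that does not fit a plain disk."""
    if props.get("DEVTYPE") != "disk":
        return False  # partitions and anything without a device type
    if props.get("ID_CDROM") == "1" or props.get("ID_TYPE") == "cd":
        return False  # optical drives
    name = props.get("DEVNAME", "").rsplit("/", 1)[-1]
    if name.startswith(("ram", "loop", "sr", "fd")):
        return False  # ramdisks, loop devices, SCSI CD-ROMs, floppies
    return True


if __name__ == "__main__":
    sample = "DEVNAME=/dev/sdc\nDEVTYPE=disk\nID_CDROM=1\nID_TYPE=cd\n"
    print(looks_like_real_disk(parse_udev_properties(sample)))
```

Keeping the filter a pure function over a property mapping means it can be tested without real hardware; how the properties are collected on a unit is left open here.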
| Changed in ceph (Juju Charms Collection): | |
| assignee: | nobody → Chris Holcombe (xfactor973) |
| David Britton (davidpbritton) wrote : | #8 |
Note also that this presents itself when other disk-like things are presented to the OS that cannot be used:
| Chris Holcombe (xfactor973) wrote : | #9 |
I think that shouldn't be a problem so long as we filter for disks that are SSD or spinning and have a size > 0. I've not encountered disks with zero size before, but I've also never used Dell hardware.
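One possible reading of that filter, sketched under the assumption that sysfs exposes `/sys/block/<dev>/size` (in 512-byte sectors) and `/sys/block/<dev>/queue/rotational` (1 for spinning media, 0 for SSD). The names `read_int` and `disk_kind` are hypothetical:

```python
from pathlib import Path
from typing import Optional


def read_int(path: Path) -> Optional[int]:
    """Read an integer from a sysfs-style file, or None if it cannot be read."""
    try:
        return int(path.read_text().strip())
    except (OSError, ValueError):
        return None


def disk_kind(device: str, sysfs_root: str = "/sys") -> Optional[str]:
    """Return 'ssd' or 'spinning' for a non-empty disk, otherwise None."""
    name = device.rsplit("/", 1)[-1]  # "/dev/sdb" -> "sdb"
    base = Path(sysfs_root) / "block" / name
    sectors = read_int(base / "size")
    if not sectors:
        return None  # missing, unreadable, or zero-size device
    rotational = read_int(base / "queue" / "rotational")
    if rotational == 1:
        return "spinning"
    if rotational == 0:
        return "ssd"
    return None  # rotational flag missing or unexpected
```

A size and rotational check like this may not be enough to exclude an optical drive by itself, so it would most likely be combined with read-only, removable, or udev-based checks.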
| tags: | added: kanban-cross-team |
| tags: | removed: kanban-cross-team |
| Thiago (thisab) wrote : | #10 |
I found this entry on one of the broken (ceph-osd/4) containers:
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 INFO mon-relation-
2016-06-19 00:57:33 ERROR juju-log mon:32: Unable to initialize device: /dev/sdc
I even opened another thread: https:/
| Thiago (thisab) wrote : | #11 |
As can be seen here: http://
On some machines the output is always the same for ceph-osd:
2016-06-27 15:32:46 INFO juju-log mon:27: Path /dev/vdb does not exist - bailing
2016-06-27 15:32:46 INFO mon-relation-
| Chad Smith (chad.smith) wrote : | #12 |
As of Landscape 16.05, ceph-osd[
| Changed in charm-ceph: | |
| assignee: | nobody → Chris Holcombe (xfactor973) |
| importance: | Undecided → Medium |
| status: | New → Triaged |
| Changed in ceph (Juju Charms Collection): | |
| status: | Triaged → Invalid |

unit-ceph-2: 2015-02-10 04:05:20 INFO mon-relation-changed Reading state information... Done
******* ******* ******* ******* ******* ******* ******* ******* *******
Found invalid GPT and valid MBR; converting MBR to GPT format
in memory.
******* ******* ******* ******* ******* ******* ******* ******* *******
Warning: The kernel is still using the old partition table.
The new table will be used at the next reboot.
GPT data structures destroyed! You may now partition the disk using fdisk or
other utilities.
Warning: The kernel is still using the old partition table.
The new table will be used at the next reboot.
The operation has completed successfully.
Warning: The kernel is still using the old partition table.
The new table will be used at the next reboot.
The operation has completed successfully.
Error: Partition(s) 5 on /dev/sdb have been written, but we have been unable to inform the kernel of the change, probably because it/they are in use. As a result, the old partition(s) will remain in use. You should reboot now before making further changes.
Warning: The kernel is still using the old partition table.
The new table will be used at the next reboot.
The operation has completed successfully.
Error: Error informing the kernel about modifications to partition /dev/sdb1 -- Device or resource busy. This means Linux won't know about any changes you made to /dev/sdb1 until you reboot -- so you shouldn't mount it or use it in any way before rebooting.
Error: Failed to add partition 1 (Device or resource busy)
ceph-disk: Error: partition 1 for /dev/sdb does not appear to exist
Traceback (most re...
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:22 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:23 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:23 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:23 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:24 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:25 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:25 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:25 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:26 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:26 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:26 INFO mon-relation-
unit-ceph-2: 2015-02-10 04:05:26 ERROR juju-log mon:60: Unable to initialize device: /dev/sdb
unit-ceph-2: 2015-02-10 04:05:26 INFO mon-relation-