First Ceph node provides cluster auth keys and can't be replaced

Bug #1211693 reported by Ian Wells
6
This bug affects 1 person
Affects Status Importance Assigned to Milestone
Cisco Openstack
Triaged
Wishlist
Unassigned

Bug Description

The current Ceph install uses the first mon node to create the admin key. (Conversely, the mon key - used to decide if a node has permission to join the mnon quorum - is statically defined in the site.pp file.)

This means that if the first mon node is replaced - as is permissible and expected in the event of a hardware failure in an HA system - it will recreate the admin key, and retransmit it to the entire cluster, changing the cluster's credentials and likely resulting in odd failures (particularly if anything off-system is using the Ceph cluster).

In parallel, the mon key is hardcoded into the site.pp file and, since it's not obvious that it should be changed and there are no instructions as to what format its replacement should be in, likely all clusters installed will have the same mon key and it serves as no security at all.

I suggest the following:

- in install_os_puppet or similar, create both a mon and admin key and store them on the boot node
- pass the mon and admin keys to all nodes during installation
- pass the mon and admin keys to nodes during reinstallation

Added bonus is that puppet runs fewer times to get the install done, as the admin key is available on the first run (otherwise the mon node takes 3 runs to settle and it's not until the second that the key is put in storedconfig for other nodes to use).

Ian Wells (ijw-ubuntu)
summary: - Can't replace mon node with Ceph running
+ First Ceph node provides cluster auth keys and can't be replaced
description: updated
Changed in openstack-cisco:
status: New → Triaged
importance: Undecided → Wishlist
assignee: nobody → Don Talton (dotalton)
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.