[RFE] pre and post failover actions

Bug #1925035 reported by Peter Matulis
This bug affects 1 person
Affects: OpenStack Cinder-Ceph charm
Status: Triaged
Importance: Wishlist
Assigned to: Unassigned
Milestone: (none)

Bug Description

Failing over to a secondary site for Cinder volume replication (volumes backed by two Ceph clusters) takes too long, and can even fail, because the default Cinder configuration imposes long connection timeouts and repeated retries.

When one site (cluster) is already known to be down, I don't see why a deliberate failover should be subject to long timeouts and retries.

The workaround is to manually change the defaults pre-failover and then revert them post-failover:

$ juju ssh cinder-ceph-a/0
> sudo apt install -y crudini
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connect_timeout 1
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connection_retries 1
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connection_interval 0
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a replication_connect_timeout 1
> sudo systemctl restart cinder-volume
> exit

$ cinder failover-host cinder@cinder-ceph-a

$ juju ssh cinder-ceph-a/0
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connect_timeout
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connection_retries
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connection_interval
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a replication_connect_timeout
> sudo systemctl restart cinder-volume
> exit
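
For reference, the two manual phases above could be wrapped in a small helper until charm actions exist. This is only a sketch: the function names pre_failover and post_failover and the CONF, SECTION and RUN variables are my own assumptions, not something the charm provides. It expects crudini to be installed on the unit and needs root privileges. Setting RUN=echo prints the commands instead of executing them.

```shell
#!/bin/sh
# Sketch only: wraps the manual workaround from this report.
# CONF, SECTION and RUN are hypothetical knobs, not charm options.
CONF="${CONF:-/etc/cinder/cinder.conf}"
SECTION="${SECTION:-cinder-ceph-a}"
# Leave RUN empty to execute the commands; set RUN=echo for a dry run.
RUN="${RUN:-}"

# Shorten the Ceph connection timeouts/retries before the failover.
pre_failover() {
    $RUN crudini --set "$CONF" "$SECTION" rados_connect_timeout 1
    $RUN crudini --set "$CONF" "$SECTION" rados_connection_retries 1
    $RUN crudini --set "$CONF" "$SECTION" rados_connection_interval 0
    $RUN crudini --set "$CONF" "$SECTION" replication_connect_timeout 1
    $RUN systemctl restart cinder-volume
}

# Drop the overrides again so the Cinder defaults apply after the failover.
post_failover() {
    for key in rados_connect_timeout rados_connection_retries \
               rados_connection_interval replication_connect_timeout; do
        $RUN crudini --del "$CONF" "$SECTION" "$key"
    done
    $RUN systemctl restart cinder-volume
}
```

Saved on the unit as, say, failover-helper.sh (a made-up name), it could be used as `sudo sh -c '. ./failover-helper.sh && pre_failover'` before running `cinder failover-host`, and the same with `post_failover` afterwards.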

Would it be possible to provide charm actions for this? For example:

$ juju run-action --wait cinder-ceph-a pre-failover
$ cinder failover-host cinder@cinder-ceph-a
$ juju run-action --wait cinder-ceph-a post-failover

An alternative would be to implement a --force option in the cinder API client.

description: updated
tags: added: openstack-advocacy
Alex Kavanagh (ajkavanagh) wrote :

Triage; looks like an interesting addition.

Changed in charm-cinder-ceph:
importance: Undecided → Wishlist
status: New → Triaged
tags: added: onboarding
tags: added: good-first-bug
tags: removed: onboarding
Changed in charm-cinder-ceph:
assignee: nobody → Muhammad Ahmad (ahmadfsbd)
Changed in charm-cinder-ceph:
assignee: Muhammad Ahmad (ahmadfsbd) → nobody