[RFE] pre and post failover actions
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
OpenStack Cinder-Ceph charm | Triaged | Wishlist | Unassigned |
Bug Description
Failing over to a secondary site for Cinder volume replication (volumes backed by two Ceph clusters) takes too long, and can even fail, because of the default Cinder settings for timeouts and retries.
When one site (cluster) is known to be down, I don't see why a deliberate failover should be subject to long timeouts and retries.
The workaround is to manually change the defaults pre-failover and then revert them post-failover:
$ juju ssh cinder-ceph-a/0
> sudo apt install -y crudini
> sudo crudini --set /etc/cinder/
> sudo crudini --set /etc/cinder/
> sudo crudini --set /etc/cinder/
> sudo crudini --set /etc/cinder/
> sudo systemctl restart cinder-volume
> exit
$ cinder failover-host cinder@
$ juju ssh cinder-ceph-a/0
> sudo crudini --del /etc/cinder/
> sudo crudini --del /etc/cinder/
> sudo crudini --del /etc/cinder/
> sudo crudini --del /etc/cinder/
> sudo systemctl restart cinder-volume
> exit
Is it possible to have actions for this? For example:
$ juju run-action --wait cinder-ceph-a pre-failover
$ cinder failover-host cinder@
$ juju run-action --wait cinder-ceph-a post-failover
An alternative would be to implement a --force option in the cinder API client.
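To make the request concrete, here is a rough sketch of the logic such a pair of actions could wrap. It is not the charm's implementation: the function names, the `DRY_RUN` switch, and the config file, section, and option names passed in are all hypothetical placeholders that the operator would have to supply, since the exact settings are not spelled out above. It only relies on the `crudini --set` / `crudini --del` and `systemctl restart cinder-volume` calls already used in the workaround.

```shell
#!/bin/sh
# Hypothetical sketch of what pre-failover / post-failover actions could do.
# Assumptions: function names, DRY_RUN, and every config path, section, and
# option passed in are placeholders chosen for illustration only.

DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands, anything else = run them

# Print the command in dry-run mode, otherwise execute it.
run() {
    if [ "$DRY_RUN" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# pre_failover CONF SECTION KEY=VALUE...
# Apply temporary overrides, then restart cinder-volume to pick them up.
pre_failover() {
    conf="$1"; section="$2"; shift 2
    for kv in "$@"; do
        # Split KEY=VALUE at the first '='.
        run sudo crudini --set "$conf" "$section" "${kv%%=*}" "${kv#*=}"
    done
    run sudo systemctl restart cinder-volume
}

# post_failover CONF SECTION KEY...
# Delete the overrides so the defaults apply again, then restart.
post_failover() {
    conf="$1"; section="$2"; shift 2
    for key in "$@"; do
        run sudo crudini --del "$conf" "$section" "$key"
    done
    run sudo systemctl restart cinder-volume
}
```

With the default dry run, calling `pre_failover "$CONF" "$SECTION" some_option=1` (where `CONF`, `SECTION`, and `some_option` stand in for the real values) only prints the commands it would run, which makes it easy to review them before touching the unit. Note that `post_failover` deletes the keys rather than restoring earlier values, matching the workaround, so it assumes the options were unset beforehand.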
description: updated
description: updated
tags: added: openstack-advocacy
tags: added: good-first-bug
tags: removed: onboarding
Changed in charm-cinder-ceph:
assignee: nobody → Muhammad Ahmad (ahmadfsbd)
Changed in charm-cinder-ceph:
assignee: Muhammad Ahmad (ahmadfsbd) → nobody
Triage; looks like an interesting addition.