OpenStack Cinder-Ceph charm

[RFE] pre and post failover actions

Bug #1925035 reported by Peter Matulis on 2021-04-19

Affects                      Status   Importance  Assigned to  Milestone
OpenStack Cinder-Ceph charm  Triaged  Wishlist    Unassigned

Bug Description

Failing over to a secondary site for Cinder volume replication (volumes backed by two Ceph clusters) takes too long, and can even fail, because of the default Cinder configuration for timeouts and retries.

When one site (cluster) is known to be down, I don't see why a deliberate failover should be subject to long timeouts and retries.

The workaround is to manually change the defaults pre-failover and then revert them post-failover:

$ juju ssh cinder-ceph-a/0
> sudo apt install -y crudini
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connect_timeout 1
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connection_retries 1
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a rados_connection_interval 0
> sudo crudini --set /etc/cinder/cinder.conf cinder-ceph-a replication_connect_timeout 1
> sudo systemctl restart cinder-volume
> exit

$ cinder failover-host cinder@cinder-ceph-a

$ juju ssh cinder-ceph-a/0
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connect_timeout
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connection_retries
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a rados_connection_interval
> sudo crudini --del /etc/cinder/cinder.conf cinder-ceph-a replication_connect_timeout
> sudo systemctl restart cinder-volume
> exit
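
The two halves of the workaround could be wrapped in a small script kept on the unit. This is only a minimal sketch: the function names and the `CONF`, `SECTION` and `RUN` variables are assumptions for illustration and are not part of the charm. It also assumes crudini is already installed, as in the steps above.

```shell
#!/bin/sh
# Sketch of the manual workaround as two reusable functions.
# Assumptions (not from the charm): the function names, and the CONF,
# SECTION and RUN variables. Set RUN=echo to print commands only.
CONF="${CONF:-/etc/cinder/cinder.conf}"
SECTION="${SECTION:-cinder-ceph-a}"
RUN="${RUN:-sudo}"

# Shorten the Ceph connection timeouts/retries before a failover.
pre_failover() {
    $RUN crudini --set "$CONF" "$SECTION" rados_connect_timeout 1 &&
    $RUN crudini --set "$CONF" "$SECTION" rados_connection_retries 1 &&
    $RUN crudini --set "$CONF" "$SECTION" rados_connection_interval 0 &&
    $RUN crudini --set "$CONF" "$SECTION" replication_connect_timeout 1 &&
    $RUN systemctl restart cinder-volume
}

# Remove the overrides again so the Cinder defaults apply.
post_failover() {
    for key in rados_connect_timeout rados_connection_retries \
               rados_connection_interval replication_connect_timeout; do
        $RUN crudini --del "$CONF" "$SECTION" "$key" || return 1
    done
    $RUN systemctl restart cinder-volume
}
```

After sourcing the file on the unit, `pre_failover` would be run before `cinder failover-host` and `post_failover` afterwards. Running with `RUN=echo` prints the commands without changing anything.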

Would it be possible to have charm actions for this? For example:

$ juju run-action --wait cinder-ceph-a pre-failover
$ cinder failover-host cinder@cinder-ceph-a
$ juju run-action --wait cinder-ceph-a post-failover
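
Because the failover itself can fail, the post step should probably run regardless of the outcome so that the shortened timeouts are not left in place. A rough sketch of such a wrapper follows. `failover_with_cleanup` is a hypothetical helper name, and the `pre-failover` and `post-failover` actions shown in the usage comment are the ones proposed in this report, which do not exist yet.

```shell
#!/bin/sh
# Hypothetical helper: run a pre step, the failover, then always the post step.
# Each argument is a command string executed with `sh -c`.
failover_with_cleanup() {
    pre_cmd=$1
    failover_cmd=$2
    post_cmd=$3

    # Do not attempt the failover if the pre step could not be applied.
    sh -c "$pre_cmd" || return 1

    # Remember the failover result, but carry on to the post step either way.
    sh -c "$failover_cmd"
    status=$?

    sh -c "$post_cmd" || echo "warning: post step failed" >&2

    # Report the failover's own exit status to the caller.
    return "$status"
}

# Usage with the proposed (not yet existing) actions:
#   failover_with_cleanup \
#       'juju run-action --wait cinder-ceph-a pre-failover' \
#       'cinder failover-host cinder@cinder-ceph-a' \
#       'juju run-action --wait cinder-ceph-a post-failover'
```

The wrapper returns the exit status of the failover command, so a caller can still tell whether the failover succeeded after the defaults have been restored.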

An alternative would be to implement a --force option in the cinder API client.

Peter Matulis (petermatulis) on 2021-04-19

description:	updated
description:	updated

Peter Matulis (petermatulis) on 2021-05-05

tags:	added: openstack-advocacy

Alex Kavanagh (ajkavanagh) wrote on 2021-05-06:

Triage; looks like an interesting addition.

Changed in charm-cinder-ceph:
importance:	Undecided → Wishlist
status:	New → Triaged
tags:	added: onboarding

Corey Bryant (corey.bryant) on 2021-05-12

tags:	added: good-first-bug
tags:	removed: onboarding

Muhammad Ahmad (ahmadfsbd) on 2022-08-01

Changed in charm-cinder-ceph:
assignee:	nobody → Muhammad Ahmad (ahmadfsbd)

Muhammad Ahmad (ahmadfsbd) on 2022-08-01

Changed in charm-cinder-ceph:
assignee:	Muhammad Ahmad (ahmadfsbd) → nobody
