LSB: Ubuntu 14.04.4 LTS
openstack: cloud:trusty-liberty
cinder packages: 2:7.0.1-0ubuntu1~cloud0
cinder charm: lp:charms/trusty/cinder;revno=106
Juju: 1.25.3.1
num_units: 3
related to hacluster charm (which remains idle)
Symptoms:
All three units constantly run cluster-relation-changed, causing restart of all cinder upstart jobs as well as apache2.
If a unit is stopped, the other two stop looping. By restarting stopped unit and juju resolving it, loop starts on all three.
Temp solution to end loop (only applied on cinder/1):
"""
@hooks.hook('cluster-relation-changed',
'cluster-relation-departed')
@restart_on_change(restart_map(), stopstart=True)
def cluster_changed():
#check_db_initialised()
#CONFIGS.write_all()
pass
"""
Such temporary solution was done at 13:08 (see attached 20160325-unit-cinder-1.log). Once all three units settled, I rolledback cluster_changed() config to original code (uncommenting check_db_initialised and CONFIGS.write_all), at 13:09.
Please let me know if you need further details.
cinder-0 attachment shows juju status when all three units are in the loop.
juju status-history cinder/0 shows transition between states when I was stopping peer units (showing active, as it stops looping) or cinder/0 unit itself (showing error state).