removal of principal ceph-osd charm stuck in 'executing' state

Bug #1655486 reported by Shane Peters
This bug affects 5 people
Affects                            Status        Importance  Assigned to         Milestone
Canonical Juju                     Fix Released  High        Christian Muirhead
Ceph OSD Charm                     Invalid       Undecided   Unassigned
ceph-osd (Juju Charms Collection)  Invalid       Undecided   Unassigned

Bug Description

Attempting to remove ceph-osd with an ntp subordinate leaves the unit stuck in the terminated/executing/(stop) state. Without the ntp relation, it removes cleanly.

Oddly enough, this doesn't occur in the 'ceph' charm.

# juju remove-application ceph-osd

Unit          Workload    Agent      Machine  Public address  Ports  Message
ceph-osd/27*  terminated  executing  55       10.191.162.249         (stop)
  ntp/49      active      idle                10.191.162.249         Unit is ready

I've tried manually removing relations as well as the units themselves to no avail.

Reproduce
===================
$ juju deploy ntp
$ juju deploy ceph --config ceph_config.yaml
$ juju deploy ceph-osd --config ceph_config.yaml

$ juju add-relation ceph ntp
$ juju add-relation ceph-osd ntp
$ juju add-relation ceph-osd ceph

$ juju remove-application ceph-osd
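
After the steps above, the stuck state can be spotted by filtering the status output. The following is a minimal sketch, not part of the original report: the function name `stuck_units` is hypothetical, and it assumes the default tabular `juju status` layout shown in the description, where the second and third columns are Workload and Agent.

```shell
#!/bin/sh
# Hypothetical helper: read tabular `juju status` output on stdin and print
# every unit whose workload is "terminated" while its agent is still
# "executing" (the symptom described in this bug).
# Assumes column 1 = Unit, column 2 = Workload, column 3 = Agent.
stuck_units() {
    awk '$2 == "terminated" && $3 == "executing" {
        sub(/\*$/, "", $1)   # drop the trailing "*" that marks the leader unit
        print $1
    }'
}
```

It could be used as `juju status ceph-osd | stuck_units`; an empty result would suggest the removal is not stuck in this particular state.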

Tags: sts
Shane Peters (shaner)
tags: added: sts
Shane Peters (shaner) wrote :

Series: Xenial
Juju: 2.0.2
Charm: ceph-osd 10.2.3

James Page (james-page) wrote :

Please can you attach the log files from /var/log/juju/*.

Marking 'Incomplete' for now - I'm not sure why this specifically only affects the ceph-osd charm (not the ceph charm as described).

Shane Peters (shaner) wrote :
description: updated
Anastasia (anastasia-macmood) wrote :

This looks oddly similar to https://bugs.launchpad.net/bugs/1655132... Maybe a duplicate?

Changed in juju:
status: New → Triaged
importance: Undecided → High
milestone: none → 2.2.0
Billy Olsen (billy-olsen) wrote :

I did not encounter this in my environment when I removed the ceph-osd service/application with the ntp subordinate. I suspect this is more of an issue on the juju unit agent (evidence somewhat agrees with another sighting that was unrelated to ceph-osd charm). I'll mark the task against the charm invalid.

Changed in ceph-osd (Juju Charms Collection):
status: New → Incomplete
Billy Olsen (billy-olsen) wrote :

Actually marking as Incomplete pending evidence it is an issue in the charm.

James Page (james-page)
Changed in charm-ceph-osd:
status: New → Incomplete
Changed in ceph-osd (Juju Charms Collection):
status: Incomplete → Invalid
Curtis Hovey (sinzui)
Changed in juju:
milestone: 2.2-beta1 → 2.2-beta2
Curtis Hovey (sinzui)
Changed in juju:
milestone: 2.2-beta2 → 2.2-beta3
William Grant (wgrant) wrote :

I've seen this with cs:ubuntu with a single subordinate on a fresh 2.1.1 env. terminated/executing/(stop) persisted even across multiple reboots of the controller and the affected unit. There was nothing in the unit agent log to indicate that something had gone wrong after the hook fired the first time, and the log for the hook firing never appeared again.

Changed in juju:
milestone: 2.2-beta3 → 2.2-beta4
Changed in juju:
milestone: 2.2-beta4 → 2.2-rc1
Christian Muirhead (2-xtian) wrote :

I can reproduce this behaviour using wordpress and mysql with ntp related to both. It seems to be caused by the same problem as https://bugs.launchpad.net/juju/+bug/1686696.

PR for the fix: https://github.com/juju/juju/pull/7369

Changed in juju:
status: Triaged → In Progress
assignee: nobody → Christian Muirhead (2-xtian)
Changed in juju:
status: In Progress → Fix Committed
Shane Peters (shaner)
Changed in charm-ceph-osd:
status: Incomplete → Invalid
Changed in juju:
status: Fix Committed → Fix Released