ceph-create-keys:ceph-mon is not in quorum: u'probing' spams for 2+ hours, then mon-relation-changed hook fails
Bug #1774648 reported by Jason Hobbs
This bug report is a duplicate of:
Bug #1774666: Bond interfaces stuck at 1500 MTU on Bionic.
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph Monitor Charm | Incomplete | High | Chris MacNaughton | | |
Bug Description
2018-06-01 08:33:58 DEBUG mon-relation-
...
2018-06-01 10:47:20 DEBUG mon-relation-
Then we get a traceback:
http://
bundle & overlay:
http://
This is with bionic queens. We have never seen this in many test runs on xenial queens, but we hit it 100% of the time on bionic queens.
description: updated
tags: added: cdo-release-blocker
description: updated
Changed in charm-ceph-mon:
importance: Undecided → Critical
assignee: nobody → Chris MacNaughton (chris.macnaughton)
status: New → In Progress
I'm having trouble recreating this failure. I'll keep trying to reproduce it, but the deployment worked correctly on this simple deploy:
Model               Controller          Cloud/Region  Version  SLA
ruxton-ruxton-maas  ruxton-ruxton-maas  ruxton-maas   2.3.8    unsupported

App       Version  Status  Scale  Charm     Store       Rev  OS      Notes
ceph-mon  12.2.4   active      3  ceph-mon  jujucharms  327  ubuntu
ceph-osd  12.2.4   active      3  ceph-osd  jujucharms  335  ubuntu
magpie             active      3  magpie    jujucharms   33  ubuntu

Unit         Workload  Agent  Machine  Public address  Ports  Message
ceph-mon/0   active    idle   0/lxd/0  10.245.168.49          Unit is ready and clustered
ceph-mon/1*  active    idle   1/lxd/0  10.245.168.47          Unit is ready and clustered
ceph-mon/2   active    idle   2/lxd/0  10.245.168.48          Unit is ready and clustered
ceph-osd/0*  active    idle   0        10.245.168.44          Unit is ready (1 OSD)
ceph-osd/1   active    idle   1        10.245.168.45          Unit is ready (1 OSD)
ceph-osd/2   active    idle   2        10.245.168.46          Unit is ready (1 OSD)
magpie/1     active    idle   0/lxd/0  10.245.168.49          icmp ok, local hostname ok, dns ok, net mtu ok: 1500, 940 mbit/s
magpie/2*    active    idle   1/lxd/0  10.245.168.47          icmp ok, local hostname ok, dns ok, iperf leader, mtu: 1500
magpie/3     active    idle   2/lxd/0  10.245.168.48          icmp ok, local hostname ok, dns ok, net mtu ok: 1500, 941 mbit/s

Machine  State    DNS            Inst id              Series  AZ       Message
0        started  10.245.168.44  tmrm8d               bionic  AZ1      Deployed
0/lxd/0  started  10.245.168.49  juju-df9140-0-lxd-0  bionic  AZ1      Container started
1        started  10.245.168.45  66wnqf               bionic  AZ2      Deployed
1/lxd/0  started  10.245.168.47  juju-df9140-1-lxd-0  bionic  AZ2      Container started
2        started  10.245.168.46  cbxwat               bionic  default  Deployed
2/lxd/0  started  10.245.168.48  juju-df9140-2-lxd-0  bionic  default  Container started
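
Since this report is marked as a duplicate of the bond MTU bug, and the magpie units above report an MTU of 1500, it may help to compare interface MTUs directly on a machine. The following is only a rough sketch, not part of the charm or of magpie: the function names `read_mtus` and `mismatched` are made up for illustration, and the expected value of 9000 is a hypothetical placeholder for whatever the deployment actually configures. It assumes the standard Linux sysfs layout, where each interface exposes its MTU in `/sys/class/net/<iface>/mtu`.

```python
from pathlib import Path


def read_mtus(sys_net="/sys/class/net"):
    """Return {interface_name: mtu} read from Linux sysfs."""
    mtus = {}
    for iface in sorted(Path(sys_net).iterdir()):
        mtu_file = iface / "mtu"
        try:
            mtus[iface.name] = int(mtu_file.read_text().strip())
        except (OSError, ValueError):
            # Skip entries without a readable, numeric mtu file
            # (for example the plain file bonding_masters).
            continue
    return mtus


def mismatched(mtus, expected):
    """Return only the interfaces whose MTU differs from the expected value."""
    return {name: mtu for name, mtu in mtus.items() if mtu != expected}


if __name__ == "__main__":
    # 9000 is a hypothetical expected value, not taken from this bug report.
    for name, mtu in mismatched(read_mtus(), expected=9000).items():
        print(f"{name}: mtu {mtu}")
```

Run on each host and inside each container, this prints every interface whose MTU differs from the expected value, which makes a bond left at 1500 easy to spot.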