ceph-radosgw charm fails deploy on first unit every time
Affects | Status | Importance | Assigned to | Milestone
---|---|---|---|---
Ceph RADOS Gateway Charm | Fix Released | High | Unassigned |
Bug Description
I'm deploying OpenStack with all the latest charms as of now, specifically the ceph-radosgw charm at cs:ceph-
I have repeated this at least 10 times, and the result is exactly the same every time:
- first unit (the /0 one) goes into error state (log below)
- other two units (/1 and /2) get into Ready state
- then I remove-unit ceph-radosgw/0 and add-unit again
- it comes back with no problem (as /3) and clusters ok with other two
Here are some logs:
ubuntu@
Model Controller Cloud/Region Version SLA Timestamp
openstack foundations-maas maas_cloud 2.8.6 unsupported 02:41:48Z
App Version Status Scale Charm Store Rev OS Notes
ceph-radosgw 12.2.13 error 3 ceph-radosgw jujucharms 291 ubuntu
filebeat 5.6.16 waiting 3 filebeat jujucharms 33 ubuntu
hacluster-radosgw active 3 hacluster jujucharms 72 ubuntu
landscape-client maintenance 3 landscape-client jujucharms 33 ubuntu
logrotate active 3 logrotated jujucharms 2 ubuntu
nrpe-container active 3 nrpe jujucharms 69 ubuntu
public-
telegraf active 3 telegraf jujucharms 39 ubuntu
Unit Workload Agent Machine Public address Ports Message
ceph-radosgw/0* error idle 18/lxd/3 10.243.168.77 80/tcp hook failed: "mon-relation-
filebeat/31 active idle 10.243.168.77 Filebeat ready.
hacluster-
landscape-
logrotate/33 active idle 10.243.168.77 Unit is ready.
nrpe-container/0* active idle 10.243.168.77 icmp,5666/tcp ready
public-
telegraf/33 active idle 10.243.168.77 9103/tcp Monitoring ceph-radosgw/0
ceph-radosgw/1 active idle 19/lxd/3 10.243.168.68 80/tcp Unit is ready
filebeat/61 active idle 10.243.168.68 Filebeat ready.
hacluster-
landscape-
logrotate/64 active idle 10.243.168.68 Unit is ready.
nrpe-container/29 active idle 10.243.168.68 icmp,5666/tcp ready
public-
telegraf/63 active idle 10.243.168.68 9103/tcp Monitoring ceph-radosgw/1
ceph-radosgw/2 active idle 20/lxd/3 10.243.168.104 80/tcp Unit is ready
filebeat/43 active idle 10.243.168.104 Filebeat ready.
hacluster-
landscape-
logrotate/45 active idle 10.243.168.104 Unit is ready.
nrpe-container/12 active idle 10.243.168.104 icmp,5666/tcp ready
public-
telegraf/45 active idle 10.243.168.104 9103/tcp Monitoring ceph-radosgw/2
Machine State DNS Inst id Series AZ Message
18 started 10.243.168.59 brtlvmrs0642co bionic zone3 Deployed
18/lxd/3 started 10.243.168.77 juju-d51b21-
19 started 10.243.168.53 brtlvmrs0641co bionic zone2 Deployed
19/lxd/3 started 10.243.168.68 juju-d51b21-
20 started 10.243.168.36 brtlvmrs0640co bionic zone1 Deployed
20/lxd/3 started 10.243.168.104 juju-d51b21-
root@juju-
Synchronizing state of radosgw.service with SysV service script with /lib/systemd/
Executing: /lib/systemd/
Unit /etc/systemd/
Reading package lists... Done
Building dependency tree
Reading state information... Done
python-dbus is already the newest version (1.2.6-1).
The following package was automatically installed and is no longer required:
libfreetype6
Use 'apt autoremove' to remove it.
0 upgraded, 0 newly installed, 0 to remove and 0 not upgraded.
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:39:39.406753 7f21fb27ce40 -1 failed for service _ceph-mon._tcp
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:39:42.444537 7f64cc084e40 -1 failed for service _ceph-mon._tcp
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:39:48.488368 7f6f70fd8e40 -1 failed for service _ceph-mon._tcp
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:39:57.534079 7fb6c8f9fe40 -1 failed for service _ceph-mon._tcp
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:40:09.634685 7f18a2bece40 -1 failed for service _ceph-mon._tcp
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:40:24.686832 7f65dac70e40 -1 failed for service _ceph-mon._tcp
Traceback (most recent call last):
File "./hooks/
hooks.
File "/var/lib/
self.
File "./hooks/
_mon_relation()
File "/var/lib/
stopstart, restart_functions)
File "/var/lib/
r = lambda_f()
File "/var/lib/
(lambda: f(*args, **kwargs)), __restart_
File "./hooks/
if zone not in multisite.
File "/var/lib/
result = json.loads(
File "/var/lib/
return f(*args, **kwargs)
File "/var/lib/
return subprocess.
File "/usr/lib/
**kwargs)
File "/usr/lib/
output=stdout, stderr=stderr)
subprocess.
root@juju-
parse error setting 'err_to_syslog' to '' (Expected option value to be integer, got '')
parse error setting 'log_to_syslog' to '' (Expected option value to be integer, got '')
unable to get monitor info from DNS SRV with service name: ceph-mon
no monitors specified to connect to.
couldn't init storage provider
2020-11-26 02:38:31.161470 7fb1169f7e40 -1 failed for service _ceph-mon._tcp
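The traceback above appears to end in a subprocess error raised while the hook parses a command's JSON output, at a point where the log reports "no monitors specified to connect to." The following is a minimal sketch of one way a caller could tolerate that condition and retry later. It is not the charm's actual code or the released fix; `run_json`, `NO_MONITORS`, and the stand-in commands are hypothetical names used only for illustration.

```python
import json
import subprocess
import sys

# Message seen in the log above when no monitor addresses are configured yet.
NO_MONITORS = "no monitors specified to connect to."


def run_json(cmd):
    """Run cmd and return its stdout parsed as JSON.

    Returns None when the command exits non-zero with the "no monitors"
    message on stderr; any other failure is re-raised unchanged.
    """
    try:
        out = subprocess.check_output(cmd, stderr=subprocess.PIPE)
    except subprocess.CalledProcessError as exc:
        stderr = (exc.stderr or b"").decode("utf-8", errors="replace")
        if NO_MONITORS in stderr:
            return None  # not ready yet; the caller can defer and retry
        raise
    return json.loads(out.decode("utf-8"))


if __name__ == "__main__":
    # Hypothetical stand-in commands that mimic a failing and a working call.
    failing = [
        sys.executable, "-c",
        "import sys; sys.stderr.write('no monitors specified to connect to.\\n'); sys.exit(1)",
    ]
    working = [sys.executable, "-c", "print('[\"default\"]')"]
    print(run_json(failing))
    print(run_json(working))
```

Returning `None` instead of raising lets a hook exit cleanly and run again once the monitor details arrive, rather than leaving the unit in an error state. Whether that is appropriate depends on how the real hook is structured.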
Changed in charm-ceph-radosgw:
assignee: nobody → Aurelien Lourot (aurelien-lourot)
status: Triaged → In Progress
tags: added: ps5
Changed in charm-ceph-radosgw:
status: In Progress → Fix Committed
milestone: none → 20.10
Changed in charm-ceph-radosgw:
status: Fix Committed → Fix Released
assignee: Aurelien Lourot (aurelien-lourot) → nobody
Seen a few times on OSCI: https://review.opendev.org/c/openstack/charm-ceph-radosgw/+/764305