Intermittent deploy failure
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ceph RADOS Gateway Charm | Fix Released | High | Frode Nordahl | | |
Bug Description
$ juju status ceph-radosgw --relations
Model Controller Cloud/Region Version SLA Timestamp
zaza-fc6306dea031 fnordahl-
App Version Status Scale Charm Store Rev OS Notes
ceph-radosgw 15.1.0 blocked 1 ceph-radosgw jujucharms 356 ubuntu
Unit Workload Agent Machine Public address Ports Message
ceph-radosgw/0* blocked idle 6 10.5.0.3 80/tcp Services not running that should be: <email address hidden>
Machine State DNS Inst id Series AZ Message
6 started 10.5.0.3 4bb9dfd8-
Relation provider Requirer Interface Type Message
ceph-mon:radosgw ceph-radosgw:mon ceph-radosgw regular
ceph-radosgw:
keystone:
2020-03-21 11:26:53 INFO juju-log identity-
2020-03-21 11:26:53 INFO juju-log identity-
2020-03-21 11:26:55 DEBUG juju-log identity-
2020-03-21 11:26:55 INFO juju-log identity-
2020-03-21 11:26:57 DEBUG juju-log identity-
2020-03-21 11:27:00 DEBUG juju-log identity-
2020-03-21 11:27:00 INFO juju-log identity-
2020-03-21 11:27:01 INFO juju-log identity-
2020-03-21 11:27:01 INFO juju-log identity-
2020-03-21 11:27:01 INFO juju-log identity-
2020-03-21 11:27:01 DEBUG juju-log identity-
2020-03-21 11:27:03 INFO juju-log identity-
2020-03-21 11:27:03 INFO juju-log identity-
2020-03-21 11:27:03 INFO juju-log identity-
2020-03-21 11:27:03 DEBUG juju-log identity-
2020-03-21 11:27:04 INFO juju-log identity-
2020-03-21 11:27:04 INFO juju-log identity-
2020-03-21 11:27:04 INFO juju-log identity-
2020-03-21 11:27:04 INFO juju-log identity-
2020-03-21 11:27:04 DEBUG juju-log identity-
2020-03-21 11:27:06 INFO juju-log identity-
2020-03-21 11:27:06 INFO juju-log identity-
2020-03-21 11:27:06 INFO juju-log identity-
2020-03-21 11:27:06 DEBUG identity-
2020-03-21 11:27:06 DEBUG identity-
2020-03-21 11:27:06 DEBUG identity-
2020-03-21 11:27:06 DEBUG identity-
2020-03-21 11:27:07 DEBUG identity-
2020-03-21 11:27:07 DEBUG identity-
2020-03-21 11:27:07 INFO juju-log identity-
# systemctl status apache2
● apache2.service - The Apache HTTP Server
Loaded: loaded (/lib/systemd/
Drop-In: /lib/systemd/
Active: failed (Result: exit-code) since Sat 2020-03-21 11:27:06 UTC; 3h 15min ago
Process: 6895 ExecReload=
Process: 14108 ExecStart=
Main PID: 5564 (code=exited, status=1/FAILURE)
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
Mar 21 11:27:06 juju-ddb957-
# netstat -nepa |grep LISTEN
tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 0 38964 7263/haproxy
tcp 0 0 252.0.3.1:53 0.0.0.0:* LISTEN 0 23121 2374/dnsmasq
tcp 0 0 127.0.0.53:53 0.0.0.0:* LISTEN 101 15544 611/systemd-resolve
tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 0 18826 910/sshd
tcp 0 0 127.0.0.1:8888 0.0.0.0:* LISTEN 0 38962 7263/haproxy
tcp6 0 0 :::80 :::* LISTEN 0 38965 7263/haproxy
tcp6 0 0 :::22 :::* LISTEN 0 18837 910/sshd
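In the listing above, port 80 is held by haproxy on both IPv4 and IPv6, and no apache2 or radosgw listener appears. As a minimal sketch for summarising such output, the following groups listening TCP sockets by port. The function name `listeners_by_port` is hypothetical, and the column positions assume the `netstat -nepa` layout shown above (local address in the fourth column, PID/program in the ninth). It only reports who is listening; it does not establish why a service failed.

```python
def listeners_by_port(netstat_output):
    """Map each listening TCP port to the set of program names bound to it.

    Assumes `netstat -nepa` columns as in the listing above:
    proto, recv-q, send-q, local address, foreign address, state,
    user, inode, PID/program.
    """
    owners = {}
    for line in netstat_output.splitlines():
        fields = line.split()
        # Skip headers, non-TCP sockets and anything not in LISTEN state.
        if len(fields) < 9 or not fields[0].startswith("tcp") or fields[5] != "LISTEN":
            continue
        # The port is whatever follows the last ':' (works for ':::80' too).
        port = int(fields[3].rsplit(":", 1)[1])
        # 'PID/program' -> 'program'
        program = fields[8].split("/", 1)[-1]
        owners.setdefault(port, set()).add(program)
    return owners


SAMPLE = """\
tcp 0 0 0.0.0.0:80 0.0.0.0:* LISTEN 0 38964 7263/haproxy
tcp 0 0 252.0.3.1:53 0.0.0.0:* LISTEN 0 23121 2374/dnsmasq
tcp 0 0 127.0.0.53:53 0.0.0.0:* LISTEN 101 15544 611/systemd-resolve
tcp 0 0 0.0.0.0:22 0.0.0.0:* LISTEN 0 18826 910/sshd
tcp 0 0 127.0.0.1:8888 0.0.0.0:* LISTEN 0 38962 7263/haproxy
tcp6 0 0 :::80 :::* LISTEN 0 38965 7263/haproxy
tcp6 0 0 :::22 :::* LISTEN 0 18837 910/sshd
"""

if __name__ == "__main__":
    for port, programs in sorted(listeners_by_port(SAMPLE).items()):
        print(port, ", ".join(sorted(programs)))
```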
# systemctl status <email address hidden>
● <email address hidden> - Ceph rados gateway
Loaded: loaded (/lib/systemd/
Active: failed (Result: exit-code) since Sat 2020-03-21 11:27:07 UTC; 3h 17min ago
Process: 14228 ExecStart=
Main PID: 14228 (code=exited, status=1/FAILURE)
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
# journalctl -b |grep radosgw
[ ... ]
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
Mar 21 11:27:07 juju-ddb957-
The ceph-radosgw charm appears never to pick up the broker request response from ceph-mon:
2020-03-21 11:25:14 DEBUG juju-log mon:36: Request already sent but not complete, not sending new request
The response is present on only one of the unit-to-unit relations, which may or may not be OK:
ubuntu@test:~$ juju run --unit ceph-radosgw/0 'relation-get -r mon:36 - ceph-mon/0'
auth: cephx
ceph-public-
egress-subnets: 10.5.0.38/32
fsid: f82f86bc-
ingress-address: 10.5.0.38
private-address: 10.5.0.38
rgw.juju-
ubuntu@test:~$ juju run --unit ceph-radosgw/0 'relation-get -r mon:36 - ceph-mon/1'
auth: cephx
broker-
while processing requests: {''api-version'': 1, ''ops'': [{''op'': ''create-pool'',
''name'': ''default.
20, ''group'': ''objects'', ''group-
None, ''max-objects'': None}, {''op'': ''create-pool'', ''name'': ''default.
''replicas'': 3, ''pg_num'': None, ''weight'': 0.1, ''group'': ''objects'', ''group-
None, ''app-name'': ''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'':
''create-pool'', ''name'': ''default.
None, ''weight'': 0.1, ''group'': ''objects'', ''group-
''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'': ''create-pool'',
''name'': ''default.rgw.gc'', ''replicas'': 3, ''pg_num'': None, ''weight'': 0.1,
''group'': ''objects'', ''group-
None, ''max-objects'': None}, {''op'': ''create-pool'', ''name'': ''default.
''replicas'': 3, ''pg_num'': None, ''weight'': 0.1, ''group'': ''objects'', ''group-
None, ''app-name'': ''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'':
''create-pool'', ''name'': ''default.
None, ''weight'': 0.1, ''group'': ''objects'', ''group-
''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'': ''create-pool'',
''name'': ''default.
''group'': ''objects'', ''group-
None, ''max-objects'': None}, {''op'': ''create-pool'', ''name'': ''default.
''replicas'': 3, ''pg_num'': None, ''weight'': 0.1, ''group'': ''objects'', ''group-
None, ''app-name'': ''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'':
''create-pool'', ''name'': ''default.
None, ''weight'': 0.1, ''group'': ''objects'', ''group-
''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'': ''create-pool'',
''name'': ''default.
0.1, ''group'': ''objects'', ''group-
None, ''max-objects'': None}, {''op'': ''create-pool'', ''name'': ''default.
''replicas'': 3, ''pg_num'': None, ''weight'': 0.1, ''group'': ''objects'', ''group-
None, ''app-name'': ''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'':
''create-pool'', ''name'': ''default.
None, ''weight'': 0.1, ''group'': ''objects'', ''group-
''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'': ''create-pool'',
''name'': ''default.
1.0, ''group'': ''objects'', ''group-
None, ''max-objects'': None}, {''op'': ''create-pool'', ''name'': ''default.
''replicas'': 3, ''pg_num'': None, ''weight'': 3.0, ''group'': ''objects'', ''group-
None, ''app-name'': ''rgw'', ''max-bytes'': None, ''max-objects'': None}, {''op'':
''create-pool'', ''name'': ''.rgw.root'', ''replicas'': 3, ''pg_num'': None, ''weight'':
0.1, ''group'': ''objects'', ''group-
None, ''max-objects'': None}], ''request-id'': ''f41b0e16-
ceph-public-
egress-subnets: 10.5.0.18/32
fsid: f82f86bc-
ingress-address: 10.5.0.18
private-address: 10.5.0.18
rgw.juju-
ubuntu@test:~$ juju run --unit ceph-radosgw/0 'relation-get -r mon:36 - ceph-mon/2'
auth: cephx
ceph-public-
egress-subnets: 10.5.0.4/32
fsid: f82f86bc-
ingress-address: 10.5.0.4
private-address: 10.5.0.4
rgw.juju-
Note that the 'broker-
while processing requests:' message was caused by a bug in the Ceph Octopus PG autoscaling code (see bug 1868587).
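To make the per-unit comparison above repeatable, here is a minimal sketch that parses `relation-get` style output and reports which remote units carry any key beginning with `broker-`. The helper names (`parse_relation_get`, `units_with_broker_keys`) and the sample key `broker-example` are hypothetical placeholders; the full key name is truncated in the output above, so the sketch matches on the prefix only. The parser is a simplification that assumes top-level keys start at column 0 and continuation lines are indented; it is not a YAML parser.

```python
import re

# A top-level "key: value" line, e.g. "auth: cephx" (simplified assumption).
KEY_LINE = re.compile(r"^([A-Za-z0-9_.-]+):(?:\s+(.*))?$")


def parse_relation_get(output):
    """Parse flat 'key: value' text into a dict, folding indented
    continuation lines into the preceding key's value."""
    data, last_key = {}, None
    for line in output.splitlines():
        match = KEY_LINE.match(line)
        if match:
            last_key = match.group(1)
            data[last_key] = (match.group(2) or "").strip()
        elif last_key is not None and line.strip():
            data[last_key] = (data[last_key] + " " + line.strip()).strip()
    return data


def units_with_broker_keys(relation_data, prefix="broker-"):
    """Return {unit: [matching keys]} for units whose settings contain
    at least one key starting with the given prefix."""
    found = {}
    for unit, settings in relation_data.items():
        keys = sorted(k for k in settings if k.startswith(prefix))
        if keys:
            found[unit] = keys
    return found


if __name__ == "__main__":
    # Illustrative data only; "broker-example" stands in for the truncated key.
    outputs = {
        "ceph-mon/0": "auth: cephx\ningress-address: 10.5.0.38\n",
        "ceph-mon/1": "auth: cephx\nbroker-example: first part\n  second part\n"
                      "ingress-address: 10.5.0.18\n",
        "ceph-mon/2": "auth: cephx\ningress-address: 10.5.0.4\n",
    }
    parsed = {unit: parse_relation_get(text) for unit, text in outputs.items()}
    print(units_with_broker_keys(parsed))
```

In practice the text for each unit would come from the `juju run --unit ceph-radosgw/0 'relation-get -r mon:36 - ceph-mon/N'` commands shown above; the sketch deliberately leaves that step out so it stays independent of any particular model.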
summary: |
- Intermittent deploy failure
+ [Ussuri] Intermittent deploy failure |
description: | updated |
summary: |
- [Ussuri] Intermittent deploy failure
+ Intermittent deploy failure |
description: | updated |
Changed in charm-ceph-radosgw: | |
status: | New → Triaged |
importance: | Undecided → High |
summary: |
- Intermittent deploy failure
+ Intermittent deploy failure with certificates relation |
description: | updated |
summary: |
- Intermittent deploy failure with certificates relation
+ Intermittent deploy failure |
description: | updated |
description: | updated |
Changed in charm-ceph-radosgw: | |
milestone: | 20.05 → 20.08 |
Changed in charm-ceph-radosgw: | |
status: | Fix Committed → Fix Released |
https://review.opendev.org/#/c/714400/