mysql-innodb-cluster stuck in "waiting" state for Focal Ussuri deployment
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
MySQL InnoDB Cluster Charm | Expired | Undecided | Unassigned | | |
Bug Description
During deployment of Focal Ussuri, mysql-innodb-
"SystemError: RuntimeError: Dba.create_cluster: Group Replication failed to start"
mysql-innodb-
Ussuri Bundle - cs:bundle/
mysql-innodb-
cs:mysql-
cs:~openstack-
Debug logs -
charm: cs:mysql-
=======
root@focalmaas:~# juju status mysql-innodb-
Model Controller Cloud/Region Version SLA Timestamp
controller maas-controller mymaas/default 2.8.6 unsupported 10:56:10Z
App Version Status Scale Charm Store Rev OS Notes
mysql-innodb-
Unit Workload Agent Machine Public address Ports Message
mysql-innodb-
mysql-innodb-
mysql-innodb-
Machine State DNS Inst id Series AZ Message
1 started 172.172.1.4 NODE2 focal default Deployed
1/lxd/3 started 172.172.1.9 juju-9981a4-1-lxd-3 focal default Container started
2 started 172.172.1.5 NODE3 focal default Deployed
2/lxd/2 started 172.172.1.18 juju-9981a4-2-lxd-2 focal default Container started
3 started 172.172.1.6 NODE4 focal default Deployed
3/lxd/2 started 172.172.1.11 juju-9981a4-3-lxd-2 focal default Container started
root@focalmaas:~#
charm: cs:~openstack-
=======
root@focalmaas:~# juju status mysql-innodb-
Model Controller Cloud/Region Version SLA Timestamp
controller maas-controller mymaas/default 2.8.6 unsupported 11:21:57Z
App Version Status Scale Charm Store Rev OS Notes
mysql-innodb-
Unit Workload Agent Machine Public address Ports Message
mysql-innodb-
mysql-innodb-
mysql-innodb-
Machine State DNS Inst id Series AZ Message
1 started 172.172.1.4 NODE2 focal default Deployed
1/lxd/7 started 172.172.1.11 juju-9981a4-1-lxd-7 focal default Container started
2 started 172.172.1.5 NODE3 focal default Deployed
2/lxd/6 started 172.172.1.9 juju-9981a4-2-lxd-6 focal default Container started
3 started 172.172.1.6 NODE4 focal default Deployed
3/lxd/6 started 172.172.1.18 juju-9981a4-3-lxd-6 focal default Container started
root@focalmaas:~#
unit-mysql-
unit-mysql-
WARNING: The member will only proceed according to its exitStateAction if auto-rejoin fails (i.e. all retry attempts are exhausted).
Validating instance configuration at 172.172.
This instance reports its own address as 172.172.1.11:3306
Instance configuration is suitable.
WARNING: The member will only proceed according to its exitStateAction if auto-rejoin fails (i.e. all retry attempts are exhausted).
NOTE: Group Replication will communicate with other members using '172.172.
Creating InnoDB cluster 'jujuCluster' on '172.172.
Adding Seed Instance...
ERROR: Unable to start Group Replication for instance '172.172.
Traceback (most recent call last):
File "<string>", line 2, in <module>
SystemError: RuntimeError: Dba.create_cluster: Group Replication failed to start: MySQL Error 3092 (HY000): 172.172.1.11:3306: The server is not configured properly to be an active member of the group. Please see more details on error log.
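When triaging several unit logs, it can help to pull the error code and instance address out of lines like the one above. The following is a minimal sketch, assuming the error line keeps the `MySQL Error <code> (<sqlstate>): <host:port>: <message>` shape seen in this log; the function name `parse_mysql_error` is hypothetical and not part of the charm or MySQL Shell.

```python
import re

# Matches the "MySQL Error NNNN (SQLSTATE): host:port: message" portion of a log line.
# The shape is assumed from the single error line quoted in this report.
MYSQL_ERROR_RE = re.compile(
    r"MySQL Error (?P<code>\d+) \((?P<sqlstate>\w+)\): "
    r"(?P<address>[\d.]+:\d+): (?P<message>.+)"
)


def parse_mysql_error(line):
    """Return a dict with code, sqlstate, address and message, or None if no match."""
    match = MYSQL_ERROR_RE.search(line)
    if match is None:
        return None
    fields = match.groupdict()
    fields["code"] = int(fields["code"])
    return fields


# The error line from the log above, split only for readability.
line = (
    "SystemError: RuntimeError: Dba.create_cluster: Group Replication failed to start: "
    "MySQL Error 3092 (HY000): 172.172.1.11:3306: The server is not configured properly "
    "to be an active member of the group. Please see more details on error log."
)

print(parse_mysql_error(line))
```

The message itself points at the MySQL error log on the instance, so the parsed address only tells you which container to look in.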
I believe we may be seeing a similar issue here: https://solutions.qa.canonical.com/testruns/testRun/90f2cf18-088c-49ef-abbc-a73d45530cfe
Crashdump: https://oil-jenkins.canonical.com/artifacts/90f2cf18-088c-49ef-abbc-a73d45530cfe/generated/generated/kubernetes/juju-crashdump-kubernetes-2021-04-23-17.51.35.tar.gz
In this example, mysql/0 stays in the waiting state and does not join the cluster, causing the other two units to hang as well.
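To spot which units are stuck, one option is to filter the output of `juju status --format=json` for units whose workload status is `waiting`. This is a rough sketch, not something taken from the crashdump: the helper name `waiting_units` is hypothetical, and the sample data below is invented for illustration (the application name `mysql` is assumed from the unit name mentioned above).

```python
import json


def waiting_units(status, app_name):
    """Return sorted unit names of app_name whose workload status is 'waiting'.

    `status` is the parsed JSON document printed by `juju status --format=json`.
    """
    units = status.get("applications", {}).get(app_name, {}).get("units", {})
    return sorted(
        name
        for name, unit in units.items()
        if unit.get("workload-status", {}).get("current") == "waiting"
    )


# Illustrative sample only; real input would come from `juju status --format=json`.
sample = json.loads("""
{
  "applications": {
    "mysql": {
      "units": {
        "mysql/0": {"workload-status": {"current": "waiting"}},
        "mysql/1": {"workload-status": {"current": "waiting"}},
        "mysql/2": {"workload-status": {"current": "active"}}
      }
    }
  }
}
""")

print(waiting_units(sample, "mysql"))
```

On a live model the same function could be fed `json.load(sys.stdin)` with the status command piped in.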