after reboot all mysql-routers report error
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
OpenStack Snap | Triaged | Critical | Unassigned | | |
Bug Description
Single-node deployment of 2023.1/stable. After a reboot and a 15-minute wait, all mysql-router units are in an error state.
```
Unit Workload Agent Address Ports Message
certificate-
cinder-
cinder-ceph/0* active idle 10.1.106.129
cinder-
cinder-mysql/0* blocked idle 10.1.106.147 failed to recover cluster.
cinder/0* active idle 10.1.106.154
glance-
glance-mysql/0* active idle 10.1.106.188 Primary
glance/0* active idle 10.1.106.130
heat-cfn-
heat-cfn/0* active idle 10.1.106.180
heat-mysql-
heat-mysql/0* blocked idle 10.1.106.135 failed to recover cluster.
heat/0* active idle 10.1.106.163
horizon-
horizon-mysql/0* active idle 10.1.106.139 Primary
horizon/0* active idle 10.1.106.150
keystone-
keystone-mysql/0* blocked idle 10.1.106.152 failed to recover cluster.
keystone/0* waiting idle 10.1.106.149 (workload) Not all relations are ready
neutron-
neutron-mysql/0* active idle 10.1.106.141 Primary
neutron/0* active idle 10.1.106.178
nova-api-
nova-cell-
nova-mysql-
nova-mysql/0* blocked idle 10.1.106.159 failed to recover cluster.
nova/0* blocked idle 10.1.106.191 (container:
ovn-central/0* active idle 10.1.106.186
ovn-relay/0* active idle 10.1.106.184
placement-
placement-mysql/0* blocked idle 10.1.106.143 failed to recover cluster.
placement/0* blocked idle 10.1.106.189 (container:
rabbitmq/0* active idle 10.1.106.176
traefik/0* active idle 10.1.106.187
```
Logs and artifacts - https:/
This is not a one-off issue; I can reproduce it every time.
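To see at a glance which units are affected, the unit lines from the status output above can be grouped by workload status. This is only a rough sketch: the helper name `units_by_workload` is hypothetical (it is not part of Juju), the sample lines are copied from the output above, and it assumes the whitespace-separated column order Unit, Workload, Agent, Address shown there.

```python
from collections import defaultdict

# Sample unit lines copied from the status output above (truncated lines omitted).
STATUS_LINES = """\
cinder-mysql/0* blocked idle 10.1.106.147 failed to recover cluster.
glance-mysql/0* active idle 10.1.106.188 Primary
keystone/0* waiting idle 10.1.106.149 (workload) Not all relations are ready
nova-mysql/0* blocked idle 10.1.106.159 failed to recover cluster.
traefik/0* active idle 10.1.106.187
""".splitlines()


def units_by_workload(lines):
    """Group unit names by workload status (hypothetical helper, not a Juju API)."""
    groups = defaultdict(list)
    for line in lines:
        fields = line.split()
        # Skip the header row and truncated lines that lack the first four columns.
        if len(fields) < 4 or "/" not in fields[0]:
            continue
        unit = fields[0].rstrip("*")  # a trailing '*' marks the leader unit
        groups[fields[1]].append(unit)
    return dict(groups)


if __name__ == "__main__":
    for status, units in units_by_workload(STATUS_LINES).items():
        print(f"{status}: {', '.join(units)}")
```

Run against the full output, this would list the mysql units that report "failed to recover cluster." under `blocked`.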
tags: added: cdo-qa
From the logs, we can see that the issue is not with the MySQL routers but with the MySQL instances.
Added a comment about this: https://github.com/canonical/mysql-k8s-operator/issues/329
Traceback (most recent call last):
  File "/var/lib/juju/agents/unit-nova-mysql-0/charm/lib/charms/mysql/v0/mysql.py", line 1875, in reboot_from_complete_outage
    self._run_mysqlsh_script("\n".join(reboot_from_outage_command))
  File "/var/lib/juju/agents/unit-nova-mysql-0/charm/src/mysql_k8s_helpers.py", line 675, in _run_mysqlsh_script
    raise MySQLClientError(e.stderr)
charms.mysql.v0.mysql.MySQLClientError: Cannot set LC_ALL to locale en_US.UTF-8: No such file or directory
verbose: 2023-10-31T16:23:20Z: Loading startup files...
verbose: 2023-10-31T16:23:20Z: Loading plugins...
verbose: 2023-10-31T16:23:20Z: Connecting to MySQL at: <email address hidden>
verbose: 2023-10-31T16:23:20Z: Shell.connect: tid=95: CONNECTED: nova-mysql-0.nova-mysql-endpoints
verbose: 2023-10-31T16:23:20Z: Connecting to MySQL at: mysql://<email address hidden>:3306?connect-timeout=5000
verbose: 2023-10-31T16:23:20Z: Dba.reboot_cluster_from_complete_outage: tid=96: CONNECTED: nova-mysql-0.nova-mysql-endpoints:3306
verbose: 2023-10-31T16:23:20Z: Connecting to MySQL at: mysql://<email address hidden>:3306?connect-timeout=5000
verbose: 2023-10-31T16:23:20Z: Dba.reboot_cluster_from_complete_outage: tid=97: CONNECTED: nova-mysql-0.nova-mysql-endpoints:3306
verbose: 2023-10-31T16:23:20Z: Group Replication 'group_name' value: 28668da3-7802-11ee-b520-061a84483353
verbose: 2023-10-31T16:23:20Z: Metadata 'group_name' value: 28668da3-7802-11ee-b520-061a84483353
verbose: 2023-10-31T16:23:20Z: Connecting to MySQL at: mysql://<email address hidden>:3306?connect-timeout=5000
verbose: 2023-10-31T16:23:20Z: Dba.reboot_cluster_from_complete_outage: tid=98: CONNECTED: nova-mysql-0.nova-mysql-endpoints.openstack.svc.cluster.local:3306
verbose: 2023-10-31T16:23:20Z: Connecting to MySQL at: mysql://<email address hidden>:3306?connect-timeout=5000
verbose: 2023-10-31T16:23:20Z: Dba.reboot_cluster_from_complete_outage: tid=99: CONNECTED: nova-mysql-0.nova-mysql-endpoints.openstack.svc.cluster.local:3306
No PRIMARY member found for cluster 'cluster-b520e0bc6c2593c08ac554766c08fe32'
verbose: 2023-10-31T16:23:20Z: ClusterSet info: member, primary, not primary_invalidated, not removed from set, primary status: UNKNOWN
Restoring the Cluster 'cluster-b520e0bc6c2593c08ac554766c08fe32' from complete outage...
ERROR: RuntimeError: The current session instance does not belong to the Cluster: 'cluster-b520e0bc6c2593c08ac554766c08fe32'.
Traceback (most recent call last):
  File "<string>", line 2, in <module>
RuntimeError: Dba.reboot_cluster_from_complete_outage: The current session instance does not belong to the Cluster: 'cluster-b520e0bc6c2593c08ac554766c08fe32'.