Bug #1816842 “Containers: openstack pods failed after force rebo...” : Bugs : StarlingX

Revision history for this message

Ghada Khalil (gkhalil) wrote on 2019-02-22:

#1

Marking as release gating; issue impacts container env. Medium priority as issue is intermittent

Changed in starlingx:
importance:	Undecided → Medium
assignee:	nobody → Chris Friesen (cbf123)
status:	New → Triaged
tags:	added: stx.2019.05 stx.containers

Ken Young (kenyis) on 2019-04-05

tags:

added: stx.2.0
removed: stx.2019.05

Ghada Khalil (gkhalil) on 2019-04-09

tags:

added: stx.retestneeded

Revision history for this message

Bart Wensley (bartwensley) wrote on 2019-04-09:

#2

Download full text (3.6 KiB)

The force reboot of controller-0 happened here:
[2019-02-20 01:40:28,952] 139 INFO MainThread host_helper.reboot_hosts:: Rebooting active controller: controller-0
[2019-02-20 01:40:28,952] 262 DEBUG MainThread ssh.send :: Send 'sudo reboot -f'

It appears that all the openstack pods were restarted. I guess this is because etcd goes away when controller-0 is killed. It looks to me like the problem is that the mariadb pods did not come up properly:
mariadb-ingress-9d475c8c7-46kgs 0/1 Running 0 16h 172.16.1.77 controller-1 <none>
mariadb-ingress-9d475c8c7-7td6w 0/1 Running 0 16h 172.16.1.76 controller-1 <none>
mariadb-ingress-error-pages-6b55f4468c-nhkvv 1/1 Running 0 16h 172.16.1.78 controller-1 <none>
mariadb-server-0 0/1 Running 0 16h 172.16.0.201 controller-0 <none>
mariadb-server-1 0/1 Running 0 16h 172.16.1.89 controller-1 <none>

The garbd seems to be OK:
osh-openstack-garbd-garbd-5744f5f85-cjhrb 1/1 Running 0 18h 172.16.2.2 compute-0 <none>

The mariadb-server-0 pod seems to be stuck in a loop - the following logs are repeating forever:
2019-02-20 17:59:24,021 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12
2019-02-20 17:59:24,022 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh
2019-02-20 17:59:24,027 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1
2019-02-20 17:59:24,027 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-0
2019-02-20 17:59:27,372 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap

The mariadb-server-1 pod stops generating logs shortly after it comes up:
2019-02-20 01:50:51,516 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12
2019-02-20 01:50:51,516 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh
2019-02-20 01:50:51,521 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-1
2019-02-20 01:50:51,521 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-0
2019-02-20 01:50:51,545 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap
2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12
2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh
2019-02-20 01:51:01,568 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap

The garbd pod can't seem to connect to either of the mariadb-servers:
2019-02-20 18:03:27.728 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S
2019-02-20 18:03:30.228 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S
2019-02-20 18:03:32.729 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S
2019-02-20 18...

The force reboot of controller-0 happened here: 
[2019-02-20 01:40:28,952] 139 INFO MainThread host_helper.reboot_hosts:: Rebooting active controller: controller-0 
[2019-02-20 01:40:28,952] 262 DEBUG MainThread ssh.send :: Send 'sudo reboot -f'

It appears that all the openstack pods were restarted. I guess this is because etcd goes away when controller-0 is killed. It looks to me like the problem is that the mariadb pods did not come up properly: 
mariadb-ingress-9d475c8c7-46kgs 0/1 Running 0 16h 172.16.1.77 controller-1 <none> 
mariadb-ingress-9d475c8c7-7td6w 0/1 Running 0 16h 172.16.1.76 controller-1 <none> 
mariadb-ingress-error-pages-6b55f4468c-nhkvv 1/1 Running 0 16h 172.16.1.78 controller-1 <none> 
mariadb-server-0 0/1 Running 0 16h 172.16.0.201 controller-0 <none> 
mariadb-server-1 0/1 Running 0 16h 172.16.1.89 controller-1 <none>

The garbd seems to be OK: 
osh-openstack-garbd-garbd-5744f5f85-cjhrb 1/1 Running 0 18h 172.16.2.2 compute-0 <none>

The mariadb-server-0 pod seems to be stuck in a loop - the following logs are repeating forever: 
2019-02-20 17:59:24,021 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12 
2019-02-20 17:59:24,022 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh 
2019-02-20 17:59:24,027 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1 
2019-02-20 17:59:24,027 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-0 
2019-02-20 17:59:27,372 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap

The mariadb-server-1 pod stops generating logs shortly after it comes up: 
2019-02-20 01:50:51,516 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12 
2019-02-20 01:50:51,516 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh 
2019-02-20 01:50:51,521 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-1 
2019-02-20 01:50:51,521 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-0 
2019-02-20 01:50:51,545 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap 
2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12 
2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh 
2019-02-20 01:51:01,568 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap

The garbd pod can't seem to connect to either of the mariadb-servers: 
2019-02-20 18:03:27.728 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S 
2019-02-20 18:03:30.228 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S 
2019-02-20 18:03:32.729 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S 
2019-02-20 18:03:35.229 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S 
2019-02-20 18:03:37.729 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S 
2019-02-20 18:03:40.229 INFO: (f14c4149, 'tcp://0.0.0.0:4567&#39;) connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S

Revision history for this message

Chris Friesen (cbf123) wrote on 2019-04-09:

#3

The issue with mariadb-server-0 is a known issue. The problem is related to the design of the openstack-helm chart. Basically they're prioritizing data integrity over coming back into service, so when we're starting up a new mariadb cluster (i.e. there is no running server to join) then as long as one of the mariadb-server pods is not updating the state configmap the other mariadb-server pods will not come up. We plan on adding a monitor to deal with this case after a certain amount of time.

The issue with mariadb-server-1 is unknown at this point...need to figure out why it stopped updating the state configmap. Is the above the full logs from mariadb-server-1?

Revision history for this message

Chris Friesen (cbf123) wrote on 2019-04-09:

#4

I'm not surprised that garbd can't connect, as neither of the mysqld daemons are actually running...we're still in the startup code in the mariadb-server pods.

Revision history for this message

Chris Friesen (cbf123) wrote on 2019-06-10:

#5

Download full text (4.6 KiB)

Looking at the logs, we see the mariadb-server-1 logs ending here:

{"log":"2019-02-20 01:50:51,545 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:50:51.54616385Z"}
{"log":"2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:01.531861929Z"}
{"log":"2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:01.531891753Z"}
{"log":"2019-02-20 01:51:01,568 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:01.568883913Z"}

The mariadb-server-0 logs show another story:

{"log":"2019-02-20 01:50:58,967 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:50:58.967962473Z"}
{"log":"2019-02-20 01:51:08,978 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 1 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:08.978298167Z"}
{"log":"2019-02-20 01:51:08,978 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:08.97833839Z"}
{"log":"2019-02-20 01:51:08,979 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:08.979132811Z"}
{"log":"2019-02-20 01:51:21,870 WARNING Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', error(104, 'Connection reset by peer'))': /api/v1/namespaces/openst
ack/configmaps/osh-openstack-mariadb-mariadb-state\n","stream":"stderr","time":"2019-02-20T01:51:21.870759801Z"}
{"log":"2019-02-20 01:51:21,870 WARNING Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', error(104, 'Connection reset by peer'))': /api/v1/namespaces/openst
ack/configmaps/osh-openstack-mariadb-mariadb-state\n","stream":"stderr","time":"2019-02-20T01:51:21.871024039Z"}
{"log":"2019-02-20 01:51:21,882 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1\n","stream":"stderr","time":"2019-02-20T01:51:21.883047989Z"}
{"log":"2019-02-20 01:51:21,883 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:51:21.88352455Z"}
{"log":"2019-02-20 01:51:31,893 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:31.893865638Z"}
{"log":"2019-02-20 01:51:31,893 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:31.893908681Z"}
{"log":"2019-02-20 01:51:31,898 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1\n","stream":"stderr","time...

Looking at the logs, we see the mariadb-server-1 logs ending here:

{"log":"2019-02-20 01:50:51,545 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:50:51.54616385Z"}
{"log":"2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:01.531861929Z"}
{"log":"2019-02-20 01:51:01,531 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:01.531891753Z"}
{"log":"2019-02-20 01:51:01,568 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:01.568883913Z"}

The mariadb-server-0 logs show another story:

{"log":"2019-02-20 01:50:58,967 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:50:58.967962473Z"}
{"log":"2019-02-20 01:51:08,978 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 1 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:08.978298167Z"}
{"log":"2019-02-20 01:51:08,978 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:08.97833839Z"}
{"log":"2019-02-20 01:51:08,979 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:08.979132811Z"}
{"log":"2019-02-20 01:51:21,870 WARNING Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', error(104, 'Connection reset by peer'))': /api/v1/namespaces/openst
ack/configmaps/osh-openstack-mariadb-mariadb-state\n","stream":"stderr","time":"2019-02-20T01:51:21.870759801Z"}
{"log":"2019-02-20 01:51:21,870 WARNING Retrying (Retry(total=2, connect=None, read=None, redirect=None, status=None)) after connection broken by 'ProtocolError('Connection aborted.', error(104, 'Connection reset by peer'))': /api/v1/namespaces/openst
ack/configmaps/osh-openstack-mariadb-mariadb-state\n","stream":"stderr","time":"2019-02-20T01:51:21.871024039Z"}
{"log":"2019-02-20 01:51:21,882 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1\n","stream":"stderr","time":"2019-02-20T01:51:21.883047989Z"}
{"log":"2019-02-20 01:51:21,883 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:51:21.88352455Z"}
{"log":"2019-02-20 01:51:31,893 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:31.893865638Z"}
{"log":"2019-02-20 01:51:31,893 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:31.893908681Z"}
{"log":"2019-02-20 01:51:31,898 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1\n","stream":"stderr","time":"2019-02-20T01:51:31.898558239Z"}
{"log":"2019-02-20 01:51:31,898 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:51:31.898631348Z"}
{"log":"2019-02-20 01:51:31,909 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:31.909790831Z"}
{"log":"2019-02-20 01:51:41,908 - OpenStack-Helm Mariadb - INFO - Cluster info has been uptodate 0 times out of the required 12\n","stream":"stderr","time":"2019-02-20T01:51:41.908430614Z"}
{"log":"2019-02-20 01:51:41,908 - OpenStack-Helm Mariadb - INFO - Checking to see if cluster data is fresh\n","stream":"stderr","time":"2019-02-20T01:51:41.908466489Z"}
{"log":"2019-02-20 01:51:41,912 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is too old to make a decision for node mariadb-server-1\n","stream":"stderr","time":"2019-02-20T01:51:41.912964648Z"}
{"log":"2019-02-20 01:51:41,912 - OpenStack-Helm Mariadb - INFO - The data we have from the cluster is ok for node mariadb-server-0\n","stream":"stderr","time":"2019-02-20T01:51:41.913080719Z"}
{"log":"2019-02-20 01:51:41,931 - OpenStack-Helm Mariadb - INFO - Updating grastate configmap\n","stream":"stderr","time":"2019-02-20T01:51:41.931925188Z"}

We see a broken connection at 2019-02-20 01:51:21, after which only mariadb-server-0 is updating its state and mariadb-server-1 appears hung.

Revision history for this message

Chris Friesen (cbf123) wrote on 2019-06-20:

#6

Looking at the garbd logs on compute-0, we see this message which is odd given that controller-1 was rebooted:

{"log":"2019-02-20 01:23:20.981 WARN: Protocol violation. JOIN message sender 1.0 (mariadb-server-0.mariadb-discovery.openstack.svc.cluster.local) is not in state transfer (SYNCED). Message ignored.\n","stream":"stderr","time":"2019-02-20T01:23:20.98203438Z"}

Because of this, garbd does not think it's part of the primary cluster anymore, and so it doesn't allow mariadb-server-1 to rejoin the cluster:

{"log":"2019-02-20 01:23:22.590 WARN: Rejecting JOIN message from 0.0 (mariadb-server-1.mariadb-discovery.openstack.svc.cluster.local): new State Transfer required.\n","stream":"stderr","time":"2019-02-20T01:23:22.590158149Z"}
{"log":"2019-02-20 01:23:22.590 WARN: SYNC message from non-JOINED 0.0 (mariadb-server-1.mariadb-discovery.openstack.svc.cluster.local, PRIMARY). Ignored.\n","stream":"stderr","time":"2019-02-20T01:23:22.59054396Z"}

We're then left with garbd running but not thinking that either of the mariadb server pods are legit:

{"log":"2019-02-20 01:40:37.392 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:37.392957049Z"}
{"log":"2019-02-20 01:40:37.392 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:37.393031422Z"}
{"log":"2019-02-20 01:40:42.393 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:42.393191002Z"}
{"log":"2019-02-20 01:40:42.393 INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:42.393237831Z"}

These messages continue until 2019-02-20 02:56:31 without any indication that it sees the new mariadb-server pods start up around 01:49:57 or 01:50.54. I'm suspicious that maybe something caused those pods to be deleted or restarted with different IP addresses and garbd was still watching for the old ones--this needs to be tested, I have no evidence yet.

Looking at the garbd logs on compute-0, we see this message which is odd given that controller-1 was rebooted:

{"log":"2019-02-20 01:23:20.981  WARN: Protocol violation. JOIN message sender 1.0 (mariadb-server-0.mariadb-discovery.openstack.svc.cluster.local) is not in state transfer (SYNCED). Message ignored.\n","stream":"stderr","time":"2019-02-20T01:23:20.98203438Z"}

Because of this, garbd does not think it's part of the primary cluster anymore, and so it doesn't allow mariadb-server-1 to rejoin the cluster:

{"log":"2019-02-20 01:23:22.590  WARN: Rejecting JOIN message from 0.0 (mariadb-server-1.mariadb-discovery.openstack.svc.cluster.local): new State Transfer required.\n","stream":"stderr","time":"2019-02-20T01:23:22.590158149Z"}
{"log":"2019-02-20 01:23:22.590  WARN: SYNC message from non-JOINED 0.0 (mariadb-server-1.mariadb-discovery.openstack.svc.cluster.local, PRIMARY). Ignored.\n","stream":"stderr","time":"2019-02-20T01:23:22.59054396Z"}

We're then left with garbd running but not thinking that either of the mariadb server pods are legit:

{"log":"2019-02-20 01:40:37.392  INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:37.392957049Z"}
{"log":"2019-02-20 01:40:37.392  INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:37.393031422Z"}
{"log":"2019-02-20 01:40:42.393  INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.21:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:42.393191002Z"}
{"log":"2019-02-20 01:40:42.393  INFO: (f14c4149, 'tcp://0.0.0.0:4567') connection to peer 00000000 with addr tcp://172.16.0.175:4567 timed out, no messages seen in PT3S\n","stream":"stderr","time":"2019-02-20T01:40:42.393237831Z"}

These messages continue until 2019-02-20 02:56:31 without any indication that it sees the new mariadb-server pods start up around 01:49:57 or 01:50.54.  I'm suspicious that maybe something caused those pods to be deleted or restarted with different IP addresses and garbd was still watching for the old ones--this needs to be tested, I have no evidence yet.

Maria Guadalupe Perez Ibara (maria-gp) on 2019-07-25

tags:

added: stx.regression

Revision history for this message

Chris Friesen (cbf123) wrote on 2019-08-13:

#7

While there are some unresolved questions lingering about this issue, after making some changes in this area to deal with other issues the original problem doesn't seem to be hitting us anymore.

I'm going to close the issue for now, if it shows up again we'll work the new problem.

Changed in starlingx:
status:	Triaged → In Progress
status:	In Progress → Fix Committed

Frank Miller (sensfan22) on 2019-08-14

Changed in starlingx:
status:	Fix Committed → Fix Released

Revision history for this message

Yang Liu (yliu12) wrote on 2019-08-20:

#8

We have not seen this issue in recent sanity runs. Closing.

tags:

removed: stx.retestneeded

liuqing (ml169807) on 2021-03-06

description:

updated

StarlingX

Containers: openstack pods failed after force rebooting active controller

Bug Description

Other bug subscribers

Remote bug watches