"openstack server list" cmd not working in 5 mins after active controller reboot
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Chris Friesen |
Bug Description
Brief Description
-----------------
reboot active controller, check server list by using " openstack server list" cmd
Severity
--------
Major
Steps to Reproduce
------------------
as description
TC-name:
Expected Behavior
------------------
Actual Behavior
----------------
Reproducibility
---------------
Reproducible
System Configuration
-------
Two node system
Multi-node system
Lab-name: WCP_99-103
Branch/Pull Time/Commit
-------
stx master as of 2019-05-15_18-01-07
Last Pass
---------
2019-05-07_14-47-37
Timestamp/Logs
--------------
[2019-05-16 17:09:54,981] 262 DEBUG MainThread ssh.send :: Send 'openstack --os-username 'admin' --os-password 'Li69nux*' --os-project-name admin --os-auth-url http://
[2019-05-16 17:09:56,945] 387 DEBUG MainThread ssh.expect :: Output:
+------
| ID | Name | Status | Networks | Image | Flavor |
+------
| 99004abf-
+------
[2019-05-16 17:19:06,438] 262 DEBUG MainThread ssh.send :: Send 'sudo reboot -f'
[2019-05-16 17:25:12,026] 262 DEBUG MainThread ssh.send :: Send 'openstack --os-username 'admin' --os-password 'Li69nux*' --os-project-name admin --os-auth-url http://
[2019-05-16 17:27:39,637] 387 DEBUG MainThread ssh.expect :: Output:
Unable to establish connection to http://
controller-1:~$
[2019-05-16 17:31:28,757] 262 DEBUG MainThread ssh.send :: Send 'openstack --os-username 'admin' --os-password 'Li69nux*' --os-project-name admin --os-auth-url http://
[2019-05-16 17:31:30,579] 387 DEBUG MainThread ssh.expect :: Output:
+------
| ID | Name | Status | Networks | Image | Flavor |
+------
| 8dbec0a9-
| 99004abf-
+------
Test Activity
-------------
Sanity
tags: | added: stx.retestneeded |
Changed in starlingx: | |
status: | New → Triaged |
Changed in starlingx: | |
status: | Triaged → Fix Committed |
Changed in starlingx: | |
status: | Fix Committed → Fix Released |
Changed in starlingx: | |
status: | In Progress → Fix Released |
The mariadb pods are still coming up during that 5 minute window
2019-05- 16T17:30: 04.666 controller-0 kubelet[92421]: info E0516 17:30:04.666795 92421 pod_workers.go:190] Error syncing pod 7ad4aaea- 77f6-11e9- bfac-3cfdfe9f73 30 ("mariadb- server- 1_openstack( 7ad4aaea- 77f6-11e9- bfac-3cfdfe9f73 30)"), skipping: failed to "StartContainer" for "mariadb" with CrashLoopBackOff: "Back-off 1m20s restarting failed container=mariadb pod=mariadb- server- 1_openstack( 7ad4aaea- 77f6-11e9- bfac-3cfdfe9f73 30)"
That is the last mariadb restart failure log in /var/log/daemon.log
I believe it started working after this, which would explain why the calls start to work.