stx-openstack: nova-compute service and hypervisor stuck in an enable/disable loop
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Luan Nunes Utimura |
Bug Description
Brief Description
-----------------
After applying stx-openstack and performing a lock/unlock on controller-0, it has been observed that both nova-compute service and hypervisor are stuck in an enable/disable loop.
Severity
--------
Major.
Steps to Reproduce
------------------
On AIO-SX:
1) Apply stx-openstack;
2) Lock/unlock controller-0;
3) After the unlock, verify the intermittency by:
Watching the compute services and hypervisors:
$ watch -d 'openstack compute service list --long; openstack hypervisor list --long'
Following the NFV-related logs:
$ tail -f /var/log/nfv*.log
Expected Behavior
------------------
After the unlock, both nova-compute service and hypervisor are up and running.
Actual Behavior
----------------
After the unlock, both nova-compute service and hypervisor are stuck in an enable/disable loop, changing state every 1 minute.
Reproducibility
---------------
Reproducible on an AIO-SX.
System Configuration
-------
AIO-SX.
Branch/Pull Time/Commit
-------
Master.
Last Pass
---------
N/A.
Timestamp/Logs
--------------
Output of `fm event-list`:
+------
| Time Stamp | State | Event | Reason Text | Entity Instance ID | Severity |
| | | Log | | | |
| | | ID | | | |
+------
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now unlocked-enabled | host=controller
| 05:02:33. | | 001 | | 79a504d8-
| 120261 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-disabled | host=controller
| 05:01:17. | | 001 | | 79a504d8-
| 480581 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-enabled | host=controller
| 05:01:11. | | 001 | | 79a504d8-
| 290467 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now unlocked-enabled | host=controller
| 04:59:50. | | 001 | | 79a504d8-
| 014403 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-disabled | host=controller
| 04:58:34. | | 001 | | 79a504d8-
| 920412 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-enabled | host=controller
| 04:58:29. | | 001 | | 79a504d8-
| 131584 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now unlocked-enabled | host=controller
| 04:57:08. | | 001 | | 79a504d8-
| 139377 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-disabled | host=controller
| 04:55:53. | | 001 | | 79a504d8-
| 755190 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now locked-enabled | host=controller
| 04:55:47. | | 001 | | 79a504d8-
| 549136 | | | | | |
| | | | | | |
| 2023-04-02T | log | 275. | Host controller-0 hypervisor is now unlocked-enabled | host=controller
| 04:54:27. | | 001 | | 79a504d8-
| 669352 | | | | | |
| | | | | | |
+------
Test Activity
-------------
Developer Testing.
Workaround
----------
The intermittency seems to stop after disabling the guest plugin for VIM:
$ sudo sed -i 's/guest_
$ sudo sm-restart-safe service vim; sudo sm-restart-safe service vim-api
Changed in starlingx: | |
assignee: | nobody → Luan Nunes Utimura (lutimura) |
Changed in starlingx: | |
importance: | Undecided → Medium |
tags: | added: stx.9.0 stx.config stx.nfv |
Fix proposed to branch: master /review. opendev. org/c/starlingx /stx-puppet/ +/879359
Review: https:/