controller-0 is degraded due to failure of pci-irq-affinity-agent process after reboot - libvirt failed to connect to ovs
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
StarlingX | Invalid | Medium | cheng li | |
Bug Description
Brief Description
-----------------
In a DX system, after the active controller was rebooted, the system failed to recover within 180 seconds, as in LP-1829432, but alarm 200.006 was raised as "controller-0 is degraded due to the failure of its 'pci-irq-
Severity
--------
Major
Steps to Reproduce
------------------
Reboot the active controller.
TC-name: test_evacuate_vms
Expected Behavior
------------------
The 200.006 alarm should be cleared after the system recovers from the reboot.
Actual Behavior
----------------
The 200.006 alarm is not cleared.
Reproducibility
---------------
Intermittent
System Configuration
-------
Two node system
Lab-name:
Branch/Pull Time/Commit
-------
stx master as of 20190604T144018Z
Last Pass
---------
2019-06-03_18-34-53
Timestamp/Logs
--------------
controller-0:~$
[2019-06-07 10:48:47,307] 139 INFO MainThread host_helper.
[2019-06-07 10:48:47,307] 262 DEBUG MainThread ssh.send :: Send 'sudo reboot -f'
[2019-06-07 10:54:02,726] 262 DEBUG MainThread ssh.send :: Send 'fm --os-username 'admin' --os-password 'Li69nux*' --os-project-name admin --os-auth-url http://
[2019-06-07 10:54:04,803] 387 DEBUG MainThread ssh.expect :: Output:
+------
| UUID | Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 08339a9b-
[2019-06-07 11:45:46,732] 262 DEBUG MainThread ssh.send :: Send 'fm --os-username 'admin' --os-password 'Li69nux*' --os-project-name admin --os-auth-url http://
[2019-06-07 11:45:48,315] 387 DEBUG MainThread ssh.expect :: Output:
+------
| UUID | Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 08339a9b-
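The two alarm listings above are what shows the alarm persisting between 10:54 and 11:45. As a minimal sketch of how that check could be automated, the snippet below parses a pipe-delimited alarm table with the column headers shown in the log and reports whether a given alarm ID is still listed. The function names and the sample table are hypothetical: the sample row is placeholder data for illustration, not output from the affected lab, and truncated rows such as the ones in the log are skipped rather than guessed at.

```python
def parse_alarm_table(text):
    """Parse a pipe-delimited alarm table into a list of dicts keyed by header.

    Hypothetical helper: assumes the first complete '|' row is the header.
    """
    rows = []
    header = None
    for line in text.splitlines():
        line = line.strip()
        # Skip border lines ("+------") and truncated rows that lack a closing '|'.
        if not (line.startswith("|") and line.endswith("|")) or len(line) < 2:
            continue
        cells = [cell.strip() for cell in line[1:-1].split("|")]
        if header is None:
            header = cells
            continue
        if len(cells) != len(header):
            continue  # malformed row, ignore it
        rows.append(dict(zip(header, cells)))
    return rows


def alarm_present(text, alarm_id):
    """Return True if any parsed row carries the given Alarm ID."""
    return any(row.get("Alarm ID") == alarm_id for row in parse_alarm_table(text))


# Placeholder table for illustration only (not real lab output).
SAMPLE = """
+------+----------+-------------+-----------+----------+------------+
| UUID | Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------+----------+-------------+-----------+----------+------------+
| 0000 | 200.006  | placeholder | host=x    | major    | n/a        |
+------+----------+-------------+-----------+----------+------------+
"""

if __name__ == "__main__":
    print(alarm_present(SAMPLE, "200.006"))
```

A test could call `alarm_present` on each captured listing and fail if 200.006 is still reported once the recovery window has elapsed.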
Test Activity
-------------
Sanity
description: | updated |
tags: | added: stx.retestneeded |
description: | updated |
Changed in starlingx: | |
assignee: | Forrest Zhao (forrest.zhao) → cheng li (chengli3) |
Changed in starlingx: | |
importance: | Undecided → Medium |
status: | New → Triaged |
summary: |
- 200.006 alarm "controller-0 is degraded due to the failure of its 'pci-irq-affinity-agent' process" after reboot
+ controller-0 is degraded due to failure of pci-irq-affinity-agent process after reboot - libvirt failed to connect to ovs |
tags: | removed: stx.retestneeded |
This sounds similar to https://bugs.launchpad.net/starlingx/+bug/1828877, which was already fixed on 2019-05-15. Assigning to Zhipeng to investigate.