All-in-one: pci-irq-affinity-agent fails to start - controller-0 stuck in degraded state after initial unlock
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
High
|
zhipeng liu |
Bug Description
Brief Description
-----------------
pci-irq-
pmond reports the following (continuously):
/var/log/pmond.log (snippet)
2019-05-
2019-05-
2019-05-
2019-05-
controller-0 stuck in degraded state:
[wrsroot@
+----+-
| id | hostname | personality | administrative | operational | availability |
+----+-
| 1 | controller-0 | controller | unlocked | enabled | degraded |
+----+-
(Alarm snippet)
fm alarm-list
[wrsroot@
+------
| Alarm | Reason Text | Entity ID | Severity | Time Stamp |
| ID | | | | |
+------
| 200. | controller-0 is degraded due to the failure of its 'pci-irq-
| 006 | process. Auto recovery of this major process is in progress. | affinity-agent | | 40:46.408005 |
| | | | | |
+------
[wrsroot@
Mon May 13 18:43:31 UTC 2019
The issue is possibly caused by:
https:/
Severity
--------
Major: System cannot be fully installed
Steps to Reproduce
------------------
Install controller-0 as All-in-one dublex mode
Expected Behavior
------------------
controller-0 should not be in degraded state after initial unlock
Actual Behavior
----------------
pci-irq-
controller-0 never gets out of degraded state
Reproducibility
---------------
100% reproducible on build: 20190512T233000Z
System Configuration
-------
1+1 system (AIO-DX)
Internal lab name: cgcs-wildcat-69-70
Branch/Pull Time/Commit
-------
BUILD_ID=
JOB="STX_
<email address hidden>"
Last Pass
---------
20190508T233000Z
Timestamp/Logs
--------------
Attached
Test Activity
-------------
Lab install
tags: | added: stx.retestneeded |
tags: |
added: stx.metal removed: stx.integ |
summary: |
- pci-irq-affinity-agent fails to start - controller-0 stuck in degraded - state after initial unlock + All-in-one: pci-irq-affinity-agent fails to start - controller-0 stuck + in degraded state after initial unlock |
tags: |
added: stx.integ removed: stx.metal |
tags: | added: stx.sanity |
As per above, it is suspected that this issue is introduced by: https:/ /review. opendev. org/#/c/ 640264/
Assigning to Zhipheng Liu to investigate