AIO: PCI-IRQ-Affinity-Agent repeated restarts due to pmon
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX | Fix Released | High | zhipeng liu | | |
Bug Description
Brief Description
-----------------
On AIO-DX system, see intermittent restarts of PCI-IRQ-
2019-07-
. .
2019-07-
2019-07-
Eric MacDonald and I investigated this on R430_3_4 (July 10-15 timeframe) for kubelet.service and PCI-IRQ-
I had corrected the problem by adding the following config setting in stx-config:
diff --git a/puppet-
index 5ad4466..ce6832d 100644
--- a/puppet-
+++ b/puppet-
@@ -13,3 +13,4 @@ restarts = 3 ; restarts before error assertion
startuptime = 5 ; seconds to wait after process start
interval = 5 ; number of seconds to wait between restarts
debounce = 20 ; number of seconds to wait before degrade clear
+subfunction = last-config ; run monitor only after last config is run
I had also tested the following one-line change in stx-integ, but it was not delivered with the kubelet affinity changes, so it is still missing.
diff --git a/utilities/
index 544cee0..a40a13c 100644
--- a/utilities/
+++ b/utilities/
@@ -7,3 +7,4 @@ severity = major ; minor, major, critical
restarts = 3 ; restarts before error assertion
interval = 5 ; number of seconds to wait between restarts
debounce = 20 ; number of seconds to wait before degrade clear
+subfunction = last-config
After this line is added, expect to see the following in /var/log/pmond.log:
2019-07-
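As a quick way to confirm that a pmon conf file carries the setting discussed above, here is a minimal sketch in Python. It assumes the conf file is INI-style with `;` inline comments (as the diff context suggests) and that the keys live under a `[process]` section; the section name, the `has_last_config` helper, and the sample text are illustrative assumptions, not taken from the actual files in stx-config or stx-integ.

```python
import configparser


def has_last_config(conf_text, section="process"):
    """Return True if the conf text sets subfunction = last-config.

    The section name "process" is an assumption for illustration.
    """
    parser = configparser.ConfigParser(
        inline_comment_prefixes=(";",),  # strip trailing "; comment" text
        interpolation=None,              # treat any "%" in values literally
    )
    parser.read_string(conf_text)
    if not parser.has_section(section):
        return False
    value = parser.get(section, "subfunction", fallback="")
    return value.strip() == "last-config"


# Hypothetical sample modelled on the diff context shown in this report.
SAMPLE = """\
[process]
restarts = 3 ; restarts before error assertion
interval = 5 ; number of seconds to wait between restarts
debounce = 20 ; number of seconds to wait before degrade clear
subfunction = last-config ; run monitor only after last config is run
"""

if __name__ == "__main__":
    print(has_last_config(SAMPLE))
```

A check like this only confirms that the setting is present in the file; whether pmon actually defers monitoring still has to be verified from /var/log/pmond.log on the target system.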
Severity
--------
Major: System/Feature is usable but degraded.
tags: | added: in-r-stx20 |
PCI Interrupt affinity handling was added as a SB in stx.2.0. Marking this as high priority/stx.2.0 gating as the process is restarting. Requesting Zhipeng address this issue.