mtcAgent segfaults on controller-0 initial unlock if lo interface is reset
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Invalid
|
Low
|
Eric MacDonald |
Bug Description
Brief Description
-----------------
This is a follow-up on https:/
In the above LP, the code to configure the SR-IOV interfaces resulted in the full network manifest being re-applied. The side effect of that was that all platform interfaces may be brought down/up. This, in turn, resulted in issues with the system maintenance code including a segfault.
The SR-IOV configuration code has been updated to be more targeted, so the initial trigger has been addressed. This is a low priority follow-up bug to look at the mtcAgent segfault to determine if the code should be improved to address it.
Severity
--------
Minor -- the trigger for the segfault has already been addressed, so the segfault is not likely to be hit again
Steps to Reproduce
------------------
Originally the issue was triggered on the initial unlock of controller-0
Given that the trigger has already been fixed, there are no steps to reproduce other than forcing this code path explicitly
Expected Behavior
------------------
no mtcAgent segfaults are seen when the lo interface is reset before the initial unlock of controller-0
Actual Behavior
----------------
mtcAgent segfaults are reported in the logs
Reproducibility
---------------
N/A -- trigger is removed
System Configuration
-------
One node system
Lab-name: SM-3, wcp-11
Branch/Pull Time/Commit
-------
Load: 2020-03-22_16-04-38
Last Pass
---------
Load: 2020-03-22_04-10-00
Timestamp/Logs
--------------
Logs are attached to https:/
Key notes:
There seems to have been an mtcAgent segfault. The corresponding kern.log segfault log is:
2020-03-
Note the message a few seconds before the segfault:
2020-03-
Not sure if this is related to the crash, but this is coming from the apply_network_
controller-0:~$ cat /var/log/user.log | grep ifcfg-lo
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
2020-03-
Test Activity
-------------
installation
Low priority / not gating any stx release - the trigger for this issue has been addressed, so this will no longer be hit on initial controller-0 unlocks