stuck memory alarm on AIO-SX

Bug #1802535 reported by Chris Friesen
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
StarlingX
Fix Released
Medium
Eric MacDonald

Bug Description

With the Nov 5 load I installed and then ran "config_controller --kubernetes". Afterwards it came up with a critical memory alarm that didn't clear even after memory consumption was significantly reduced.

After stopping/starting collectd the alarm was cleared.

Talking with Eric, he indicated that the problem was related to the following log:

2018-11-08T11:43:55.684 controller-0 collectd[178658]: info alarm notifier host=controller-0 reported no value (Host controller-0, plugin memory type percent (instance used): All data sources are within range again. Current value of "value" is nan.)

He also said that "collectd for some reason responded with a notification string that the code was never integrated to recognize. The "nan" value is unrecognixed by the fm_notifier.py file."

Revision history for this message
Ghada Khalil (gkhalil) wrote :

Targeting stx.2019.03 - stuck alarm; not urgent, but should be investigated and fixed before the release date.

Changed in starlingx:
importance: Undecided → Medium
summary: - stuck memory alarm on AIOSX
+ stuck memory alarm on AIO-SX
Changed in starlingx:
status: New → Triaged
assignee: nobody → Eric MacDonald (rocksolidmtce)
tags: added: stx.2019.03 stx.metal
Ken Young (kenyis)
tags: added: stx.2019.05
removed: stx.2019.03
Ken Young (kenyis)
Changed in starlingx:
assignee: Eric MacDonald (rocksolidmtce) → Cindy Xie (xxie1)
Changed in starlingx:
assignee: Cindy Xie (xxie1) → chen haochuan (martin1982)
Ken Young (kenyis)
tags: added: stx.2.0
removed: stx.2019.05
Revision history for this message
chen haochuan (martin1982) wrote :

still could reproduce with latest image

Revision history for this message
Ghada Khalil (gkhalil) wrote :

Assigning to Eric MacDonald; it turns out he is working on a fix for this issue. Sorry for the inconvenience Chen.

Changed in starlingx:
assignee: chen haochuan (martin1982) → Eric MacDonald (rocksolidmtce)
Changed in starlingx:
status: Triaged → Fix Released
Revision history for this message
Ghada Khalil (gkhalil) wrote :
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.