After uncontrolled swact "Alarm id 100.104 File System threshold exceeded" was not seen on new active controller
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
StarlingX |
Fix Released
|
Medium
|
Eric MacDonald |
Bug Description
Bug Description : File System threshold exceeded; 80%, actual 81% major alarm was generated on active controller(
Before the swact below alarm and samples are seen
Data base sample
--------------
time host instance type type_instance value
1549046510149831000 controller-1 root percent_bytes used 46.05934143066406
1549046507409139000 compute-1 root percent_bytes used 27.08646011352539
1549046498961410000 controller-0 root percent_bytes used 81.13135528564453
1549046494668312000 compute-0 root percent_bytes used 27.009244918823242
]$ fm alarm-list
+------
| Alarm ID | Reason Text | Entity ID | Severity | Time Stamp |
+------
| 100.104 | File System threshold exceeded; 80%, actual 81% | host=controller-0. | major | 2019-02-01T18: |
| | | filesystem=/ | | 11:38.834446 |
| | | | | |
+------
After the swact below alarm and samples are seen
File system remains same
controller-
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/sda3 20027216 16270376 2716456 86% /
devtmpfs 49343708 0 49343708 0% /dev
tmpfs 49362692 0 49362692 0% /dev/shm
tmpfs 49362692 11516 49351176 1% /run
tmpfs 49362692 0 49362692 0% /sys/fs/cgroup
tmpfs 1048576 176 1048400 1% /tmp
/dev/mapper/
/dev/mapper/
/dev/mapper/
/dev/mapper/
/dev/sda2 487634 114065 343873 25% /boot
/dev/mapper/
Sample data on data base
--------------
time host instance type type_instance value
1549046510149831000 controller-1 root percent_bytes used 46.05934143066406
1549046507409139000 compute-1 root percent_bytes used 27.08646011352539
1549046498961410000 controller-0 root percent_bytes used 81.13135528564453
1549046494668312000 compute-0 root percent_bytes used 27.009244918823242
Controller-1 fm-manager.log shows that alarm was deleted.
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-
2019-02-01T19:38:
Severity
--------
Major
Steps to Reproduce
------------------
1. Generate the alarm by filling up the disk on active controller
2. sudo reboot active.
3. Verify alarm on new active controller
Expected Behavior
------------------
Alarm showing on new active controller
Actual Behavior
----------------
As per description
Reproducibility
---------------
System Configuration
-------
storage system
Branch/Pull Time/Commit
-------
StarlingX_
Timestamp/Logs
--------------
2019-01-24_20-18-00
description: | updated |
Changed in starlingx: | |
assignee: | nobody → Eric MacDonald (rocksolidmtce) |
Changed in starlingx: | |
status: | Triaged → In Progress |
Changed in starlingx: | |
assignee: | Eric MacDonald (rocksolidmtce) → Cindy Xie (xxie1) |
Changed in starlingx: | |
assignee: | Cindy Xie (xxie1) → chen haochuan (martin1982) |
tags: |
added: stx.2.0 removed: stx.2019.05 |
Changed in starlingx: | |
status: | In Progress → Fix Committed |
status: | Fix Committed → In Progress |
tags: | added: stx.retestneeded |
Changed in starlingx: | |
assignee: | chen haochuan (martin1982) → Eric MacDonald (rocksolidmtce) |
tags: | removed: stx.2.0 stx.metal stx.retestneeded |
tags: | added: stx.2.0 stx.metal |
Marking as release gating - issue related to collectd feature