Frozen alarm state after updating alarm-definition (Python API)
Bug #1671565 reported by
jobrs
This bug affects 1 person
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Monasca | In Progress | Undecided | Unassigned | |
Bug Description
We noticed that after updating an alarm expression, the alarm state often freezes and no longer changes.
Example:
Given measurements of 1000.0, 1000.0, ... (every minute, verified with metric-stats command)
The alarm rule
avg(kafka.
fires shortly after the alarm-definition is created.
When the alarm-definition is now changed to
avg(kafka.
the alarm status does not change. When the alarm is deleted, a new alarm object is created and remains in status UNDETERMINED.
After looking at the event queue in Kafka, a first explanation is that the alarm-definition-update messages are invalid. Instead of the actual metric dimensions, a single-element dimension set {'uname': 'notification'} is reported in the metricDefinition.
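To make this kind of check repeatable, here is a minimal Python sketch that compares the dimensions carried in a decoded update message against the dimensions you expect. Only the metricDefinition key and the {'uname': 'notification'} value come from the observation above; the surrounding layout (a dict of sub-alarms keyed by id, a "dimensions" key), the ids, the metric name "example.metric" and the "hostname" dimension are all assumptions for illustration, not the actual Monasca message format.

```python
def find_bad_dimensions(sub_alarms, expected_dimensions):
    """Return the ids of sub-alarms whose reported dimensions differ
    from the expected ones.

    sub_alarms: dict of sub-alarm id -> dict holding a 'metricDefinition'
                (this layout is assumed, not taken from Monasca).
    expected_dimensions: dict of sub-alarm id -> expected dimensions dict.
    """
    bad = []
    for sub_id, sub in sub_alarms.items():
        reported = sub.get("metricDefinition", {}).get("dimensions", {})
        if reported != expected_dimensions.get(sub_id, {}):
            bad.append(sub_id)
    return sorted(bad)


# Hypothetical decoded message content: one sub-alarm carries the bogus
# dimension set described above, the other carries the expected one.
sub_alarms = {
    "sub-1": {
        "metricDefinition": {
            "name": "example.metric",
            "dimensions": {"uname": "notification"},
        }
    },
    "sub-2": {
        "metricDefinition": {
            "name": "example.metric",
            "dimensions": {"hostname": "host1"},
        }
    },
}
expected = {
    "sub-1": {"hostname": "host1"},
    "sub-2": {"hostname": "host1"},
}

bad_ids = find_bad_dimensions(sub_alarms, expected)
print(bad_ids)
```

With the sample data, only the sub-alarm carrying {'uname': 'notification'} is flagged. To use it on real traffic, you would first have to map the actual message structure onto the assumed layout.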
It seems that a typo in the unit test caused an implementation error here. See the attached subalarm-upd-patch.txt.
With the patch, the message on the bus is at least correct and a few error entries disappear from the thresholder logs. It still does not have the desired effect, though: the alarm state does not change.