Event alarms do not take effect immediately
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Aodh |
New
|
Undecided
|
Unassigned |
Bug Description
When a new event alarm is created, it may take up to event_alarm_
Ideally there would be a hard guarantee that the cache has been invalidated by the time the alarm create call returns to the client, to prevent race conditions entirely. However, IIUC there is no RPC API between aodh-api and aodh-evaluator over which do do this, so a fairly intrusive architecture change would be required.
A partial solution might be for aodh-evaluator to listen for the aodh-api notifications about the alarm state changes, and respond by invalidating its cache before further processing. However, it's less clear that this is guaranteed to eliminate race conditions, even with only a single aodh-evaluator process. When scaling out aodh-evaluator, this solution would only work if the messages are sharded across the different evaluators by tenant, which is probably not the case.
Changed in aodh: | |
assignee: | nobody → Vishakha Agarwal (vishakha.agarwal) |
Changed in aodh: | |
assignee: | Vishakha Agarwal (vishakha.agarwal) → nobody |
Hi Zane,
Are you facing the problem in master too? I am not able to reproduce this issue kindly help.