dhcp agent reporting state as down during the initial sync
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
neutron |
Fix Released
|
Low
|
Bertrand Lallau |
Bug Description
When dhcp agent is started, neutron agent-list reports its state as dead until the initial sync is complete.
This can lead to unwanted alarms in monitoring systems, especially in large environments where the initial sync may take hours. During this time, systemctl shows that the agent is actually alive while neutron agent-list reports it as down.
Technical details:
If I'm right, this line [0] is the exact point where the initial sync takes place right after the first state report (with start_flag=True) is sent to the server. As it's being done in the same thread, it won't send a second state report until it's done with the sync.
Doing it in a separate thread would let the heartbeat task to continue sending state reports to the server but I don't know whether this have any unwanted side effects.
[0] https:/
Changed in neutron: | |
importance: | Undecided → Wishlist |
Changed in neutron: | |
assignee: | nobody → Miguel Angel Ajo (mangelajo) |
status: | Incomplete → In Progress |
Changed in neutron: | |
assignee: | Miguel Angel Ajo (mangelajo) → Bertrand Lallau (bertrand-lallau) |
@Daniel,
This seems like it may be a nice enhancement. Do you have specific information from large deployers that you could share here, so we can help the drivers to prioritize this enhancement?