router ha was split brain when restarting l3 agent docker

Bug #2107323 reported by Khoi
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
neutron
Expired
Undecided
Unassigned

Bug Description

I deploy a multi node cluster with 2 network nodes separately with l3_ha=true and max_l3_agents_per_router=2. It is ok but when I restart neutron_l3_agent on a host which has an active router then 2 routers become standby-standby or active-active.

It is weird because I log in to neutron_l3_agent and see that

/var/lib/neutron/kolla/ha_confs/431e2992-d4c5-42b9-96b3-1c932b6b750f/state have right state on both side but neutron consider ha_state is active-active or standby-standby.

I feel 2 network nodes very easily to split brain. Could I have some ideas on this problem. Thank you.

openstack 2025.1-master
kolla-ansible2025.1 master
ubuntu 22.04

Revision history for this message
Lajos Katona (lajos-katona) wrote :

Related mail thread: https://<email address hidden>/thread/PL4DZKMA5UFSLOFLXD6LBP2WNBHKPS3A/

Revision history for this message
Lajos Katona (lajos-katona) wrote :

Do you perhaps see any related logs in l3-agent log for the split brain routers?

Changed in neutron:
status: New → Incomplete
Revision history for this message
Khoi (khoinh5) wrote : Re: [Bug 2107323] Re: router ha was split brain when restarting l3 agent docker

Hello, I will update soon.
Nguyen Huu Khoi

On Tue, Apr 22, 2025 at 4:21 PM Lajos Katona <email address hidden>
wrote:

> ** Changed in: neutron
> Status: New => Incomplete
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/2107323
>
> Title:
> router ha was split brain when restarting l3 agent docker
>
> Status in neutron:
> Incomplete
>
> Bug description:
> I deploy a multi node cluster with 2 network nodes separately with
> l3_ha=true and max_l3_agents_per_router=2. It is ok but when I restart
> neutron_l3_agent on a host which has an active router then 2 routers
> become standby-standby or active-active.
>
> It is weird because I log in to neutron_l3_agent and see that
>
>
> /var/lib/neutron/kolla/ha_confs/431e2992-d4c5-42b9-96b3-1c932b6b750f/state
> have right state on both side but neutron consider ha_state is active-
> active or standby-standby.
>
> I feel 2 network nodes very easily to split brain. Could I have some
> ideas on this problem. Thank you.
>
> openstack 2025.1-master
> kolla-ansible2025.1 master
> ubuntu 22.04
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/neutron/+bug/2107323/+subscriptions
>
>

Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for neutron because there has been no activity for 60 days.]

Changed in neutron:
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.