neutron l3 agent doesn't migrate correctly

Bug #1379272 reported by Stanislav Makar
This bug affects 1 person
Affects              Status    Importance  Assigned to       Milestone
Fuel for OpenStack   Invalid   High        Stanislav Makar
5.1.x                Invalid   High        Stanislav Makar
6.0.x                Invalid   High        Stanislav Makar

Bug Description

I found this on the release 5.1 ISO with patch fuel-5.1_neutron_fix_20141001.patch applied.

1. Create a new environment (CentOS, HA mode)
2. Choose Neutron with VLAN segmentation
3. Add 3 controllers and 1 compute
4. Start deployment; it completed successfully
5. Create an instance for the admin tenant
6. Pause (suspend) the primary controller
7. Wait some time
8. p_neutron-l3-agent migrates to the third controller
9. Resume the primary controller
10. The l3-agent migrates back to the primary node, but the router namespaces are not created there (a namespace check is sketched below)
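
A quick way to confirm the missing namespaces (a sketch only; <router-id> is a placeholder, and the agent ID is the node-1 L3 agent from the listing below):

 # On the primary controller (node-1), list the routers Neutron has bound to
 # its L3 agent:
 neutron router-list-on-l3-agent 985644a8-c766-40a9-bc1f-b6fda8281953
 # ...and check that a qrouter-<router-id> namespace exists for each of them:
 ip netns list | grep qrouter
 # In this case no qrouter namespaces show up on the primary node.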

 neutron agent-list
+--------------------------------------+--------------------+--------------------------+-------+----------------+
| id                                   | agent_type         | host                     | alive | admin_state_up |
+--------------------------------------+--------------------+--------------------------+-------+----------------+
| 230bb3ea-08fa-446f-b42f-056f07279660 | Open vSwitch agent | node-5.test.domain.local | :-)   | True           |
| 3501b3a0-d0a2-41d5-82ea-4b6f2aa1da69 | Metadata agent     | node-1.test.domain.local | xxx   | True           |
| 41722b78-d62c-4a43-bc19-96da19379e92 | Open vSwitch agent | node-3.test.domain.local | :-)   | True           |
| 5de585e0-8078-4412-8cfd-22e3c8a584fd | Metadata agent     | node-2.test.domain.local | :-)   | True           |
| 7544292b-d8fb-4b03-96f4-c205dd02caf7 | Metadata agent     | node-5.test.domain.local | :-)   | True           |
| 8ab79c73-4711-4e70-a9bc-e9d576c38cc9 | L3 agent           | node-5.test.domain.local | :-)   | True           |
| 985644a8-c766-40a9-bc1f-b6fda8281953 | L3 agent           | node-1.test.domain.local | :-)   | True           |
| 9b2172f7-2d60-41a0-8b9c-58e67e75456c | DHCP agent         | node-2.test.domain.local | :-)   | True           |
| c7b859bd-53e2-4498-aa07-04b49c99ebcf | Open vSwitch agent | node-2.test.domain.local | :-)   | True           |
| d0bbf0ef-a489-4460-96f0-0a1fd23e90f1 | Open vSwitch agent | node-4.test.domain.local | :-)   | True           |
| efaa9f45-ecd1-4047-9d0f-9723e83c526e | Open vSwitch agent | node-1.test.domain.local | :-)   | True           |
+--------------------------------------+--------------------+--------------------------+-------+----------------+

As we can see, there are two active L3 agents here; what is the root cause of this?
Rescheduling itself works, as the logs show:
2014-10-09 09:38:53,738 - INFO - Started: /usr/bin/q-agent-cleanup.py --agent=l3 --reschedule --remove-dead --admin-auth-url=http://10.108.7.2:35357/v2.0 --auth-token=508MsThA
2014-10-09 09:38:54,142 - INFO - found alive L3 agent: 8ab79c73-4711-4e70-a9bc-e9d576c38cc9
2014-10-09 09:38:54,142 - INFO - found alive L3 agent: 985644a8-c766-40a9-bc1f-b6fda8281953
2014-10-09 09:38:54,176 - INFO - _reschedule_agent_l3: rescheduling orphaned routers
2014-10-09 09:38:54,176 - INFO - _reschedule_agent_l3: ended rescheduling of orphaned routers
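
If a router were still scheduled to the wrong agent, it could also be moved by hand with the standard neutron client (a sketch only; <router-id> is a placeholder, the agent IDs are the two L3 agents from the listing above):

 # See which L3 agent Neutron believes hosts the router:
 neutron l3-agent-list-hosting-router <router-id>
 # Move it from the node-5 agent to the node-1 (primary) agent:
 neutron l3-agent-router-remove 8ab79c73-4711-4e70-a9bc-e9d576c38cc9 <router-id>
 neutron l3-agent-router-add 985644a8-c766-40a9-bc1f-b6fda8281953 <router-id>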

Revision history for this message
Stanislav Makar (smakar) wrote :
description: updated
Revision history for this message
Stanislav Makar (smakar) wrote :

After some time (~20-30 min) only one L3 agent is still alive, but the namespaces are still not present on the new (primary) node:

neutron agent-list
+--------------------------------------+--------------------+--------------------------+-------+----------------+
| id                                   | agent_type         | host                     | alive | admin_state_up |
+--------------------------------------+--------------------+--------------------------+-------+----------------+
| 230bb3ea-08fa-446f-b42f-056f07279660 | Open vSwitch agent | node-5.test.domain.local | :-)   | True           |
| 3501b3a0-d0a2-41d5-82ea-4b6f2aa1da69 | Metadata agent     | node-1.test.domain.local | :-)   | True           |
| 41722b78-d62c-4a43-bc19-96da19379e92 | Open vSwitch agent | node-3.test.domain.local | :-)   | True           |
| 5de585e0-8078-4412-8cfd-22e3c8a584fd | Metadata agent     | node-2.test.domain.local | :-)   | True           |
| 7544292b-d8fb-4b03-96f4-c205dd02caf7 | Metadata agent     | node-5.test.domain.local | :-)   | True           |
| 8ab79c73-4711-4e70-a9bc-e9d576c38cc9 | L3 agent           | node-5.test.domain.local | xxx   | True           |
| 985644a8-c766-40a9-bc1f-b6fda8281953 | L3 agent           | node-1.test.domain.local | :-)   | True           |
| 9b2172f7-2d60-41a0-8b9c-58e67e75456c | DHCP agent         | node-2.test.domain.local | :-)   | True           |
| c7b859bd-53e2-4498-aa07-04b49c99ebcf | Open vSwitch agent | node-2.test.domain.local | :-)   | True           |
| d0bbf0ef-a489-4460-96f0-0a1fd23e90f1 | Open vSwitch agent | node-4.test.domain.local | :-)   | True           |
| efaa9f45-ecd1-4047-9d0f-9723e83c526e | Open vSwitch agent | node-1.test.domain.local | :-)   | True           |
+--------------------------------------+--------------------+--------------------------+-------+----------------+
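
Since the agent is managed by Pacemaker (p_neutron-l3-agent), one possible way to make it recreate the namespaces would be to restart the resource on the primary node and re-check. This is only a sketch and assumes crmsh is available on the controllers:

 # Assumption: the L3 agent runs under Pacemaker as p_neutron-l3-agent and
 # crmsh is installed on the controller.
 crm resource restart p_neutron-l3-agent
 # Give the agent a moment to resync, then verify that the namespaces appear:
 ip netns list | grep qrouter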

Changed in fuel:
importance: Undecided → High
Stanislav Makar (smakar)
tags: added: fuel-lib-neutron
tags: added: to-be-covered-by-tests
Mike Scherbakov (mihgen)
tags: added: neutron
removed: fuel-lib-neutron
Revision history for this message
Stanislav Makar (smakar) wrote :

This looks like a rare case, as I have not been able to reproduce it again on my new environments.
I have found that it could be connected with the controller VM's clock after resuming: the VM wakes up with the time it had when it was paused, and keeps that offset until ntpd corrects it (service ntpd restart speeds this up). This also causes a "christmas lights" syndrome, where nova services flap between up and down because the controllers have different times.
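
A quick way to check the clock skew across the controllers (a minimal sketch, assuming passwordless ssh between the nodes; the hostnames are the ones from this environment):

 # Compare the wall-clock time on the three controllers:
 for h in node-1 node-2 node-3; do echo -n "$h: "; ssh "$h" date +%s; done
 # On the resumed controller, ask ntpd how far off it thinks the clock is:
 ntpq -pn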

Pausing and resuming VMs is an artificial case that does not correspond to real-world operation,
so I will close this bug.
