Activity log for bug #1818614

Date Who What changed Old value New value Message
2019-03-05 08:03:32 Slawek Kaplonski bug added bug
2019-03-05 16:42:25 Miguel Lavalle tags functional-tests gate-failure l3-ha functional-tests gate-failure l3-dvr-backlog
2019-03-06 12:06:17 Slawek Kaplonski neutron: assignee Slawek Kaplonski (slaweq)
2019-03-08 15:49:07 OpenStack Infra neutron: status Confirmed In Progress
2019-03-14 19:08:36 OpenStack Infra neutron: status In Progress Fix Released
2019-03-15 10:08:22 Bernard Cafarelli tags functional-tests gate-failure l3-dvr-backlog functional-tests gate-failure l3-dvr-backlog neutron-proactive-backport-potential
2019-03-15 10:08:28 Bernard Cafarelli tags functional-tests gate-failure l3-dvr-backlog neutron-proactive-backport-potential functional-tests gate-failure l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential
2019-03-15 16:15:06 OpenStack Infra tags functional-tests gate-failure l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential functional-tests gate-failure in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential
2019-03-15 16:15:18 OpenStack Infra tags functional-tests gate-failure in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential functional-tests gate-failure in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential
2019-03-16 23:19:13 OpenStack Infra tags functional-tests gate-failure in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential
2019-03-25 13:55:26 Edward Hope-Morley description Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last): File "neutron/tests/base.py", line 174, in func return f(self, *args, **kwargs) File "neutron/tests/base.py", line 174, in func return f(self, *args, **kwargs) File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle self._router_lifecycle(enable_ha=True, router_info=router_info) File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle common_utils.wait_until_true(lambda: router.ha_state == 'master') File "neutron/common/utils.py", line 690, in wait_until_true raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22 [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22
2019-03-25 13:55:46 Edward Hope-Morley description [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22 [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] None expected. ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22
2019-03-25 13:55:54 Edward Hope-Morley summary Various L3HA functional tests fails often [SRU] Various L3HA functional tests fails often
2019-03-25 13:57:19 Edward Hope-Morley bug task added cloud-archive
2019-03-25 13:57:38 Edward Hope-Morley nominated for series cloud-archive/pike
2019-03-25 13:57:38 Edward Hope-Morley bug task added cloud-archive/pike
2019-03-25 13:57:38 Edward Hope-Morley nominated for series cloud-archive/rocky
2019-03-25 13:57:38 Edward Hope-Morley bug task added cloud-archive/rocky
2019-03-25 13:57:38 Edward Hope-Morley nominated for series cloud-archive/stein
2019-03-25 13:57:38 Edward Hope-Morley bug task added cloud-archive/stein
2019-03-25 13:57:38 Edward Hope-Morley nominated for series cloud-archive/queens
2019-03-25 13:57:38 Edward Hope-Morley bug task added cloud-archive/queens
2019-03-25 19:43:17 Corey Bryant cloud-archive/stein: status New Fix Released
2019-03-25 19:43:25 Corey Bryant cloud-archive/rocky: status New Triaged
2019-03-25 19:43:29 Corey Bryant cloud-archive/rocky: importance Undecided Critical
2019-03-25 19:43:33 Corey Bryant cloud-archive/rocky: importance Critical High
2019-03-25 19:43:36 Corey Bryant cloud-archive/queens: status New Triaged
2019-03-25 19:43:37 Corey Bryant cloud-archive/queens: importance Undecided High
2019-03-25 19:43:40 Corey Bryant cloud-archive/pike: status New Triaged
2019-03-25 19:43:42 Corey Bryant cloud-archive/pike: importance Undecided High
2019-03-25 19:44:05 Corey Bryant bug task added neutron (Ubuntu)
2019-03-25 19:44:52 Corey Bryant nominated for series Ubuntu Disco
2019-03-25 19:44:52 Corey Bryant bug task added neutron (Ubuntu Disco)
2019-03-25 19:44:52 Corey Bryant nominated for series Ubuntu Cosmic
2019-03-25 19:44:52 Corey Bryant bug task added neutron (Ubuntu Cosmic)
2019-03-25 19:44:52 Corey Bryant nominated for series Ubuntu Bionic
2019-03-25 19:44:52 Corey Bryant bug task added neutron (Ubuntu Bionic)
2019-03-25 19:45:06 Corey Bryant neutron (Ubuntu Bionic): importance Undecided High
2019-03-25 19:45:06 Corey Bryant neutron (Ubuntu Bionic): status New Triaged
2019-03-25 19:45:39 Corey Bryant neutron (Ubuntu Cosmic): importance Undecided High
2019-03-25 19:45:39 Corey Bryant neutron (Ubuntu Cosmic): status New Triaged
2019-03-25 19:46:03 Corey Bryant neutron (Ubuntu Disco): status New Fix Released
2019-03-25 19:46:19 Corey Bryant neutron (Ubuntu Disco): importance Undecided High
2019-03-25 19:46:31 Corey Bryant cloud-archive/stein: importance Undecided High
2019-03-26 13:32:36 Corey Bryant bug added subscriber Ubuntu Stable Release Updates Team
2019-03-26 13:49:33 Corey Bryant cloud-archive/pike: status Triaged Fix Committed
2019-03-26 13:49:36 Corey Bryant tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-pike-needed
2019-03-26 16:01:40 Corey Bryant description [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] None expected. ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22 [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] These patches have already landed in corresponding upstream branches and therefore have undergone reviews + unit and functional testing upstream, therefore regression potential is expected to be low. ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22
2019-03-26 16:06:02 Brian Murray neutron (Ubuntu Cosmic): status Triaged Fix Committed
2019-03-26 16:06:07 Brian Murray bug added subscriber SRU Verification
2019-03-26 16:06:11 Brian Murray tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-pike-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-cosmic verification-pike-needed
2019-03-26 16:55:20 Brian Murray neutron (Ubuntu Bionic): status Triaged Fix Committed
2019-03-26 16:55:26 Brian Murray tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-cosmic verification-pike-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed
2019-03-26 20:11:15 Corey Bryant cloud-archive/rocky: status Triaged Fix Committed
2019-03-26 20:11:23 Corey Bryant tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-rocky-needed
2019-04-01 13:46:51 Corey Bryant cloud-archive/queens: status Triaged Fix Committed
2019-04-01 13:46:53 Corey Bryant tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-rocky-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-04 09:26:40 Edward Hope-Morley tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-04 12:00:49 Edward Hope-Morley tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-04 12:12:25 Magnus Bergman bug added subscriber Magnus Bergman
2019-04-08 17:21:40 OpenStack Infra tags functional-tests gate-failure in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-08 22:07:07 OpenStack Infra tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-12 07:48:32 Tobias Urdin bug added subscriber Tobias Urdin
2019-04-15 13:45:38 Łukasz Zemczak tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-failed-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-04-15 20:23:28 Edward Hope-Morley description [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid <<<<<<<<<<< this should return nothing now pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted) [Regression Potential] These patches have already landed in corresponding upstream branches and therefore have undergone reviews + unit and functional testing upstream, therefore regression potential is expected to be low. ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22 [Impact] Need to get this added to the Ubuntu packages in order to safeguard against missed VRRP transitions due to ip -o monitor not running at the time the transition occurs. We have seen many cases in the fields where neutron routers end up as active on multiple l3 agents (via neutron api) which leads to a number of problems. [Test Case] * deploy Openstack (any version that supports l3ha) * create HA router with max-l3-agents=2 * check neutron l3-agent-list-hosting-router for master location * on both hosts that are running the l3-agent do pid=`pgrep -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID"` ps -f --ppid $pid pkill -f "/var/lib/neutron/ha_confs/$ROUTER_UUID/keepalived.conf" pkill -f "/usr/bin/neutron-keepalived-state-change --router_id=$ROUTER_UUID" ps -f --ppid $pid # <<<<<<<<<<< this should return nothing now * without this patch you should now see both agents reporting the router as "active" * with the patch this should not happen (once neutron-keepalived-state-change has been restarted by neutron-l3-agent) [Regression Potential] These patches have already landed in corresponding upstream branches and therefore have undergone reviews + unit and functional testing upstream, therefore regression potential is expected to be low. ==================================================================== Recently many L3 HA related functional tests are failing. The common thing in all those errors is fact that it fails when waiting for l3 ha router to become master. Example stack trace: ft2.12: neutron.tests.functional.agent.l3.test_ha_router.LinuxBridgeL3HATestCase.test_ha_router_lifecycle_StringException: Traceback (most recent call last):   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/base.py", line 174, in func     return f(self, *args, **kwargs)   File "neutron/tests/functional/agent/l3/test_ha_router.py", line 81, in test_ha_router_lifecycle     self._router_lifecycle(enable_ha=True, router_info=router_info)   File "neutron/tests/functional/agent/l3/framework.py", line 274, in _router_lifecycle     common_utils.wait_until_true(lambda: router.ha_state == 'master')   File "neutron/common/utils.py", line 690, in wait_until_true     raise WaitTimeout(_("Timed out after %d seconds") % timeout) neutron.common.utils.WaitTimeout: Timed out after 60 seconds Example failure: http://logs.openstack.org/79/633979/21/check/neutron-functional-python27/ce7ef07/logs/testr_results.html.gz Logstash query: http://logstash.openstack.org/#dashboard/file/logstash.json?query=message%3A%5C%22ha_state%20%3D%3D%20'master')%5C%22
2019-04-15 20:26:10 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-needed verification-needed-bionic verification-needed-cosmic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed
2019-05-01 01:50:37 Chris Halse Rogers removed subscriber Ubuntu Stable Release Updates Team
2019-05-01 02:00:46 Launchpad Janitor neutron (Ubuntu Cosmic): status Fix Committed Fix Released
2019-05-07 14:42:21 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-needed functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-done
2019-05-07 16:25:10 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-cosmic verification-needed verification-needed-bionic verification-pike-needed verification-queens-needed verification-rocky-done functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-needed verification-queens-needed verification-rocky-done
2019-05-07 18:01:04 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-needed verification-queens-needed verification-rocky-done functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-needed verification-queens-done verification-rocky-done
2019-05-08 10:39:46 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-needed verification-queens-done verification-rocky-done functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-done verification-queens-done verification-rocky-done
2019-05-08 10:43:23 Edward Hope-Morley tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done-bionic verification-done-cosmic verification-needed verification-pike-done verification-queens-done verification-rocky-done functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done verification-done-bionic verification-done-cosmic verification-pike-done verification-queens-done verification-rocky-done
2019-05-09 09:59:28 Launchpad Janitor neutron (Ubuntu Bionic): status Fix Committed Fix Released
2019-05-13 19:15:30 Corey Bryant cloud-archive/rocky: status Fix Committed Fix Released
2019-05-13 19:19:16 Corey Bryant cloud-archive/pike: status Fix Committed Fix Released
2019-05-13 19:32:45 Corey Bryant cloud-archive/queens: status Fix Committed Fix Released
2019-09-13 10:42:36 Slawek Kaplonski tags functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog neutron-easy-proactive-backport-potential neutron-proactive-backport-potential sts-sru-needed verification-done verification-done-bionic verification-done-cosmic verification-pike-done verification-queens-done verification-rocky-done functional-tests gate-failure in-stable-ocata in-stable-pike in-stable-queens in-stable-rocky in-stable-stein l3-dvr-backlog sts-sru-needed verification-done verification-done-bionic verification-done-cosmic verification-pike-done verification-queens-done verification-rocky-done