Activity log for bug #1918145

Date Who What changed Old value New value Message
2021-03-08 14:47:00 LEDUC Florian bug added bug
2021-03-08 14:55:42 LEDUC Florian description * Summary: Slownesses on neutron API with many RBAC rules * High level description: Sharing several networks or security groups to project drastically increase API response time on some routes (/networks or /server/detail). For quite some time we have observing that reponse times are increasing (slowly fur surely) on /networks calls. We have increased the number of Neutron workers, but in vain. Lately, we're observing that it's getting worse (reponse time form 5 to 370 seconds). We discarded possible bottlenecks one by one (our service endpoint performance, neutron API configuration, etc). But we have found that some calls in the DB takes a lot of time. It seems they are stuck in the mariadb database (10.3.10). So we have captured a slow queries in mysql. An example of for /server/detail: --------------------------------- http://paste.openstack.org/show/803334/ We can see that there are more than 2 millions of rows examinated, and around 1657 returned. An example of for /networks: ---------------------------- http://paste.openstack.org/show/803337/ Rows_sent: 517 Rows_examined: 223519 * Pre-conditions: Database tables size: table: - networkrbacs 16928 rows - securitygrouprbacs 1691 rows - keystone.project 1713 rows Control plane nodes are shared with some others services: - RMQ - mariadb - Openstack APIs - DHCP agents It seems the code of those lines are based on https://github.com/openstack/neutron-lib/blob/698e4c8daa7d43018a71122ec5b0cd5b17b55141/neutron_lib/db/model_query.py#L120 * Step-by-step reproduction steps: - Create a lot of projects (at least 1000) - Create a SG in admin account - Create fake networks (vlan, vxlan) with associated - Share the SG and all networks with all projects * Expected output: lower response time, less than 5 seconds (approximatively). * Actual output: May lead to gateway timeout. * Version: ** OpenStack version Stein releases for all components ** CentOS 7.4 with kolla containers ** kolla-ansible for stein release * Environment: We operate all services in Openstack except for Cinder. * Perceived severity: Medium * Summary: Slownesses on neutron API with many RBAC rules * High level description: Sharing several networks or security groups to project drastically increase API response time on some routes (/networks or /server/detail). For quite some time we have observing that reponse times are increasing (slowly fur surely) on /networks calls. We have increased the number of Neutron workers, but in vain. Lately, we're observing that it's getting worse (reponse time form 5 to 370 seconds). We discarded possible bottlenecks one by one (our service endpoint performance, neutron API configuration, etc). But we have found that some calls in the DB takes a lot of time. It seems they are stuck in the mariadb database (10.3.10). So we have captured a slow queries in mysql. An example of for /server/detail: --------------------------------- http://paste.openstack.org/show/803334/ We can see that there are more than 2 millions of rows examinated, and around 1657 returned. An example of for /networks: ---------------------------- http://paste.openstack.org/show/803337/ Rows_sent: 517 Rows_examined: 223519 * Pre-conditions: Database tables size: table:     - networkrbacs 16928 rows     - securitygrouprbacs 1691 rows     - keystone.project 1713 rows Control plane nodes are shared with some others services: - RMQ - mariadb - Openstack APIs - DHCP agents It seems the code of those lines are based on https://github.com/openstack/neutron-lib/blob/698e4c8daa7d43018a71122ec5b0cd5b17b55141/neutron_lib/db/model_query.py#L120 * Step-by-step reproduction steps: - Create a lot of projects (at least 1000) - Create a SG in admin account - Create fake networks (vlan, vxlan) with associated - Share the SG and all networks with all projects * Expected output: lower response time, less than 5 seconds (approximatively). * Actual output: May lead to gateway timeout. * Version:   ** OpenStack version Stein releases for all components (neutron 14.2.0).   ** CentOS 7.4 with kolla containers   ** kolla-ansible for stein release * Environment: We operate all services in Openstack except for Cinder. * Perceived severity: Medium
2021-03-13 04:51:48 Hongbin Lu neutron: status New Confirmed
2021-03-13 04:51:52 Hongbin Lu neutron: importance Undecided Medium
2021-03-13 04:52:52 Hongbin Lu tags db
2021-03-16 08:33:32 Slawek Kaplonski tags db db loadimpact
2021-04-05 16:14:45 Rodolfo Alonso bug added subscriber Rodolfo Alonso
2021-04-06 06:38:48 Slawek Kaplonski neutron: status Confirmed Incomplete
2021-04-29 11:22:05 LEDUC Florian neutron: status Incomplete Confirmed
2021-05-07 16:15:07 Tyler Stachecki bug added subscriber Tyler Stachecki
2021-05-07 16:17:57 Tyler Stachecki neutron: assignee Tyler Stachecki (tstachecki)
2021-06-01 07:13:36 LEDUC Florian attachment added db_model_query.diff.txt https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5501581/+files/db_model_query.diff.txt
2021-06-02 03:37:58 Tyler Stachecki attachment added preliminary-neutron-rbac-perf.patch https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5501805/+files/preliminary-neutron-rbac-perf.patch
2021-09-20 13:00:19 LEDUC Florian attachment added PERFORMANCE_ISSUE_20092021.txt https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5526474/+files/PERFORMANCE_ISSUE_20092021.txt
2021-09-20 16:06:35 OpenStack Infra neutron: status Confirmed In Progress
2021-09-21 14:47:30 LEDUC Florian attachment removed PERFORMANCE_ISSUE_20092021.txt https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5526474/+files/PERFORMANCE_ISSUE_20092021.txt
2021-09-21 14:49:39 LEDUC Florian attachment added Hostname removed from the last attachment. https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5526691/+files/PERFORMANCE_ISSUE_20092021.txt
2021-10-02 10:57:12 OpenStack Infra neutron: status In Progress Fix Released
2021-10-28 20:04:07 Tyler Stachecki attachment removed preliminary-neutron-rbac-perf.patch https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5501805/+files/preliminary-neutron-rbac-perf.patch
2021-10-28 20:05:42 Tyler Stachecki attachment added ussuri-v1.patch https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5536840/+files/ussuri-v1.patch
2021-10-29 13:32:06 Tyler Stachecki attachment added ussuri-v2.patch https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5537010/+files/ussuri-v2.patch
2021-10-29 13:32:22 Tyler Stachecki attachment removed ussuri-v1.patch https://bugs.launchpad.net/neutron/+bug/1918145/+attachment/5536840/+files/ussuri-v1.patch
2022-04-14 19:35:02 Slawek Kaplonski neutron: assignee Tyler Stachecki (tstachecki)
2022-04-14 19:35:07 Slawek Kaplonski tags db loadimpact db loadimpact timeout-abandon
2022-06-13 03:06:04 Tyler Stachecki neutron: assignee Tyler Stachecki (tstachecki)
2022-06-21 09:05:31 Clement G bug added subscriber Clement G
2023-01-06 13:55:28 Sahid Orentino neutron: status Fix Released Confirmed
2023-01-16 09:03:45 OpenStack Infra neutron: status Confirmed In Progress
2023-05-31 11:24:21 Rodolfo Alonso neutron: assignee Tyler Stachecki (tstachecki) Rodolfo Alonso (rodolfo-alonso-hernandez)
2023-06-02 12:07:26 OpenStack Infra tags db loadimpact timeout-abandon db in-stable-zed loadimpact timeout-abandon
2023-06-02 12:13:25 OpenStack Infra tags db in-stable-zed loadimpact timeout-abandon db in-stable-wallaby in-stable-zed loadimpact timeout-abandon
2023-06-02 12:13:30 OpenStack Infra tags db in-stable-wallaby in-stable-zed loadimpact timeout-abandon db in-stable-wallaby in-stable-yoga in-stable-zed loadimpact timeout-abandon
2023-06-02 12:13:35 OpenStack Infra tags db in-stable-wallaby in-stable-yoga in-stable-zed loadimpact timeout-abandon db in-stable-wallaby in-stable-xena in-stable-yoga in-stable-zed loadimpact timeout-abandon
2023-06-04 23:31:14 OpenStack Infra neutron: status In Progress Fix Released
2023-11-09 10:59:22 Christian Rohmann bug added subscriber Christian Rohmann