This is strange. You have very good looking information on how to reproduce this bug but in my environment I still cannot do that.
I have two devstack environments, one for master, another for stable/xena. Both are single-host environments. I changed neutron worker settings (api, rpc, metadata) to the same as yours. Also the default db in devstack is mysql, not postgre.
In the master environment I even enabled vpnaas as it was enabled in your config.
But in none of these variations was I able to reproduce the bug.
I am running out of ideas what may be the difference between your environment and mine. A few more though:
* In a production environment you likely have neutron-server running on multiple hosts. If you turn neutron-server off on all hosts except one, can you still reproduce the bug?
* I'm no DB expert (especially not for postgre), but if you use any db replication cluster, can you still reproduce bug if you turn that off and you have a single host DB?
Hi Anton,
This is strange. You have very good looking information on how to reproduce this bug but in my environment I still cannot do that.
I have two devstack environments, one for master, another for stable/xena. Both are single-host environments. I changed neutron worker settings (api, rpc, metadata) to the same as yours. Also the default db in devstack is mysql, not postgre.
Beyond the sleep() suggested by you I have even added another sleep(3) before this line: /opendev. org/openstack/ neutron/ src/commit/ d847b52c4f36e17 c9c360320d160a0 d05330a71c/ neutron/ db/db_base_ plugin_ v2.py#L523
https:/
In the master environment I even enabled vpnaas as it was enabled in your config.
But in none of these variations was I able to reproduce the bug.
I am running out of ideas what may be the difference between your environment and mine. A few more though:
* In a production environment you likely have neutron-server running on multiple hosts. If you turn neutron-server off on all hosts except one, can you still reproduce the bug?
* I'm no DB expert (especially not for postgre), but if you use any db replication cluster, can you still reproduce bug if you turn that off and you have a single host DB?
Just for later reference:
delete side db transaction: /opendev. org/openstack/ neutron/ src/branch/ stable/ xena/neutron/ db/db_base_ plugin_ v2.py#L511- L523 /opendev. org/openstack/ neutron/ src/branch/ stable/ xena/neutron/ services/ segments/ db.py#L344- L370
https:/
segments are deleted from NETWORK PRECOMMIT_DELETE hook:
https:/
list side db transaction: /opendev. org/openstack/ neutron/ src/branch/ stable/ xena/neutron/ plugins/ ml2/plugin. py#L1235- L1244 /opendev. org/openstack/ neutron/ src/branch/ stable/ xena/neutron/ plugins/ ml2/managers. py#L160- L184
https:/
the debug log message in the original report comes from here:
https:/