ebtables calls can race with libvirt

Bug #1316621 reported by Pavel Sedlák
This bug affects 4 people
Affects                    Status         Importance   Assigned to    Milestone
OpenStack Compute (nova)   Fix Released   Medium       Chet Burgess
neutron                    Fix Released   Medium       Kevin Benton

Bug Description

When using nova-network with libvirt, a request to associate a floating IP sometimes fails like this:

> http://192.168.1.12:8774/v2/258a4b20c77240bf9b386411430683fa/servers/a9e734e4-5310-4191-a7f0-78fca4b367e7/action
>
> BadRequest: Bad request
> Details: {'message': 'Error. Unable to associate floating ip', 'code': '400'}

The real issue is that the ebtables rootwrap call fails:
Command: sudo nova-rootwrap /etc/nova/rootwrap.conf ebtables -t nat -I PREROUTING --logical-in br100 -p ipv4 --ip-src 192.168.32.10 ! --ip-dst 192.168.32.0/22 -j redirect --redirect-target ACCEPT
Exit code: 255
Stdout: ''
Stderr: "Unable to update the kernel. Two possible causes:\n1. Multiple ebtables programs were executing simultaneously. The ebtables\n userspace tool doesn't by default support multiple ebtables programs running\n concurrently. The ebtables option --concurrent or a tool like flock can be\n used to support concurrent scripts that update the ebtables kernel tables.\n2. The kernel doesn't support a certain ebtables extension, consider\n recompiling your kernel or insmod the extension.\n.\n"

It happens roughly once per full tempest run, and not in every run, so missing kernel support and the other listed causes should not apply here.
Probably already mentioned in https://<email address hidden>/msg23422.html.

As that call in nova is synchronized (it runs under a lock), could it be that nova is actually racing with libvirt itself calling ebtables?
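
As an illustrative aside (not part of the original report): the flock approach the error message mentions amounts to wrapping every ebtables invocation in an exclusive file lock. A minimal sketch, assuming a hypothetical lock path; note that this only serializes callers that agree on the same lock file, which is exactly why an external caller such as libvirt can still race unless ebtables' own --concurrent locking is used.

    # Illustrative sketch only: serializing ebtables calls with an exclusive
    # file lock (the flock-style approach the error message mentions).
    # The lock path below is hypothetical.
    import fcntl
    import subprocess

    EBTABLES_LOCK_PATH = '/var/lock/ebtables.example'

    def run_ebtables(args):
        """Run ebtables while holding an exclusive lock on a shared lock file."""
        with open(EBTABLES_LOCK_PATH, 'w') as lock_file:
            fcntl.flock(lock_file, fcntl.LOCK_EX)  # blocks until the lock is free
            try:
                return subprocess.call(['ebtables'] + list(args))
            finally:
                fcntl.flock(lock_file, fcntl.LOCK_UN)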

Revision history for this message
Pavel Sedlák (psedlak) wrote :

Happened with Havana on RHEL6 and Icehouse on RHEL 7.
As it's flaky I don't have much detailed info beyond common logs and package versions, though since it happens with both Havana and Icehouse on different kernel versions etc., those differences don't seem to be related anyway.

Attaching part of nova-network.log showing that locks were obtained and command failed.

Tracy Jones (tjones-i)
tags: added: libvirt
Solly Ross (sross-7)
Changed in nova:
status: New → Confirmed
importance: Undecided → Medium
Revision history for this message
Vish Ishaya (vishvananda) wrote :

Well, that is annoying. If it is that rare, perhaps doing a few retries is good enough. I'm not sure if there is an easy way to do a shared lock with kvm.

Revision history for this message
Michael Still (mikal) wrote :

We ignore the exit code on the delete we do before an insert of a rule, which leaves me thinking a retry would be hard to implement here. I guess we could change the delete to check the list of ebtables rules to make sure the entry exists, but I am unsure how expensive that would be.
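
As a rough illustration of the check described above (not the actual nova code; the helper names are made up): list the chain first and only issue the delete when the rule is actually present.

    # Sketch only: look up the chain before deleting, instead of issuing a
    # blind delete and ignoring its exit code.
    import subprocess

    def rule_present(table, chain, rule_fragment):
        """Return True if rule_fragment appears in the listed chain."""
        output = subprocess.check_output(['ebtables', '-t', table, '-L', chain])
        return rule_fragment in output.decode()

    def delete_rule_if_present(table, chain, rule_args, rule_fragment):
        if rule_present(table, chain, rule_fragment):
            subprocess.check_call(['ebtables', '-t', table, '-D', chain] + rule_args)

The listing itself is cheap, but this is still a check-then-act sequence, so it would not remove the underlying race with other ebtables callers.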

Michael Still (mikal)
Changed in nova:
assignee: nobody → Chet Burgess (cfb-n)
Revision history for this message
jazeltq (jazeltq-k) wrote :

This bug can be reproduced with rally; for example, you can run the boot-run-command-delete task.
The important point is that you should test your cloud under high load; then the bug will reproduce.
I used rally to test my cloud.
The rally configuration is:
{
    "VMTasks.boot_runcommand_delete": [
        {
            "args": {
                "flavor": {
                    "name": "m1.small"
                },
                "image": {
                    "name": "ubuntu-12-04-raw-rally-test"
                },
                "script": "/home/rally/rally/doc/samples/ec_script/ubuntu_ls_test.sh",
                "interpreter": "bash",
                "username": "root",
                "floating_network": "LTQ",
                "use_floatingip": true,
                "availability_zone": "dell420"
            },
            "runner": {
                "type": "constant",
                "times": 1000,
                "concurrency": 40,
                "timeout": 6000
            },
            "context": {
                "users": {
                    "tenants": 1,
                    "users_per_tenant": 1
                },
                "quotas": {
                    "nova": {
                        "instances": -1,
                        "cores": -1,
                        "ram": -1,
                        "fixed_ips": -1,
                        "floating_ips": -1
                    }
                }
            }
        }
    ]
}

The error rally reports is:
2014-08-15 13:55:53.602 29457 INFO rally.benchmark.runners.base [-] Task bd820c37-2eaf-49d6-99b8-7952d453197d | ITER: 747 END: Error <class 'novaclient.exceptions.BadRequest'>: Error. Unable to associate floating ip (HTTP 400) (Request-ID: req-fa6fa661-e41d-4235-9da7-74ba882dd3c8)

The nova-network.log at the same time shows:
2014-08-15 13:55:52.990 23291 DEBUG nova.network.linux_net [req-fa6fa661-e41d-4235-9da7-74ba882dd3c8 737b99c364a64253920c67313655e171 a8c948f6e70648608603b5079537c525] IPTablesManager.apply completed with success _apply /usr/lib/python2.7/dist-packages/nova/network/linux_net.py:451
2014-08-15 13:55:52.991 23291 DEBUG nova.openstack.common.lockutils [req-fa6fa661-e41d-4235-9da7-74ba882dd3c8 737b99c364a64253920c67313655e171 a8c948f6e70648608603b5079537c525] Released file lock "iptables" at /var/lock/nova/nova-iptables lock /usr/lib/python2.7/dist-packages/nova/openstack/common/lockutils.py:208
2014-08-15 13:55:52.991 23291 DEBUG nova.openstack.common.lockutils [req-fa6fa661-e41d-4235-9da7-74ba882dd3c8 737b99c364a64253920c67313655e171 a8c948f6e70648608603b5079537c525] Got semaphore "ebtables" lock /usr/lib/python2.7/dist-packages/nova/openstack/common/lockutils.py:166
2014-08-15 13:55:52.992 23291 DEBUG nova.openstack.common.lockutils [req-fa6fa661-e41d-4235-9da7-74ba882dd3c8 737b99c364a64253920c67313655e171 a8c948f6e70648608603b5079537c525] Attempting to grab file lock "ebtables" lock /usr/lib/python2.7/dist-packages/nova/openstack/common/lockutils.py:176
2014-08-15 13:55:52.993 23291 DEBUG nova.openstack.common.lockutils [req-fa6fa661-e41d-4235-9da7-74ba882dd3c8 737b99c364a64253920c67313655e171 a8c948f6e70648608603b5079537c525] Got file lock "eb...

Revision history for this message
jazeltq (jazeltq-k) wrote :

The ebtables problem is also discussed here:
http://www.spinics.net/linux/fedora/libvirt-users/msg06645.html

Revision history for this message
Matthew Treinish (treinish) wrote :

Marking as high, because this has been seen more recently in bringing up multi-node gate tests.

tags: added: testing
Changed in nova:
importance: Medium → High
Revision history for this message
Daniel Berrange (berrange) wrote :

This patch to upstream libvirt adds use of --concurrent to ebtables and --wait to iptables/ip6tables.

https://www.redhat.com/archives/libvir-list/2014-November/msg00330.html

For this to help with the race condition we'd need to modify Nova to use the same args too.

Revision history for this message
Chet Burgess (cfb-n) wrote :

@berrange

That's excellent. I was going to propose a change to do just that now that I'm back from vacation. Since that's already done I can work on the other required pieces to make that work in nova.

Since this is currently hurting the gate, the current plan is the following:

1) Submit a quick fix that adds a simple retry to nova for ebtables. This should get the gate working smoothly again.

2) Add support for timing out long-running commands to oslo.concurrency.processutils. ebtables --concurrent will block forever until it gets the lock, so we need a way to reliably time it out after some period to prevent nova blocking on this forever (a rough sketch follows below).

3) Once we can time out an operation in processutils we can patch nova to use --concurrent.

I should have patch #1 up in the next day.
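
A minimal sketch of items 2 and 3 together, using Python 3's standard library rather than oslo.concurrency.processutils (which is what nova itself would use), just to show the shape of the change: pass --concurrent so ebtables takes its own lock, and bound the wait with a timeout so a stuck lock cannot block nova forever. The helper name and timeout value are illustrative.

    # Sketch only: ebtables --concurrent bounded by a timeout. Uses the
    # Python 3 standard library for brevity; nova would go through
    # oslo.concurrency.processutils instead.
    import subprocess

    def ebtables_concurrent(args, timeout=10):
        """Run ebtables with --concurrent, giving up after `timeout` seconds."""
        cmd = ['ebtables', '--concurrent'] + list(args)
        # raises subprocess.TimeoutExpired (and kills the child) if the
        # ebtables lock is never acquired within the timeout
        return subprocess.run(cmd, check=True, timeout=timeout)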

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to nova (master)

Fix proposed to branch: master
Review: https://review.openstack.org/136217

Changed in nova:
status: Confirmed → In Progress
Changed in nova:
assignee: Chet Burgess (cfb-n) → Brent Eagles (beagles)
Changed in nova:
assignee: Brent Eagles (beagles) → Chet Burgess (cfb-n)
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to nova (master)

Reviewed: https://review.openstack.org/136217
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=fb9b2058051b771732f4425c97651128c8060441
Submitter: Jenkins
Branch: master

commit fb9b2058051b771732f4425c97651128c8060441
Author: Chet Burgess <email address hidden>
Date: Thu Nov 20 18:29:15 2014 -0800

    Retry ebtables on race

    Calls to ebtables can race with libvirt and cause nova, or libvirt
    to fail to apply ebtables rules.

    The goal of this patch is to provide a simple fix to improve the
    stability of the gate.

    We now call ebtables in a simple loop that retries on failure.
    Long term we want to update nova to make use of the --concurrent
    flag in newer versions of ebtables. The --concurrent flag
    implements a lock to prevent multiple invocations of ebtables from
    racing. This will require a newer libvirt and the ability to
    timeout long running execs (--concurrent can block forever if it
    never gets the lock).

    A future patch is forthcoming to add support for --concurrent.

    DocImpact
    Add ebtables_exec_attempts option (default=3).

    Change-Id: I3e04782ac4678581462f9bee4bb10d5f3b223457
    Partial-Bug: #1316621

Revision history for this message
Chet Burgess (cfb-n) wrote :

Will someone with the proper permissions please change the importance back to Medium? We have a workaround for the gate now. I'm still tracking the long-term fix for K but the immediate symptoms have been addressed.

Chet Burgess (cfb-n)
Changed in nova:
importance: High → Medium
milestone: none → kilo-3
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to nova (master)

Fix proposed to branch: master
Review: https://review.openstack.org/140514

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to nova (master)

Reviewed: https://review.openstack.org/140514
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=4f418727f7de689a2387d3a7a2cc90ae9503c91e
Submitter: Jenkins
Branch: master

commit 4f418727f7de689a2387d3a7a2cc90ae9503c91e
Author: Chet Burgess <email address hidden>
Date: Tue Dec 9 14:51:40 2014 -0800

    Add backoff to ebtables retry

    We need a backoff between ebtables retries. In some tempest tests we
    have seen the retries complete in 100ms and still fail.

    We now sleep for ebtables_retry_interval * loop count seconds. With
    a default of 1.0 this means by default we sleep for 1.0s, 2.0s, and
    3.0s before we finally giving up.

    Change-Id: I0b9b664a592364bedd11124a1ec921d8ea011704
    Partial-Bug: #1316621
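
For illustration, a simplified sketch of the retry-with-linear-backoff behaviour the two commits above describe. The option names mirror the ones mentioned in the commit messages (ebtables_exec_attempts, ebtables_retry_interval), but the helper itself is hypothetical, not the actual nova implementation.

    # Simplified sketch of the behaviour described by the two commits above;
    # not the actual nova code.
    import subprocess
    import time

    ebtables_exec_attempts = 3     # default from the first commit
    ebtables_retry_interval = 1.0  # default from the second commit

    def ebtables_with_retry(args):
        last_error = None
        for attempt in range(1, ebtables_exec_attempts + 1):
            try:
                return subprocess.check_call(['ebtables'] + list(args))
            except subprocess.CalledProcessError as exc:
                last_error = exc
                # linear backoff: 1.0s, 2.0s, 3.0s with the defaults above
                time.sleep(ebtables_retry_interval * attempt)
        raise last_error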

Revision history for this message
Davanum Srinivas (DIMS) (dims-v) wrote :

Looks like this has merged, switching status to "Fix Committed"

Changed in nova:
status: In Progress → Fix Committed
Thierry Carrez (ttx)
Changed in nova:
status: Fix Committed → Fix Released
Revision history for this message
Matt Riedemann (mriedem) wrote :

The patch from danpb merged into upstream libvirt:

http://libvirt.org/git/?p=libvirt.git;a=commit;h=dc33e6e4a5a5d429198b2c63ff6b63729353e2cf

It's in version 1.2.11 which is way too new for what we're testing with in the gate.

Thierry Carrez (ttx)
Changed in nova:
milestone: kilo-3 → 2015.1.0
Changed in neutron:
assignee: nobody → Kevin Benton (kevinbenton)
status: New → In Progress
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to neutron (master)

Reviewed: https://review.openstack.org/431773
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=486e2f4eb5a02c98958582e366a4d6081ea897e0
Submitter: Jenkins
Branch: master

commit 486e2f4eb5a02c98958582e366a4d6081ea897e0
Author: Kevin Benton <email address hidden>
Date: Thu Feb 9 15:10:20 2017 -0800

    Pass --concurrent flag to ebtables calls

    This flag will force ebtables to acquire a lock so we don't
    have to worry about ebtables errors occuring if something else
    on the system is trying to use ebtables as well.

    Closes-Bug: #1316621
    Change-Id: I695c01e015fdc201df8f23d9b48f9d3678240266

Changed in neutron:
status: In Progress → Fix Released
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/neutron 11.0.0.0b1

This issue was fixed in the openstack/neutron 11.0.0.0b1 development milestone.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to neutron (stable/ocata)

Fix proposed to branch: stable/ocata
Review: https://review.openstack.org/460916

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix proposed to neutron (stable/newton)

Fix proposed to branch: stable/newton
Review: https://review.openstack.org/460917

Changed in neutron:
importance: Undecided → Medium
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to neutron (stable/ocata)

Reviewed: https://review.openstack.org/460916
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=470833d36d5313d549c9905d9a36af8cfbcc3330
Submitter: Jenkins
Branch: stable/ocata

commit 470833d36d5313d549c9905d9a36af8cfbcc3330
Author: Kevin Benton <email address hidden>
Date: Thu Feb 9 15:10:20 2017 -0800

    Pass --concurrent flag to ebtables calls

    This flag will force ebtables to acquire a lock so we don't
    have to worry about ebtables errors occuring if something else
    on the system is trying to use ebtables as well.

    Closes-Bug: #1316621
    Change-Id: I695c01e015fdc201df8f23d9b48f9d3678240266
    (cherry picked from commit 486e2f4eb5a02c98958582e366a4d6081ea897e0)

tags: added: in-stable-ocata
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix merged to neutron (stable/newton)

Reviewed: https://review.openstack.org/460917
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=f6ae49b020859b7878a992d2bb158b2c912a5765
Submitter: Jenkins
Branch: stable/newton

commit f6ae49b020859b7878a992d2bb158b2c912a5765
Author: Kevin Benton <email address hidden>
Date: Thu Feb 9 15:10:20 2017 -0800

    Pass --concurrent flag to ebtables calls

    This flag will force ebtables to acquire a lock so we don't
    have to worry about ebtables errors occuring if something else
    on the system is trying to use ebtables as well.

    Closes-Bug: #1316621
    Change-Id: I695c01e015fdc201df8f23d9b48f9d3678240266
    (cherry picked from commit 486e2f4eb5a02c98958582e366a4d6081ea897e0)
    (cherry picked from commit 470833d36d5313d549c9905d9a36af8cfbcc3330)

tags: added: in-stable-newton
Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/neutron 9.4.0

This issue was fixed in the openstack/neutron 9.4.0 release.

Revision history for this message
OpenStack Infra (hudson-openstack) wrote : Fix included in openstack/neutron 10.0.2

This issue was fixed in the openstack/neutron 10.0.2 release.
