Bug #1302080 “Host is accessible from instance using Linux bridg...” : Bugs : OpenStack Compute (nova)

Revision history for this message

Darragh O'Reilly (darragh-oreilly) wrote on 2014-04-03:

#1

ip6tables.out Edit (2.3 KiB, text/plain)

Darragh O'Reilly (darragh-oreilly) on 2014-04-03

description:	updated
description:	updated

Jeremy Stanley (fungi) on 2014-04-03

Changed in ossa:
status:	New → Incomplete

Revision history for this message

Jeremy Stanley (fungi) wrote on 2014-04-07:

#2

What releases of Neutron are affected by this bug?

Neutron core security reviewers, any opinion on the scope and exploitability of this?

Revision history for this message

Darragh O'Reilly (darragh-oreilly) wrote on 2014-04-08:

#3

Any releases that have instances plugged into Linux bridges. I have tried with master and the linuxbridge-agent and the ovs-agent. The ovs-agent needs to have Nova configured to use the hybrid VIF driver.

I think it's something that should be documented in the security guide. You would not want tenants to be able to do this for the very same reason you don't give them access to a management network. I just opened it as security bug to be on the safe side.

Revision history for this message

Jeremy Stanley (fungi) wrote on 2014-04-08:

#4

Thanks--I agree this sounds more like a configuration mistake we should warn deployers/operators against making, possibly in the Security Guide or an OSSN. If there are no objections from the Neutron core security reviewers, we should switch this to a public bug and add the security tag to bring it to the attention of the OSSG in case they want to document it.

Revision history for this message

Thierry Carrez (ttx) wrote on 2014-04-14:

#5

neutron core-sec: please confirm we can open this one.

Revision history for this message

Salvatore Orlando (salvatore-orlando) wrote on 2014-04-16:

#6

The host should never be reachable from an instance.
This is therefore a violation of tenant isolation principle.

However if there is no way this can be exploited to access services running on the host (assuming the host has been secured properly) then I think it's ok to open this bug.

Under the assumption that he host is properly secured I don't think there's any possible exploit, but I'd wait for other people to comment.

Revision history for this message

Thierry Carrez (ttx) wrote on 2014-04-22:

#7

this bug should be fixed openly as a strengthening measure.

information type:	Private Security → Public
Changed in ossa:
status:	Incomplete → Won't Fix

Eugene Nikanorov (enikanorov) on 2014-07-28

Changed in neutron:
importance:	Undecided → Medium
status:	New → Confirmed

Aniruddha Singh Gautam (aniruddha-gautam) on 2014-11-07

Changed in neutron:
assignee:	nobody → Aniruddha Singh Gautam (aniruddha-gautam)

Eugene Nikanorov (enikanorov) on 2014-11-23

tags:

added: ipv6

Revision history for this message

Sridhar Gaddam (sridhargaddam) wrote on 2015-08-05:

#8

AFAIU the following patch in nova would address the issue reported in the bug - https://review.openstack.org/#/c/198054

Matt Riedemann (mriedem) on 2015-09-22

tags:	added: network
Changed in nova:
status:	New → In Progress
assignee:	nobody → Adam Kacmarsky (adam-kacmarsky)
importance:	Undecided → Medium
assignee:	Adam Kacmarsky (adam-kacmarsky) → Brian Haley (brian-haley)

OpenStack Infra (hudson-openstack) on 2015-10-29

Changed in nova:
assignee:	Brian Haley (brian-haley) → Rawlin Peters (rawlin-peters)

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2015-11-03: Fix proposed to neutron (master)

#9

Fix proposed to branch: master
Review: https://review.openstack.org/241076

Changed in neutron:
assignee:	Aniruddha Singh Gautam (aniruddha-gautam) → Brian Haley (brian-haley)
status:	Confirmed → In Progress

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2015-11-18: Fix merged to neutron (master)

#10

Reviewed: https://review.openstack.org/241076
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=404eaead793b3192982ae247685970973609be1f
Submitter: Jenkins
Branch: master

commit 404eaead793b3192982ae247685970973609be1f
Author: Brian Haley <email address hidden>
Date: Mon Nov 2 22:04:11 2015 -0500

Disable IPv6 on bridge devices in LinuxBridgeManager

We don't want to create a bridge device with an IPv6 address because
it will see the Router Advertisement from Neutron.

Change-Id: If59a823804d3477c5d8877f46fcc4c018af57a5a
Closes-bug: 1302080

Changed in neutron:
status:	In Progress → Fix Committed

Doug Hellmann (doug-hellmann) on 2015-12-03

Changed in neutron:
status:	Fix Committed → Fix Released

OpenStack Infra (hudson-openstack) on 2015-12-09

Changed in nova:
assignee:	Rawlin Peters (rawlin-peters) → Brian Haley (brian-haley)

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-01-22: Fix proposed to neutron (stable/liberty)

#11

Fix proposed to branch: stable/liberty
Review: https://review.openstack.org/271373

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-02-01: Fix merged to nova (master)

#12

Reviewed: https://review.openstack.org/198054
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=5ab1b1b1c456b8b43edbd1bddd74b96b56ab80e6
Submitter: Jenkins
Branch: master

commit 5ab1b1b1c456b8b43edbd1bddd74b96b56ab80e6
Author: Adam Kacmarsky <email address hidden>
Date: Thu Jul 2 10:13:16 2015 -0600

Disable IPv6 on bridge devices

    The qbr bridge should not have any IPv6 addresses, either
    link-local, or on the tenant's private network due to the
    bridge processing Router Advertisements from Neutron and
    auto-configuring addresses, since it will allow access to
    the hypervisor from a tenant VM.

The bridge only exists to allow the Neutron security group
code to work with OVS, so we can safely disable IPv6 on it.

Closes-bug: 1470931
Partial-bug: 1302080

Change-Id: Ideecab1c21b240bcca71973ed74b0374afb20e5e

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-02-01: Fix proposed to nova (stable/liberty)

#13

Fix proposed to branch: stable/liberty
Review: https://review.openstack.org/274796

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-02-16: Fix merged to nova (stable/liberty)

#14

Reviewed: https://review.openstack.org/274796
Committed: https://git.openstack.org/cgit/openstack/nova/commit/?id=44401727235c5a9736c4229f7fc581e6a970ff91
Submitter: Jenkins
Branch: stable/liberty

commit 44401727235c5a9736c4229f7fc581e6a970ff91
Author: Adam Kacmarsky <email address hidden>
Date: Thu Jul 2 10:13:16 2015 -0600

Disable IPv6 on bridge devices

    The qbr bridge should not have any IPv6 addresses, either
    link-local, or on the tenant's private network due to the
    bridge processing Router Advertisements from Neutron and
    auto-configuring addresses, since it will allow access to
    the hypervisor from a tenant VM.

The bridge only exists to allow the Neutron security group
code to work with OVS, so we can safely disable IPv6 on it.

Closes-bug: 1470931
Partial-bug: 1302080

Conflicts:
nova/tests/unit/virt/libvirt/test_vif.py

Change-Id: Ideecab1c21b240bcca71973ed74b0374afb20e5e
(cherry picked from commit 5ab1b1b1c456b8b43edbd1bddd74b96b56ab80e6)

tags:

added: in-stable-liberty

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-02-26: Fix merged to neutron (stable/liberty)

#15

Reviewed: https://review.openstack.org/271373
Committed: https://git.openstack.org/cgit/openstack/neutron/commit/?id=a381aa07d9c0ea586b649420643b4f91b65979d8
Submitter: Jenkins
Branch: stable/liberty

commit a381aa07d9c0ea586b649420643b4f91b65979d8
Author: Brian Haley <email address hidden>
Date: Mon Nov 2 22:04:11 2015 -0500

Disable IPv6 on bridge devices in LinuxBridgeManager

We don't want to create a bridge device with an IPv6 address because
it will see the Router Advertisement from Neutron.

Conflicts:
neutron/agent/linux/bridge_lib.py

    Change-Id: If59a823804d3477c5d8877f46fcc4c018af57a5a
    Closes-bug: 1302080
    (cherry picked from commit 404eaead793b3192982ae247685970973609be1f)

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-03-30:

#16

The fix to nova has been released, but was tagged with "partial-bug" based on a review comment, it should have stayed "closes-bug" to automatically update this from the infra scripts.

So I will be marking this "fix released" accordingly.

Changed in nova:
status:	In Progress → Fix Released

Revision history for this message

Dustin Lundquist (dlundquist) wrote on 2016-04-14:

#17

It seems like we should fix this in kilo as well, but the fix depends on BridgeManager changes in I4b9d755677bba0d487a261004d9ba9b11116101f. Is it worth a new patch to explicitly disable IPv6 in ensure_bridge() for Kilo?

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-04-18:

#18

Download full text (3.3 KiB)

Is this bug really fixed? Running Mitaka, it seems not. Using linuxbridges in combination with vxlan, only the vxlan interface gets disable_ipv6=1 set, not the bridge one.

This is from the compute node when booting up an instance (first one on that particular network, so all the interfaces must be provisioned):

# egrep 'brq3cd6a5c8-ec|disable_ipv6' linuxbridge-agent.log
2016-04-18 13:08:41.701 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['sysctl', '-w', 'net.ipv6.conf.vxlan-65601.disable_ipv6=1'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.710 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['ip', 'link', 'set', 'brq3cd6a5c8-ec', 'up'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.714 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['brctl', 'addif', 'brq3cd6a5c8-ec', 'vxlan-65601'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.729 5916 DEBUG neutron.plugins.ml2.drivers.linuxbridge.agent.linuxbridge_neutron_agent [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Skip adding device tap323ae2d2-4b to brq3cd6a5c8-ec. It is owned by compute:None and thus added elsewhere. _add_tap_interface /usr/lib/python2.7/site-packages/neutron/plugins/ml2/drivers/linuxbridge/agent/linuxbridge_neutron_agent.py:472

As you can see, disable_ipv6 gets set on the vxlan interface, but not the bridge (nor the tap interface for that matter).

And lo and behold, the bridge interface has acquired an global ipv6 address (because there's a neutron router/L3 agent attached to the network with ipv6-address-mode=slaac+ipv6-ra-mode=slaac):

# ip address list dev brq3cd6a5c8-ec
18: brq3cd6a5c8-ec: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 8950 qdisc noqueue state UP
    link/ether b6:e4:c1:aa:90:70 brd ff:ff:ff:ff:ff:ff
    inet6 2001:db8:123:456:b4e4:c1ff:feaa:9070/64 scope global mngtmpaddr dynamic
       valid_lft 86365sec preferred_lft 14365sec
    inet6 fe80::50b2:4bff:fe32:2a3c/64 scope link
       valid_lft forever preferred_lft forever

Furthermore, I'd like to stress that this IPv6 address is *GLOBALLY REACHABLE*! Yes, that means that anyone anywhere on the IPv6 Internet (including the instances themselves) can initiate, e.g., SSH connections *directly* to the compute node - even if it's behind a firewall or are using only private RFC1918 addresses or whatever. These packets will look just like normal VXLAN packets coming from the L3 agent, so they'll bypass any normal network-level protection.

One workaround is to set /proc/sys/net/ipv6/conf/default/disable_ipv6=1. That causes the kernel to ensure that all the relevant devices (vxlan, bridge, tap) gets created with IPv6 disabled by default. However, if you do want IPv6 on other unrelated interfaces (e.g., for management of the compute node itself or to carry vxlan traffic) this could be problem...

Is this bug really fixed? Running Mitaka, it seems not. Using linuxbridges in combination with vxlan, only the vxlan interface gets disable_ipv6=1 set, not the bridge one.

This is from the compute node when booting up an instance (first one on that particular network, so all the interfaces must be provisioned):

# egrep 'brq3cd6a5c8-ec|disable_ipv6' linuxbridge-agent.log
2016-04-18 13:08:41.701 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['sysctl', '-w', 'net.ipv6.conf.vxlan-65601.disable_ipv6=1'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.710 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['ip', 'link', 'set', 'brq3cd6a5c8-ec', 'up'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.714 5916 DEBUG neutron.agent.linux.utils [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Running command (rootwrap daemon): ['brctl', 'addif', 'brq3cd6a5c8-ec', 'vxlan-65601'] execute_rootwrap_daemon /usr/lib/python2.7/site-packages/neutron/agent/linux/utils.py:100
2016-04-18 13:08:41.729 5916 DEBUG neutron.plugins.ml2.drivers.linuxbridge.agent.linuxbridge_neutron_agent [req-1926075e-555e-4363-ab24-4c93d0b5c989 - - - - -] Skip adding device tap323ae2d2-4b to brq3cd6a5c8-ec. It is owned by compute:None and thus added elsewhere. _add_tap_interface /usr/lib/python2.7/site-packages/neutron/plugins/ml2/drivers/linuxbridge/agent/linuxbridge_neutron_agent.py:472

As you can see, disable_ipv6 gets set on the vxlan interface, but not the bridge (nor the tap interface for that matter).

And lo and behold, the bridge interface has acquired an global ipv6 address (because there's a neutron router/L3 agent attached to the network with ipv6-address-mode=slaac+ipv6-ra-mode=slaac):

# ip address list dev brq3cd6a5c8-ec
18: brq3cd6a5c8-ec: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 8950 qdisc noqueue state UP
    link/ether b6:e4:c1:aa:90:70 brd ff:ff:ff:ff:ff:ff
    inet6 2001:db8:123:456:b4e4:c1ff:feaa:9070/64 scope global mngtmpaddr dynamic
       valid_lft 86365sec preferred_lft 14365sec
    inet6 fe80::50b2:4bff:fe32:2a3c/64 scope link
       valid_lft forever preferred_lft forever

Furthermore, I'd like to stress that this IPv6 address is *GLOBALLY REACHABLE*! Yes, that means that anyone anywhere on the IPv6 Internet (including the instances themselves) can initiate, e.g., SSH connections *directly* to the compute node - even if it's behind a firewall or are using only private RFC1918 addresses or whatever. These packets will look just like normal VXLAN packets coming from the L3 agent, so they'll bypass any normal network-level protection.

One workaround is to set /proc/sys/net/ipv6/conf/default/disable_ipv6=1. That causes the kernel to ensure that all the relevant devices (vxlan, bridge, tap) gets created with IPv6 disabled by default. However, if you do want IPv6 on other unrelated interfaces (e.g., for management of the compute node itself or to carry vxlan traffic) this could be problematic if the sysctl gets set before those interfaces are plumbed into the kernel. So be careful...

I'm running openstack-neutron-linuxbridge-8.0.0-1.el7.noarch FWIW.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-04-19:

#19

I've found that my suggested workaround to set /proc/sys/net/ipv6/conf/default/disable_ipv6=1 must only be run on the compute nodes. If it's run on the network nodes, then neutron-l3-agent flat out refuses to configure any IPv6 connectivity on the routers (even though the sysctl is set to 0 inside the qrouter network namespaces). See https://github.com/openstack/neutron/blob/master/neutron/common/ipv6_utils.py#L51-L64

However it seems that the setting is not necessary on the network nodes in any case, disable_ipv6 does get set to 1 on the linuxbridge devices there by something (I have not attempted to figure out what, exactly). It is only on the compute nodes that I need to set disable_ipv6=1 to avoid the global IPv6 address from being configured on the linuxbridges.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-04-21:

#20

Ok, so setting default/disable_ipv6=1 is *not* a viable solution, not even on the compute nodes. The reason: neutron-linuxbridge-agent will (just like neutron-l3-agent) end up believing that IPv6 is completely disabled on the system, and skip applying the IPv6 security group when plumbing an instance. The instance thus ends up being completely wide open from the global IPv6 Internet. Not good.

Setting default/accept_ra=0 seems like a better solution, as this will at the very least stop the services running directly on the compute node from being globally reachable. However it will not prevent the Linux kernel from auto-configuring a link-local address on the bridge device, which in turn is directly reachable from the instances without any kind of firewalling. This bug is in other words *NOT* fixed in Mitaka, as far as I can tell.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-05-04:

#21

As I've mentioned in my previous three comments, this bug is *not* fixed. It has been erroneously marked as fixed. Can someone with the appropriate access please re-open it? Otherwise I suppose I'll just have to submit a new duplicate issue.

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-05-04:

#22

Tore, so it looks like you're using linuxbridge, which I will admit I don't typically run. I'll try and get a config with that up and running.

FYI, I can't reproduce what you're seeing using OVS.

Config:
- Ubuntu 16.04
- single-node devstack
- neutron w/OVS configured with OVSHybridIptablesFirewallDrive
- DVR enabled

Boot a single VM

qbr bridge is created for hybrid plugging, which was where the IPv6 link-local address was being configured.

$ sysctl net.ipv6.conf.qbr74767d3d-4a.disable_ipv6
net.ipv6.conf.qbr74767d3d-4a.disable_ipv6 = 1

$ ip a s qbr74767d3d-4a
15: qbr74767d3d-4a: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1450 qdisc noqueue state UP group default qlen 1000
link/ether ce:93:48:a0:42:c0 brd ff:ff:ff:ff:ff:ff

So there's no addresses there at all.

From inside the VM I can ping the all-nodes multicast address:

$ ping6 -I eth0 ff02::1
PING ff02::1 (ff02::1): 56 data bytes
ping6: can't bind to interface eth0: Operation not permitted
64 bytes from fe80::f816:3eff:fec0:a651: seq=0 ttl=64 time=2.408 ms
64 bytes from fe80::f816:3eff:fe92:17e9: seq=0 ttl=64 time=5.795 ms (DUP!)
64 bytes from fe80::f816:3eff:fe41:6efc: seq=0 ttl=64 time=7.924 ms (DUP!)
64 bytes from fe80::f816:3eff:fe6b:f22: seq=0 ttl=64 time=8.231 ms (DUP!)
64 bytes from fe80::f816:3eff:fe7d:2e94: seq=0 ttl=64 time=10.130 ms (DUP!)
64 bytes from fe80::f816:3eff:fe5e:b721: seq=0 ttl=64 time=10.410 ms (DUP!)

Each address is something that responded:

fe80::f816:3eff:fec0:a651 - the VM itself
fe80::f816:3eff:fe92:17e9 - dhcp server
fe80::f816:3eff:fe41:6efc - router (distributed interface - v4 subnet, it's local to the VM)
fe80::f816:3eff:fe6b:f22 - router (centralized snat interface - v4 subnet)
fe80::f816:3eff:fe7d:2e94 - router (distributed interface - v6 subnet)
fe80::f816:3eff:fe5e:b721 - router (centralized snat interface - v6 subnet)

I tried to ssh to all of those addresses from the neutron infrastructure and couldn't - connection refused.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-05-04:

#23

Hi Brian,

I am fairly certain that using linuxbridge is a prerequisite for reproducing the issue I'm seeing. I'm not sure if multiple nodes are required (I'm not familiar with devstack), but that's what I have at least - compute nodes with nova separate from network nodes with neutron-l3-agent and neutron-dhcp-agent, with neutron-linuxbridge-agent on both terminating vxlan tunnels between them.

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-05-04:

#24

Tore - things seem to work fine on a single-node devstack linuxbridge config, the VM can only "see" the router and dhcp agent addresses. And I don't see any IPv6 addresses on the brq interfaces. I also see disable_ipv6=1 being set on the brq devices, which you didn't see, I don't know why that is.

I just don't know how quickly I'll get a mult-node setup running.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-05-04:

#25

Hmm. Thinking a bit more about it, I think you'll probably need a multi-node setup. As I observed in comment #19, on the network nodes the disable_ipv6 sysctl *does* get set, it's only on the compute nodes where it does not. If you have everything running on a single node, I'm guessing that whatever it is that is responsible for setting disable_ipv6 correctly on my network nodes is saving the day for you on your hybrid network+compute node.

Revision history for this message

Darragh O'Reilly (darragh-oreilly) wrote on 2016-05-04:

#26

The nova patch fixed it for ovs. But nova uses this with neutron linuxbridge (not sure if nova-network uses it too)
https://github.com/openstack/nova/blob/stable/mitaka/nova/network/linux_net.py#L1616

You should be able to get nova to create the bridge on an all-in-one by booting a vm to a subnet with dhcp disabled and no attached routers.

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-05-05:

#27

Tore, I think I've found the code path, perhaps you could confirm based on the debug messages in your log files.

The nova vif driver (nova/virt/libvirt/vif.py) plug_bridge() method will be called, which will ensure the bridge exists. For this it will call over to ensure_bridge() in nova/network/linux_net.py. That code does not disable ipv6 :(

The hybrid case used when OVS is the bridge calls plug_ovs_hybrid(), which uses the code that was added to disable ipv6.

That's my guess without actually setting it up, so might just be a one-line change?

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-05-05:

#28

Attached are the results from when reproducing the bug with debugging output enabled. It's from a previously unused compute node (no instances nor any virtual networks running), and then I did "nova boot" to fire up an instance.

As you can see, the auto-created brq device on the compute node gets configured with a global IPv6 address and an IPv6 default route. This address is reachable from anywhere in the world, bypassing any network firewalls or anything else that would protect the compute node from unauthorised access. I've therefore chosen to anonymise the addresses in the output.

From reading the previous comments on this issue, it seems that nobody realised that the brq devices would obtain global IPv6 connectivity if there is an IPv6 router on the network. In all likelihood this was the case for the fixed OVS part of the bug as well. This aspect significantly exposes the exposure to possible unauthorised access to the compute node, so it might be wise to reconsider whether or not this should be considered a security issue.

Anyway. While the vxlan device does get disable_ipv6 set, the tap device does not. It therefore auto-configures a link-local IPv6 address, but not a global one or a default route. Presumably the Linux kernel will not process RAs on devices that are members of a bridge device. So while this might not be a problem per se, but I think that disable_ipv6 should be set on the tap device anyway as a precaution. There is no reason at all to retain active layer-3 configuration on any of these interfaces, as far as I can tell

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-05-05:

#29

Tore, the debug output from the nova logs (cpu) would be most helpful, as that is where the brq device is created and configured.

And we did realize the qvb-* devices created on an OVS hybrid plug would auto-configure an IPv6 address since it would process the RA. But if the prefix assigned is only tenant-scope (i.e. some 2001:db8 prefix they are testing with) then there is not Internet reachability, it is mainly that the VM has access to the compute node.

I do not feel we need to worry about the tap devices, can the VM ping that address? And my single-node has disable_ipv6=1 on the taps, I don't know why you are seeing different yet.

Revision history for this message

Tore Anderson (toreanderson) wrote on 2016-05-05:

#30

Attaching debug log from nova-compute, as requested.

While it's true that a 2001:db8 prefix used for testing wouldn't be globally available, that's kind of besides the point, I think. Nobody would use 2001:db8-prefixes in production - there's no NAT or floating IPs in IPv6, so any production deployment of IPv6 will necessarily use globally reachable prefixes and likely RAs with the A-flag set, and thus be vulnerable to unauthorised access to the compute node.

I've tested briefly and I have not been successful in accessing the link-local address on the tap device from the instance or from bare-metal hosts outside of OpenStack residing on the same network. There's no response to ICMPv6 neighbour solicitations, and if I configure a static neighbour entry on the instance or the bare-metal host with the MAC address of the tap or brq device, the packets simply go unanswered. However, considering that the tap device is there only to provide forwarding at layer-2, it does strike me as wrong that there is active layer-3 configuration on it. For all I know, the fact that I cannot reach the link-local address from the instance is dependent on logic in the Linux kernel of the compute node which could change in the future. Therefore I think it would be prudent to set the disable_ipv6 sysctl on this device as well. Considering that disable_ipv6 does get set on the network node, it also seems more consistent that the same thing should happen on the compute nodes.

Revision history for this message

Brian Haley (brian-haley) wrote on 2016-05-05:

#31

Thanks for the log, it shows the path I assumed from comment #27:

DEBUG nova.virt.libvirt.vif Ensuring bridge brq30861ce7-0a plug_bridge

DEBUG nova.network.linux_net Starting Bridge brq30861ce7-0a ensure_bridge

DEBUG oslo_concurrency.processutils Running cmd (subprocess): sudo nova-rootwrap /etc/nova/rootwrap.conf brctl addbr brq30861ce7-0a execute

DEBUG oslo_concurrency.processutils Running cmd (subprocess): sudo nova-rootwrap /etc/nova/rootwrap.conf brctl setfd brq30861ce7-0a 0 execute

DEBUG oslo_concurrency.processutils Running cmd (subprocess): sudo nova-rootwrap /etc/nova/rootwrap.conf brctl stp brq30861ce7-0a off execute

No disable_ipv6 call.

Let me send out a patch, hopefully you can give it a try.

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2016-05-05: Fix proposed to nova (master)

#33

Fix proposed to branch: master
Review: https://review.openstack.org/313070

Revision history for this message

Doug Hellmann (doug-hellmann) wrote on 2016-08-30: Fix included in openstack/os-vif 1.2.0

#34

This issue was fixed in the openstack/os-vif 1.2.0 release.

Revision history for this message

Kalle Happonen (kalle-happonen) wrote on 2016-10-06:

#35

Any progress with https://review.openstack.org/313070
I think this still affects linuxbridge users if I'm not mistaken?

Revision history for this message

OpenStack Infra (hudson-openstack) wrote on 2017-02-06: Change abandoned on nova (master)

#36

Change abandoned by Michael Still (<email address hidden>) on branch: master
Review: https://review.openstack.org/313070
Reason: This patch has been sitting unchanged for more than 12 weeks. I am therefore going to abandon it to keep the review queue sane. Please feel free to restore the change if you're still working on it.

OpenStack Compute (nova)

Host is accessible from instance using Linux bridge IPv6 address

Bug Description

Other bug subscribers

Bug attachments

Remote bug watches

	Status	Importance	Assigned to
OpenStack Compute (nova)	Fix Released	Medium	Brian Haley
OpenStack Security Advisory	Won't Fix	Undecided	Unassigned
neutron	Fix Released	Medium	Brian Haley