Bug #1359833 “HA deployment fails with Neutron GRE” : Bugs : Fuel for OpenStack

Eugene Nikanorov (enikanorov) wrote on 2014-08-21:

#1

fuel-snapshot-2014-08-21_16-40-58.tgz (36.0 MiB, application/x-tar)

Nastya Urlapova (aurlapova) on 2014-08-21

Changed in fuel:
importance:	Undecided → High
assignee:	nobody → Fuel Library Team (fuel-library)
milestone:	none → 5.1

Sergey Vasilenko (xenolog) wrote on 2014-08-21:

#2

Controller nodes can't communicate over the management network, but can over the admin network.

PRIMARY-CONTROLLER:
[root@node-5 ~]# ip -f inet a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    inet 127.0.0.1/8 scope host lo
11: br-ex: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 172.16.161.51/24 brd 172.16.161.255 scope global br-ex
12: br-mgmt: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 192.168.0.3/24 brd 192.168.0.255 scope global br-mgmt
13: br-storage: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 192.168.1.2/24 brd 192.168.1.255 scope global br-storage
14: br-fw-admin: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 10.20.0.3/24 brd 10.20.0.255 scope global br-fw-admin
27: hapr-host: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    inet 240.0.0.1/30 scope global hapr-host
You have new mail in /var/spool/mail/root
[root@node-5 ~]#

SECOND CONTROLLER:
[root@node-6 ~]# ip -f inet a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 16436 qdisc noqueue state UNKNOWN
    inet 127.0.0.1/8 scope host lo
11: br-ex: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 172.16.161.52/24 brd 172.16.161.255 scope global br-ex
12: br-mgmt: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 192.168.0.4/24 brd 192.168.0.255 scope global br-mgmt
13: br-storage: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 192.168.1.3/24 brd 192.168.1.255 scope global br-storage
14: br-fw-admin: <BROADCAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UNKNOWN
    inet 10.20.0.4/24 brd 10.20.0.255 scope global br-fw-admin
17: br-mgmt-hapr: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    inet 192.168.0.4/24 scope global br-mgmt-hapr
19: br-ex-hapr: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc pfifo_fast state UP qlen 1000
    inet 172.16.161.52/24 scope global br-ex-hapr
[root@node-6 ~]#

[root@node-6 ~]# ip n
192.168.0.3 dev br-mgmt INCOMPLETE
192.168.0.2 dev br-mgmt-hapr lladdr ce:a3:0d:ea:d8:53 REACHABLE
10.20.0.2 dev br-fw-admin lladdr 52:54:00:b3:76:00 REACHABLE
10.20.0.3 dev br-fw-admin lladdr a6:a1:2b:d6:9f:43 STALE
192.168.0.5 dev br-mgmt INCOMPLETE
192.168.1.2 dev br-storage FAILED
172.16.161.50 dev br-ex-hapr lladdr 66:d4:bc:40:b8:b9 REACHABLE
[root@node-6 ~]#
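
The neighbor table above is the clearest symptom: peers on br-mgmt and br-storage never resolve, while those on br-fw-admin do. As a minimal sketch (not part of Fuel; the function name and the choice of INCOMPLETE and FAILED as the "unresolved" states are assumptions made for illustration), output of `ip n` can be scanned for such entries like this:

```python
from collections import defaultdict

# Neighbor states treated as "no link-layer address was learned".
# Assumption: only these two states are of interest for this check.
UNRESOLVED_STATES = {"INCOMPLETE", "FAILED"}


def unresolved_neighbors(ip_neigh_output):
    """Group unresolved neighbor addresses by device.

    Expects text in the format printed by `ip n`, for example
    "192.168.0.3 dev br-mgmt INCOMPLETE". Lines that do not match
    this shape (prompts, blank lines) are skipped.
    """
    by_device = defaultdict(list)
    for line in ip_neigh_output.splitlines():
        fields = line.split()
        if len(fields) < 4 or fields[1] != "dev":
            continue
        address, device, state = fields[0], fields[2], fields[-1]
        if state in UNRESOLVED_STATES:
            by_device[device].append(address)
    return dict(by_device)


# The neighbor table reported from node-6 in this comment.
SAMPLE = """\
192.168.0.3 dev br-mgmt INCOMPLETE
192.168.0.2 dev br-mgmt-hapr lladdr ce:a3:0d:ea:d8:53 REACHABLE
10.20.0.2 dev br-fw-admin lladdr 52:54:00:b3:76:00 REACHABLE
10.20.0.3 dev br-fw-admin lladdr a6:a1:2b:d6:9f:43 STALE
192.168.0.5 dev br-mgmt INCOMPLETE
192.168.1.2 dev br-storage FAILED
172.16.161.50 dev br-ex-hapr lladdr 66:d4:bc:40:b8:b9 REACHABLE
"""

if __name__ == "__main__":
    for device, addresses in unresolved_neighbors(SAMPLE).items():
        print(device, addresses)
```

On the sample, this reports 192.168.0.3 and 192.168.0.5 on br-mgmt and 192.168.1.2 on br-storage, and nothing on the admin or public bridges.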

Sergey Vasilenko (xenolog) wrote on 2014-08-21:

#3

Controller nodes can also communicate over the public network.

The admin and public networks are untagged; the management and storage networks are tagged.

This looks like a low-level network issue.

Sergey Vasilenko (xenolog) wrote on 2014-08-21:

#4

Screenshot 2014-08-21 21.47.10.png (1.2 MiB, image/png)

Eugene Nikanorov (enikanorov) wrote on 2014-08-22:

#5

Apparently the issue was caused by an incorrect interface configuration on the nodes.

The problem is that network verification passes as usual, giving no hint that something is wrong with the configuration.

Also, the resulting errors in the logs hardly give any meaningful hint about the issue.

I suggest lowering the importance and adding some verification steps to the deployment process.
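
One possible shape for such a verification step, as a rough sketch only: after the interfaces are configured, probe each peer controller on every network and report which networks fail. Nothing here is an existing Fuel check. The names `ping_once` and `check_peers` and the PEERS mapping are hypothetical, the addresses are node-5's as listed in comment #2, and the probe assumes the Linux iputils `ping` with its `-c` and `-W` options.

```python
import subprocess


def ping_once(address, timeout_s=1):
    """Return True if one ICMP echo to `address` succeeds.

    Assumption: Linux iputils `ping` (-c count, -W reply timeout in seconds).
    """
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), address],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def check_peers(peers, probe=ping_once):
    """Probe every peer address and return the failures.

    `peers` maps a network name to a list of peer addresses on it.
    The result is a list of (network, address) pairs that did not
    answer, in the order they were probed.
    """
    failures = []
    for network, addresses in peers.items():
        for address in addresses:
            if not probe(address):
                failures.append((network, address))
    return failures


# Hypothetical peer list as seen from node-6: node-5's addresses
# on each network, taken from the `ip -f inet a` output in comment #2.
PEERS = {
    "admin": ["10.20.0.3"],
    "public": ["172.16.161.51"],
    "management": ["192.168.0.3"],
    "storage": ["192.168.1.2"],
}

if __name__ == "__main__":
    for network, address in check_peers(PEERS):
        print("unreachable on %s network: %s" % (network, address))
```

The probe is passed in as a parameter so the check can be exercised without sending real traffic. In the state described in this bug, a check like this would be expected to flag the management and storage networks while the admin and public networks pass.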

Vladimir Kuklin (vkuklin) on 2014-08-22

Changed in fuel:
status:	New → Confirmed
status:	Confirmed → Invalid
