Ip segments lost when restart ovs-agent with openvswitch firewall

Bug #1822256 reported by Yang Li on 2019-03-29
This bug affects 2 people
Affects: neutron
Importance: High
Assigned to: Unassigned

Bug Description

environment:
linux version: Linux controller.novalocal 3.10.0-693.el7.x86_64 #1 SMP Tue Aug 22 21:09:27 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
OpenStack version: Rocky
network type: vxlan or vlan
firewall driver: openvswitch

1. Create two VMs (vm1, vm2) on different compute nodes (node-1, node-2), on the same network, with a security group that allows all TCP traffic.

2. Log in to vm2 and create a large file, for example:
vm2# dd if=/dev/zero of=/mnt/test.img bs=1G count=5

3. Log in to vm1 and scp the large file from vm2; once the scp transfer starts, go to step 4.
vm1# scp vm2-ip:/mnt/test.img /mnt

4. Log in to node-2 and restart neutron-openvswitch-agent; this refreshes all the OpenFlow rules on br-int.
node-2# systemctl restart neutron-openvswitch-agent

5. Log in to vm1; after several seconds you will find that the scp transfer has stalled.

After some investigation, I found that the OpenFlow refresh causes IP segments to be lost. When this happened, I captured packets with "tcpdump -i tap-xxx -w tmp.pcap", and in Wireshark I saw these errors:

192.168.100.19 192.168.100.5 SSH 16478 Server: [TCP ACKed unseen segment] [TCP Previous segment not captured] , Encrypted packet (len=16412)
192.168.100.19 192.168.100.5 SSH 8302 Server: [TCP ACKed unseen segment] , Encrypted packet (len=8236)
192.168.100.5 192.168.100.19 TCP 66 [TCP ACKed unseen segment] [TCP Previous segment not captured] 54354 → 22 [ACK] Seq=2509 Ack=600733 Win=16522 Len=0 TSval=2847412 TSecr=2851031
192.168.100.19 192.168.100.5 SSH 1464 Server: [TCP Spurious Retransmission] , Encrypted packet (len=1398)
192.168.100.5 192.168.100.19 TCP 78 [TCP Dup ACK 25182#1] 54354 → 22 [ACK] Seq=326305 Ack=67089901 Win=18494 Len=0 TSval=2849742 TSecr=2853310 SLE=67073429 SRE=67074827
192.168.100.5 192.168.100.19 TCP 110 [TCP Retransmission] 54354 → 22 [PSH, ACK] Seq=326173 Ack=67089901 Win=18494 Len=44 TSval=2849742 TSecr=2853310

192.168.100.19 192.168.100.5 TCP 1464 [TCP Retransmission] 22 → 54354 [ACK] Seq=70971905 Ack=346105 Win=2016 Len=1398 TSval=2853361 TSecr=2849691
192.168.100.19 192.168.100.5 TCP 1464 [TCP Retransmission] 22 → 54354 [ACK] Seq=70971905 Ack=346105 Win=2016 Len=1398 TSval=2853463 TSecr=2849691
192.168.100.19 192.168.100.5 TCP 1464 [TCP Retransmission] 22 → 54354 [ACK] Seq=70971905 Ack=346105 Win=2016 Len=1398 TSval=2854076 TSecr=2849691

I also checked the state of this TCP connection on both compute nodes; it is still ESTABLISHED.
# conntrack -L | grep 192.168.100.5
tcp 6 299 ESTABLISHED src=192.168.100.5 dst=192.168.100.19 sport=54356 dport=22 src=192.168.100.19 dst=192.168.100.5 sport=22 dport=54354 [ASSURED] mark=0 zone=4 use=1
# conntrack -L | grep 192.168.100.5
tcp 6 287 ESTABLISHED src=192.168.100.5 dst=192.168.100.19 sport=54356 dport=22 src=192.168.100.19 dst=192.168.100.5 sport=22 dport=54354 [ASSURED] mark=0 zone=1 use=1
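One detail worth noting in the output above is that the conntrack zone differs between the two nodes (zone=4 vs zone=1); the OVS firewall driver tracks connections per conntrack zone. A minimal Python sketch (my own parsing helper, not part of any neutron tooling) for pulling the state, endpoints, and zone out of one `conntrack -L` line:

```python
import re

def parse_conntrack(line):
    """Pull protocol state, endpoints and conntrack zone out of one `conntrack -L` line."""
    state = re.search(r"\b(ESTABLISHED|SYN_SENT|SYN_RECV|TIME_WAIT|CLOSE)\b", line)
    zone = re.search(r"zone=(\d+)", line)
    src = re.search(r"src=([\d.]+)", line)   # first src= is the original direction
    dst = re.search(r"dst=([\d.]+)", line)
    return {
        "state": state.group(1) if state else None,
        "zone": int(zone.group(1)) if zone else None,
        "src": src.group(1) if src else None,
        "dst": dst.group(1) if dst else None,
    }

# One of the conntrack lines from the report above.
line = ("tcp 6 299 ESTABLISHED src=192.168.100.5 dst=192.168.100.19 "
        "sport=54356 dport=22 src=192.168.100.19 dst=192.168.100.5 "
        "sport=22 dport=54354 [ASSURED] mark=0 zone=4 use=1")
print(parse_conntrack(line))
```

This makes it easy to diff the zone and state of the same flow before and after an agent restart.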

I have no idea why refreshing the OpenFlow rules causes IP segments to be lost; I hope someone has a way to solve this problem.

Yang Li (yang-li) wrote :

This problem can also be reproduced on a single compute node; different compute nodes are not required.
If it does not reproduce the first time, repeat step 4; after several attempts it will reproduce.

summary: - Ip fragments lost when restart ovs-agent with openvswitch firewall
+ Ip segments lost when restart ovs-agent with openvswitch firewall
description: updated
Bence Romsics (bence-romsics) wrote :

I think we had similar problems in the past (because all flows were cleared during agent restart), but that was supposed to be fixed. For example, I found this:

https://bugs.launchpad.net/neutron/+bug/1383674

tags: added: ovs ovs-fw
Bence Romsics (bence-romsics) wrote :

I have not had time to reproduce this yet, but if it is reproducible then I think it is quite important to fix, so I'm setting it to High.

Changed in neutron:
importance: Undecided → High
Yang Li (yang-li) wrote :

I thought this bug was the same as https://bugs.launchpad.net/neutron/+bug/1708731, but that bug only affects tunnel networks such as vxlan/gre, and there the connection state becomes invalid. That is different from the current bug, which can be reproduced on vxlan or vlan networks, even with both VMs on one compute node, and here the TCP connection state remains ESTABLISHED. This seems to be a new bug.

Yang Li (yang-li) wrote :

BTW, I could not reproduce this bug with the OVSHybridIptablesFirewallDriver firewall driver; it seems to affect only the openvswitch firewall driver.

Yang Li (yang-li) wrote :

I reproduced this bug again. On the server side I captured these packets: both 'seq 157061839:157068829' and 'seq 157061839:157063237' are present, starting at the same sequence number 157061839, which is abnormal.
16:11:36.795490 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 1312: 192.168.100.19.22 > 192.168.100.5.54367: Flags [P.], seq 157060593:157061839, ack 737639, win 2016, options [nop,nop,TS val 69340450 ecr 69335583], length 1246
16:11:36.796120 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 7056: 192.168.100.19.22 > 192.168.100.5.54367: Flags [.], seq 157061839:157068829, ack 737683, win 2016, options [nop,nop,TS val 69340450 ecr 69335583], length 6990
16:11:36.796153 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 1312: 192.168.100.19.22 > 192.168.100.5.54367: Flags [P.], seq 157068829:157070075, ack 737683, win 2016, options [nop,nop,TS val 69340450 ecr 69335583], length 1246
16:11:36.998982 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 1464: 192.168.100.19.22 > 192.168.100.5.54367: Flags [.], seq 157061839:157063237, ack 737683, win 2016, options [nop,nop,TS val 69340501 ecr 69335583], length 1398
16:11:37.407022 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 1464: 192.168.100.19.22 > 192.168.100.5.54367: Flags [.], seq 157061839:157063237, ack 737683, win 2016, options [nop,nop,TS val 69340603 ecr 69335583], length 1398
16:11:38.222904 fa:16:3e:7b:31:8b > fa:16:3e:7e:e8:5b, ethertype IPv4 (0x0800), length 1464: 192.168.100.19.22 > 192.168.100.5.54367: Flags [.], seq 157061839:157063237, ack 737683, win 2016, options [nop,nop,TS val 69340807 ecr 69335583], length 1398
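The overlapping ranges can be spotted mechanically. A minimal sketch, assuming tcpdump output in the format above (the regex and helper function are hypothetical, not from any existing tool), that flags segments whose sequence range overlaps data already seen, i.e. likely retransmissions:

```python
import re

# Match the "seq A:B" range in a tcpdump TCP line.
SEQ_RE = re.compile(r"seq (\d+):(\d+)")

def find_overlaps(lines):
    """Return (start, end) ranges that overlap a previously seen range,
    i.e. likely retransmissions of already-sent data."""
    seen = []
    overlaps = []
    for line in lines:
        m = SEQ_RE.search(line)
        if not m:
            continue
        start, end = int(m.group(1)), int(m.group(2))
        if any(start < e and end > s for s, e in seen):
            overlaps.append((start, end))
        seen.append((start, end))
    return overlaps

# Sequence ranges from the capture above: the 6990-byte segment is followed
# by retransmissions of just its first 1398 bytes.
capture = [
    "... seq 157060593:157061839, ack 737639 ...",
    "... seq 157061839:157068829, ack 737683 ...",
    "... seq 157068829:157070075, ack 737683 ...",
    "... seq 157061839:157063237, ack 737683 ...",
    "... seq 157061839:157063237, ack 737683 ...",
]
print(find_overlaps(capture))  # → [(157061839, 157063237), (157061839, 157063237)]
```

Only the two 1398-byte segments are reported, confirming they re-cover the front of the 6990-byte segment that was already sent.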

On the client side I captured these packets, which contain 'sack 1 {157061839:157063237}'; I think the SACK option caused the retransmission.
16:09:54.527451 fa:16:3e:7e:e8:5b > fa:16:3e:7b:31:8b, ethertype IPv4 (0x0800), length 78: 192.168.100.5.54367 > 192.168.100.19.22: Flags [.], ack 157070075, win 18486, options [nop,nop,TS val 69335634 ecr 69340450,nop,nop,sack 1 {157061839:157063237}], length 0
16:09:54.935973 fa:16:3e:7e:e8:5b > fa:16:3e:7b:31:8b, ethertype IPv4 (0x0800), length 78: 192.168.100.5.54367 > 192.168.100.19.22: Flags [.], ack 157070075, win 18486, options [nop,nop,TS val 69335736 ecr 69340450,nop,nop,sack 1 {157061839:157063237}], length 0
16:09:55.751463 fa:16:3e:7e:e8:5b > fa:16:3e:7b:31:8b, ethertype IPv4 (0x0800), length 78: 192.168.100.5.54367 > 192.168.100.19.22: Flags [.], ack 157070075, win 18486, options [nop,nop,TS val 69335940 ecr 69340450,nop,nop,sack 1 {157061839:157063237}], length 0
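This pattern matches D-SACK semantics (RFC 2883): a SACK block that lies entirely below the cumulative ACK tells the sender that the receiver already had that data, i.e. the retransmission carrying it was a duplicate. A small illustrative check (hypothetical helper, using the values from the client-side capture above):

```python
def classify_sack(ack, sack_block):
    """Classify a SACK block relative to the cumulative ACK.
    A block entirely below the ACK is a D-SACK (RFC 2883): the receiver
    already had that data, so the retransmission carrying it was spurious."""
    start, end = sack_block
    if end <= ack:
        return "D-SACK (duplicate data: retransmission was spurious)"
    return "SACK (hole above the cumulative ACK: data genuinely missing)"

# Values from the capture: ack 157070075, sack 1 {157061839:157063237}.
print(classify_sack(157070075, (157061839, 157063237)))
```

So the client is not asking for a retransmission here; it is reporting that the retransmitted 1398-byte segment duplicated data it had already received, which points at the original loss having happened somewhere else in the path.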

LIU Yulong (dragon889) wrote :

For the tap device, the local VLAN tag is stripped before packets are sent to it, so you cannot see the tag there.
And in your last comment (#6), you can see that none of your packets carry a VLAN tag.
Here are some example packets which have vlan id 205 captured by tcpdump:
14:53:54.915607 fe:ea:c8:20:fe:d0 > Broadcast, ethertype 802.1Q (0x8100), length 64: vlan 205, p 0, ethertype ARP, Request who-has xxx.xxx.xxx.xxx tell xxx.xxx.xxx.xxx, length 46
14:53:55.153421 fe:ea:c8:20:fe:d0 > Broadcast, ethertype 802.1Q (0x8100), length 64: vlan 205, p 0, ethertype ARP, Request who-has xxx.xxx.xxx.xxx tell xxx.xxx.xxx.xxx, length 46

And if you check the OF flows, you may see 'strip_vlan' actions like the following:

dl_vlan=34,dl_dst=fa:16:3e:64:9d:57 actions=strip_vlan,output:2235
dl_vlan=34,dl_dst=fa:16:3e:64:9d:57 actions=load:0x8bb->NXM_NX_REG5[],load:0x22->NXM_NX_REG6[],strip_vlan,resubmit(,81)

ovs-ofctl show br-int|grep tapbad2bb56-13
2235(tapbad2bb56-13): addr:fe:16:3e:64:9d:57

As for the "TCP Retransmission", the causes can vary. I would guess that even without restarting the ovs-agent, you would see "TCP Retransmission" if the network were congested.

So, I don't think this is an issue.

Changed in neutron:
status: New → Invalid
Yang Li (yang-li) wrote :

In my test, if I don't restart the ovs-agent, the "TCP Retransmission" does not happen and the scp transfer completes normally. Once I restart the ovs-agent, the "TCP Retransmission" starts about 10 seconds later. So it does not seem to be caused by network congestion; there is a clear correlation between the scp stall and the ovs-agent restart.
