Networking hangs on azure using hv_netvsc; bisected
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
Fix Released
|
High
|
Unassigned | ||
Vivid |
Fix Released
|
High
|
Unassigned |
Bug Description
Running Ubuntu instances on azure, testing basic networking between two instances. This involves configuring VXLAN between the two instances and running iperf and rsync of the kernel tree between the instances, e.g.,
ip link add vxlan0 type vxlan id 999 local 10.88.0.12 remote 10.88.0.11 dev eth0
ip l set vxlan0 up
ip addr add 242.0.0.12/8 dev vxlan0
After some time (sometimes instantly, sometimes up to 30 minutes of activity), the networking will hang. This hang takes two forms: a complete loss of connectivity (all network, even the ssh session used to log in), or just a loss of connectivity between instances (the ssh session remains active). Sometimes for the latter case, the ssh session will then later hang.
This first appeared when testing with the Ubuntu 3.19 kernel, and I subsequently bisected this to:
commit effa2012d207f78
Author: KY Srinivasan <email address hidden>
Date: Mon May 11 15:39:46 2015 -0700
hv_netvsc: Use the xmit_more skb flag to optimize signaling the host
BugLink: http://
Based on the information given to this driver (via the xmit_more skb flag),
we can defer signaling the host if more packets are on the way. This will help
make the host more efficient since it can potentially process a larger batch of
packets. Implement this optimization.
Signed-off-by: K. Y. Srinivasan <email address hidden>
Signed-off-by: David S. Miller <email address hidden>
Acked-by: Tim Gardner <email address hidden>
Acked-by: Brad Figg <email address hidden>
Signed-off-by: Brad Figg <email address hidden>
I also tested the mainline kernel (net-next); it fails with the equivalent commit:
commit 82fa3c776e5abba
Author: KY Srinivasan <email address hidden>
Date: Mon May 11 15:39:46 2015 -0700
hv_netvsc: Use the xmit_more skb flag to optimize signaling the host
For both kernel trees, I also tested the prior commit and it did not
exhibit the failure after many hours. For ubuntu, this was
commit a4aeb290bd75af5
Author: K. Y. Srinivasan <email address hidden>
Date: Wed Mar 18 12:29:29 2015 -0700
Drivers: hv: vmbus: Export the vmbus_sendpacke
and for mainline it was
commit 9eea92226407e7a
Author: Alexei Starovoitov <email address hidden>
Date: Mon May 11 15:19:48 2015 -0700
pktgen: fix packet generation
tags: | added: bisect-done |
tags: | added: kernel-bug-exists-upstream |
Changed in linux (Ubuntu): | |
importance: | Undecided → High |
tags: | added: kernel-da-key kernel-hyper-v |
Changed in linux (Ubuntu): | |
status: | Incomplete → Triaged |
Changed in linux (Ubuntu Vivid): | |
status: | New → Fix Committed |
Changed in linux (Ubuntu): | |
status: | Triaged → Fix Released |
Changed in linux (Ubuntu Vivid): | |
importance: | Undecided → High |
This bug is missing log files that will aid in diagnosing the problem. From a terminal window please run:
apport-collect 1508706
and then change the status of the bug to 'Confirmed'.
If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.
This change has been made by an automated script, maintained by the Ubuntu Kernel Team.