Activity log for bug #1642763

Date Who What changed Old value New value Message
2016-11-17 22:29:18 bugproxy bug added bug
2016-11-17 22:29:20 bugproxy tags architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin---
2016-11-17 22:29:21 bugproxy attachment added keepalived deb package with fix. https://bugs.launchpad.net/bugs/1642763/+attachment/4778870/+files/keepalived_1.2.19-1test_ppc64el.deb
2016-11-17 22:29:23 bugproxy attachment added Upstream patch ported to keepalived 1.2.19 (found in Ubuntu 16.04) https://bugs.launchpad.net/bugs/1642763/+attachment/4778872/+files/Netlink-error-message-truncated-messages-V1.2.19.patch
2016-11-17 22:29:25 bugproxy ubuntu: assignee Taco Screen team (taco-screen-team)
2016-11-17 22:29:34 bugproxy affects ubuntu keepalived (Ubuntu)
2016-11-17 22:41:39 Jon Grimm bug added subscriber Jon Grimm
2016-11-17 22:46:07 Jon Grimm keepalived (Ubuntu): status New Triaged
2016-11-17 22:46:40 Jon Grimm bug added subscriber Ubuntu Server Team
2016-11-18 08:22:22 Christian Ehrhardt  keepalived (Ubuntu): importance Undecided Medium
2016-11-29 16:37:05 Jon Grimm keepalived (Ubuntu): assignee Taco Screen team (taco-screen-team) Nish Aravamudan (nacc)
2016-11-29 16:42:51 Mark Brown bug added subscriber Mark Brown
2016-11-29 17:01:34 Nish Aravamudan nominated for series Ubuntu Yakkety
2016-11-29 17:01:34 Nish Aravamudan bug task added keepalived (Ubuntu Yakkety)
2016-11-29 17:01:34 Nish Aravamudan nominated for series Ubuntu Xenial
2016-11-29 17:01:34 Nish Aravamudan bug task added keepalived (Ubuntu Xenial)
2016-11-29 17:34:03 Nish Aravamudan keepalived (Ubuntu): status Triaged In Progress
2016-11-29 17:34:05 Nish Aravamudan keepalived (Ubuntu Xenial): status New In Progress
2016-11-29 17:34:08 Nish Aravamudan keepalived (Ubuntu Yakkety): status New In Progress
2016-11-29 17:34:10 Nish Aravamudan keepalived (Ubuntu Xenial): assignee Nish Aravamudan (nacc)
2016-11-29 17:34:11 Nish Aravamudan keepalived (Ubuntu Yakkety): assignee Nish Aravamudan (nacc)
2016-11-29 18:09:10 Nish Aravamudan keepalived (Ubuntu): status In Progress Fix Committed
2016-11-29 18:50:34 Nish Aravamudan description == Comment: #0 - Andrew Thorstensen - 2016-11-17 09:50:25 == ---Problem Description--- Using Ubuntu 16.04 on ppc64le, we are building a 'neutron network node' using the VRRP configuration (built on keepalived). Information on this OpenStack configuration can be found here: https://wiki.openstack.org/wiki/Neutron/L3_High_Availability_VRRP When we run, the configuration is failing to apply via keepalived. The logs post the following: Nov 17 02:58:31 p8test-lp1 Keepalived_vrrp[54542]: VRRP is trying to assign VIP to unknown qr-a5f5ba96-52 interface !!! go out and fix your conf !!! However, the device DOES exist. But the keepalived config just doesn't always deploy it. ii keepalived 1:1.2.19-1 ppc64el Failover and monitoring daemon for LVS clusters This configuration sometimes works, but does sometimes fail on Ubuntu 16.04.1 ---uname output--- Linux p8test-lp1 4.4.0-47-generic #68-Ubuntu SMP Wed Oct 26 19:38:24 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- This is a Power8 system with Ubuntu 16.04.1 installed. Though we see no indication that this is specific to Power. Machine Type = S822L Machine Type = 8286-42A ---Steps to Reproduce--- Install openstack. Run the network node in a VRRP HA configuration. Create a router and assign a global IP. == Comment: #5 - David J. Wilder - 2016-11-17 15:58:04 == The problem is fixed in this upstream commit: https://github.com/acassen/keepalived/commit/9f327bbf3e86def1055a106eda0633638bda0345 On systems with a page size larger than 4096 keepalived may report: "Netlink: error: message truncated" messages This error was reported on a ppc64le in an OpenStack/Nutron environment. Ppc64le is using a 64k pages size. I found that keepalived's netlink recvmsg buffer was too small causing messages to be truncated. The size of the read buffer for the netlink socket should be based on page size however, it should not exceed 8192. See the comment in the patch. I tested the fix by creating 100 veth interfaces and verifying the errors did not return. Signed-off-by: David Wilder <dwilder@us.ibm.com> Signed-off-by: Quentin Armitage <quentin@armitage.org.uk> ... [Impact] * keepalived on ppc64el (due to a large page size) experiences "Netlink: error: message truncated" messages. * These Netlink truncations result in keepalived thinking that the the underlying device does not exist, even though it does. [Test Case] * Creating 100 veth interfaces ppc64el and verify if "Netlink: error: message truncated" errors are emitted. If so, the bug is present. If not, the bug is fixed. [Regression Potential] * This is code issue, fixed upstream, in the keepalived code when the system page size exceeds 4096. The upstream fix was backported to all releases and should only properly limit the size of the buffer used for netlink to at most 8192 on systems with a page size greater than 8192. I believe risk of regression is very low. --- == Comment: #0 - Andrew Thorstensen - 2016-11-17 09:50:25 == ---Problem Description--- Using Ubuntu 16.04 on ppc64le, we are building a 'neutron network node' using the VRRP configuration (built on keepalived). Information on this OpenStack configuration can be found here: https://wiki.openstack.org/wiki/Neutron/L3_High_Availability_VRRP When we run, the configuration is failing to apply via keepalived. The logs post the following: Nov 17 02:58:31 p8test-lp1 Keepalived_vrrp[54542]: VRRP is trying to assign VIP to unknown qr-a5f5ba96-52 interface !!! go out and fix your conf !!! However, the device DOES exist. But the keepalived config just doesn't always deploy it. ii keepalived 1:1.2.19-1 ppc64el Failover and monitoring daemon for LVS clusters This configuration sometimes works, but does sometimes fail on Ubuntu 16.04.1 ---uname output--- Linux p8test-lp1 4.4.0-47-generic #68-Ubuntu SMP Wed Oct 26 19:38:24 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- This is a Power8 system with Ubuntu 16.04.1 installed. Though we see no indication that this is specific to Power. Machine Type = S822L Machine Type = 8286-42A ---Steps to Reproduce---  Install openstack. Run the network node in a VRRP HA configuration. Create a router and assign a global IP. == Comment: #5 - David J. Wilder - 2016-11-17 15:58:04 == The problem is fixed in this upstream commit: https://github.com/acassen/keepalived/commit/9f327bbf3e86def1055a106eda0633638bda0345 On systems with a page size larger than 4096 keepalived may report: "Netlink: error: message truncated" messages This error was reported on a ppc64le in an OpenStack/Nutron environment. Ppc64le is using a 64k pages size. I found that keepalived's netlink recvmsg buffer was too small causing messages to be truncated. The size of the read buffer for the netlink socket should be based on page size however, it should not exceed 8192. See the comment in the patch. I tested the fix by creating 100 veth interfaces and verifying the errors did not return. Signed-off-by: David Wilder <dwilder@us.ibm.com> Signed-off-by: Quentin Armitage <quentin@armitage.org.uk> ...
2016-11-29 22:13:31 Nish Aravamudan description [Impact] * keepalived on ppc64el (due to a large page size) experiences "Netlink: error: message truncated" messages. * These Netlink truncations result in keepalived thinking that the the underlying device does not exist, even though it does. [Test Case] * Creating 100 veth interfaces ppc64el and verify if "Netlink: error: message truncated" errors are emitted. If so, the bug is present. If not, the bug is fixed. [Regression Potential] * This is code issue, fixed upstream, in the keepalived code when the system page size exceeds 4096. The upstream fix was backported to all releases and should only properly limit the size of the buffer used for netlink to at most 8192 on systems with a page size greater than 8192. I believe risk of regression is very low. --- == Comment: #0 - Andrew Thorstensen - 2016-11-17 09:50:25 == ---Problem Description--- Using Ubuntu 16.04 on ppc64le, we are building a 'neutron network node' using the VRRP configuration (built on keepalived). Information on this OpenStack configuration can be found here: https://wiki.openstack.org/wiki/Neutron/L3_High_Availability_VRRP When we run, the configuration is failing to apply via keepalived. The logs post the following: Nov 17 02:58:31 p8test-lp1 Keepalived_vrrp[54542]: VRRP is trying to assign VIP to unknown qr-a5f5ba96-52 interface !!! go out and fix your conf !!! However, the device DOES exist. But the keepalived config just doesn't always deploy it. ii keepalived 1:1.2.19-1 ppc64el Failover and monitoring daemon for LVS clusters This configuration sometimes works, but does sometimes fail on Ubuntu 16.04.1 ---uname output--- Linux p8test-lp1 4.4.0-47-generic #68-Ubuntu SMP Wed Oct 26 19:38:24 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- This is a Power8 system with Ubuntu 16.04.1 installed. Though we see no indication that this is specific to Power. Machine Type = S822L Machine Type = 8286-42A ---Steps to Reproduce---  Install openstack. Run the network node in a VRRP HA configuration. Create a router and assign a global IP. == Comment: #5 - David J. Wilder - 2016-11-17 15:58:04 == The problem is fixed in this upstream commit: https://github.com/acassen/keepalived/commit/9f327bbf3e86def1055a106eda0633638bda0345 On systems with a page size larger than 4096 keepalived may report: "Netlink: error: message truncated" messages This error was reported on a ppc64le in an OpenStack/Nutron environment. Ppc64le is using a 64k pages size. I found that keepalived's netlink recvmsg buffer was too small causing messages to be truncated. The size of the read buffer for the netlink socket should be based on page size however, it should not exceed 8192. See the comment in the patch. I tested the fix by creating 100 veth interfaces and verifying the errors did not return. Signed-off-by: David Wilder <dwilder@us.ibm.com> Signed-off-by: Quentin Armitage <quentin@armitage.org.uk> ... [Impact]  * keepalived on ppc64el (due to a large page size) experiences "Netlink: error: message truncated" messages.  * These Netlink truncations result in keepalived thinking that the the underlying device does not exist, even though it does. [Test Case]  * Creating 100 veth interfaces ppc64el and verify if "Netlink: error: message truncated" errors are emitted. If so, the bug is present. If not, the bug is fixed. [Regression Potential]  * This is code issue, fixed upstream, in the keepalived code when the system page size exceeds 4096. The upstream fix was backported to all releases and should only properly limit the size of the buffer used for netlink to at most 8192 on systems with a page size greater than 8192. I believe risk of regression is very low. * Using the tests provided by David Wilder, I ran on both x86_64 and ppc64el LXD containers. Without the backported changes, I saw no issues on x86_64, and the reported issue on ppc64el (as expected, as a page size greater than 4K is required to see the buffer size issue). With the backported changes, both architectures show no issue with the provided testcase. --- == Comment: #0 - Andrew Thorstensen - 2016-11-17 09:50:25 == ---Problem Description--- Using Ubuntu 16.04 on ppc64le, we are building a 'neutron network node' using the VRRP configuration (built on keepalived). Information on this OpenStack configuration can be found here: https://wiki.openstack.org/wiki/Neutron/L3_High_Availability_VRRP When we run, the configuration is failing to apply via keepalived. The logs post the following: Nov 17 02:58:31 p8test-lp1 Keepalived_vrrp[54542]: VRRP is trying to assign VIP to unknown qr-a5f5ba96-52 interface !!! go out and fix your conf !!! However, the device DOES exist. But the keepalived config just doesn't always deploy it. ii keepalived 1:1.2.19-1 ppc64el Failover and monitoring daemon for LVS clusters This configuration sometimes works, but does sometimes fail on Ubuntu 16.04.1 ---uname output--- Linux p8test-lp1 4.4.0-47-generic #68-Ubuntu SMP Wed Oct 26 19:38:24 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux ---Additional Hardware Info--- This is a Power8 system with Ubuntu 16.04.1 installed. Though we see no indication that this is specific to Power. Machine Type = S822L Machine Type = 8286-42A ---Steps to Reproduce---  Install openstack. Run the network node in a VRRP HA configuration. Create a router and assign a global IP. == Comment: #5 - David J. Wilder - 2016-11-17 15:58:04 == The problem is fixed in this upstream commit: https://github.com/acassen/keepalived/commit/9f327bbf3e86def1055a106eda0633638bda0345 On systems with a page size larger than 4096 keepalived may report: "Netlink: error: message truncated" messages This error was reported on a ppc64le in an OpenStack/Nutron environment. Ppc64le is using a 64k pages size. I found that keepalived's netlink recvmsg buffer was too small causing messages to be truncated. The size of the read buffer for the netlink socket should be based on page size however, it should not exceed 8192. See the comment in the patch. I tested the fix by creating 100 veth interfaces and verifying the errors did not return. Signed-off-by: David Wilder <dwilder@us.ibm.com> Signed-off-by: Quentin Armitage <quentin@armitage.org.uk> ...
2016-11-30 14:43:41 Ravi Gummadi bug added subscriber Ravi Gummadi
2016-12-01 01:09:19 Launchpad Janitor keepalived (Ubuntu): status Fix Committed Fix Released
2016-12-01 18:34:37 Brian Murray keepalived (Ubuntu Yakkety): status In Progress Fix Committed
2016-12-01 18:34:38 Brian Murray bug added subscriber Ubuntu Stable Release Updates Team
2016-12-01 18:34:40 Brian Murray bug added subscriber SRU Verification
2016-12-01 18:34:46 Brian Murray tags architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin--- architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin--- verification-needed
2016-12-01 18:36:18 Brian Murray keepalived (Ubuntu Xenial): status In Progress Fix Committed
2016-12-01 19:56:46 Kyle L. Henderson bug added subscriber Kyle L. Henderson
2016-12-02 16:59:40 bugproxy tags architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin--- verification-needed architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin16041 verification-done
2016-12-12 22:06:24 Brian Murray tags architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin16041 verification-done architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin16041 verification-done-xenial verification-needed
2016-12-12 22:08:01 Launchpad Janitor keepalived (Ubuntu Xenial): status Fix Committed Fix Released
2016-12-15 00:57:30 Nish Aravamudan tags architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin16041 verification-done-xenial verification-needed architecture-ppc64le bugnameltc-148871 severity-critical targetmilestone-inin16041 verification-done-xenial verification-done-yakkety
2016-12-15 16:14:33 Brian Murray removed subscriber Ubuntu Stable Release Updates Team
2016-12-15 16:14:46 Launchpad Janitor keepalived (Ubuntu Yakkety): status Fix Committed Fix Released
2016-12-15 16:44:14 Nish Aravamudan keepalived (Ubuntu): assignee Nish Aravamudan (nacc)
2016-12-15 16:44:16 Nish Aravamudan keepalived (Ubuntu Xenial): assignee Nish Aravamudan (nacc)
2016-12-15 16:44:18 Nish Aravamudan keepalived (Ubuntu Yakkety): assignee Nish Aravamudan (nacc)