Activity log for bug #1598229

Date Who What changed Old value New value Message
2016-07-01 16:04:22 Richard bug added bug
2016-07-01 16:05:48 Richard description Bug Description: Encountered a memory leak with corosync on all three nodes in a cluster: Jun 13 20:36:35 XXXXXXXXX1 kernel: [929808.525991] Out of memory: Kill process 4846 (corosync) score 941 or sacrifice child Jun 13 20:36:35 XXXXXXXXX1 kernel: [929808.620411] Killed process 4846 (corosync) total-vm:267928256kB, anon-rss:257475632kB, file-rss:37816kB Jun 29 02:26:17 XXXXXXXXX1 kernel: [2247790.069557] Out of memory: Kill process 27791 (corosync) score 938 or sacrifice child Jun 29 02:26:17 XXXXXXXXX1 kernel: [2247790.166524] Killed process 27791 (corosync) total-vm:265216168kB, anon-rss:255941644kB, file-rss:28580kB Jun 14 14:00:03 XXXXXXXXX2 kernel: [993027.615377] Out of memory: Kill process 5167 (corosync) score 943 or sacrifice child Jun 14 14:00:03 XXXXXXXXX2 kernel: [993027.709419] Killed process 5167 (corosync) total-vm:265023016kB, anon-rss:256668244kB, file-rss:33844kB Jun 28 22:56:30 XXXXXXXXX2 kernel: [2235753.617203] Out of memory: Kill process 27073 (corosync) score 941 or sacrifice child Jun 28 22:56:30 XXXXXXXXX2 kernel: [2235753.713521] Killed process 27073 (corosync) total-vm:261875792kB, anon-rss:255939160kB, file-rss:24760kB Mar 21 22:19:17 XXXXXXXXX2 kernel: [956727.096937] Out of memory: Kill process 5422 (corosync) score 942 or sacrifice child Mar 21 22:19:17 XXXXXXXXX2 kernel: [956727.191025] Killed process 5422 (corosync) total-vm:264643868kB, anon-rss:256189360kB, file-rss:33976kB Apr 26 00:30:04 XXXXXXXXX2 kernel: [1017203.359940] Out of memory: Kill process 5183 (corosync) score 927 or sacrifice child Apr 26 00:30:04 XXXXXXXXX2 kernel: [1017203.455015] Killed process 5183 (corosync) total-vm:271136904kB, anon-rss:251953372kB, file-rss:33760kB Jun 29 09:00:02 XXXXXXXXX3 kernel: [2276334.347836] Out of memory: Kill process 24183 (corosync) score 937 or sacrifice child Jun 29 09:00:02 XXXXXXXXX3 kernel: [2276334.444000] Killed process 24183 (corosync) total-vm:270476488kB, anon-rss:255257476kB, file-rss:32248kB Mar 22 04:58:18 XXXXXXXXX3 kernel: [979377.041372] Out of memory: Kill process 5088 (corosync) score 941 or sacrifice child Mar 22 04:58:18 XXXXXXXXX3 kernel: [979377.135414] Killed process 5088 (corosync) total-vm:265582012kB, anon-rss:255851792kB, file-rss:36000kB Apr 26 09:26:02 XXXXXXXXX3 kernel: [1014911.175029] Out of memory: Kill process 5255 (corosync) score 925 or sacrifice child Apr 26 09:26:02 XXXXXXXXX3 kernel: [1014911.270203] Killed process 5255 (corosync) total-vm:269154272kB, anon-rss:251736288kB, file-rss:35740kB Jun 13 22:46:23 XXXXXXXXX3 kernel: [942502.987771] Out of memory: Kill process 5230 (corosync) score 940 or sacrifice child Jun 13 22:46:23 XXXXXXXXX3 kernel: [942503.081826] Killed process 5230 (corosync) total-vm:265560916kB, anon-rss:256339740kB, file-rss:33788kB The memory leak was confirmed through an analysis of atop logs where it was observed that memory utilization by corosync would go from 47% to 97% over the course of several days before corosync was then killed. The are many memory leaks identified for the current version of corosync in MOS6.1 # dpkg -l | grep corosync ii corosync 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework (daemon and modules) ii libcorosync-common4 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework, common library Steps to reproduce: Unsure how to reproduce at this point, as logging is not detailed enough. Expected results: Impact: corosync has crashed relatively frequently on all three nodes, however Environment description: - Operation system: Ubuntu 14.04.2 LTS - 3.13.0-61-generic - Versions of components: # dpkg -l | egrep 'corosync|pacemaker' ii corosync 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework (daemon and modules) ii crmsh 2.1.0-1~u14.04+mos1 all CRM shell for the pacemaker cluster manager ii libcorosync-common4 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework, common library ii pacemaker 1.1.12-0u~u14.04+mos6.1 amd64 HA cluster resource manager ii pacemaker-cli-utils 1.1.12-0u~u14.04+mos6.1 amd64 Command line interface utilities for Pacemaker # uname -r 3.13.0-61-generic - Reference architecture: MOS6.1 - unable to provide more information due to restrictions, but at scale - Network model: Neutron+GRE+vlan - Related projects installed: N/A Bug Description: Encountered a memory leak with corosync on all three nodes in a cluster: Jun 13 20:36:35 XXXXXXXXX1 kernel: [929808.525991] Out of memory: Kill process 4846 (corosync) score 941 or sacrifice child Jun 13 20:36:35 XXXXXXXXX1 kernel: [929808.620411] Killed process 4846 (corosync) total-vm:267928256kB, anon-rss:257475632kB, file-rss:37816kB Jun 29 02:26:17 XXXXXXXXX1 kernel: [2247790.069557] Out of memory: Kill process 27791 (corosync) score 938 or sacrifice child Jun 29 02:26:17 XXXXXXXXX1 kernel: [2247790.166524] Killed process 27791 (corosync) total-vm:265216168kB, anon-rss:255941644kB, file-rss:28580kB Jun 14 14:00:03 XXXXXXXXX2 kernel: [993027.615377] Out of memory: Kill process 5167 (corosync) score 943 or sacrifice child Jun 14 14:00:03 XXXXXXXXX2 kernel: [993027.709419] Killed process 5167 (corosync) total-vm:265023016kB, anon-rss:256668244kB, file-rss:33844kB Jun 28 22:56:30 XXXXXXXXX2 kernel: [2235753.617203] Out of memory: Kill process 27073 (corosync) score 941 or sacrifice child Jun 28 22:56:30 XXXXXXXXX2 kernel: [2235753.713521] Killed process 27073 (corosync) total-vm:261875792kB, anon-rss:255939160kB, file-rss:24760kB Mar 21 22:19:17 XXXXXXXXX2 kernel: [956727.096937] Out of memory: Kill process 5422 (corosync) score 942 or sacrifice child Mar 21 22:19:17 XXXXXXXXX2 kernel: [956727.191025] Killed process 5422 (corosync) total-vm:264643868kB, anon-rss:256189360kB, file-rss:33976kB Apr 26 00:30:04 XXXXXXXXX2 kernel: [1017203.359940] Out of memory: Kill process 5183 (corosync) score 927 or sacrifice child Apr 26 00:30:04 XXXXXXXXX2 kernel: [1017203.455015] Killed process 5183 (corosync) total-vm:271136904kB, anon-rss:251953372kB, file-rss:33760kB Jun 29 09:00:02 XXXXXXXXX3 kernel: [2276334.347836] Out of memory: Kill process 24183 (corosync) score 937 or sacrifice child Jun 29 09:00:02 XXXXXXXXX3 kernel: [2276334.444000] Killed process 24183 (corosync) total-vm:270476488kB, anon-rss:255257476kB, file-rss:32248kB Mar 22 04:58:18 XXXXXXXXX3 kernel: [979377.041372] Out of memory: Kill process 5088 (corosync) score 941 or sacrifice child Mar 22 04:58:18 XXXXXXXXX3 kernel: [979377.135414] Killed process 5088 (corosync) total-vm:265582012kB, anon-rss:255851792kB, file-rss:36000kB Apr 26 09:26:02 XXXXXXXXX3 kernel: [1014911.175029] Out of memory: Kill process 5255 (corosync) score 925 or sacrifice child Apr 26 09:26:02 XXXXXXXXX3 kernel: [1014911.270203] Killed process 5255 (corosync) total-vm:269154272kB, anon-rss:251736288kB, file-rss:35740kB Jun 13 22:46:23 XXXXXXXXX3 kernel: [942502.987771] Out of memory: Kill process 5230 (corosync) score 940 or sacrifice child Jun 13 22:46:23 XXXXXXXXX3 kernel: [942503.081826] Killed process 5230 (corosync) total-vm:265560916kB, anon-rss:256339740kB, file-rss:33788kB The memory leak was confirmed through an analysis of atop logs where it was observed that memory utilization by corosync would go from 47% to 97% over the course of several days before corosync was then killed. The are many memory leaks identified for the current version of corosync in MOS6.1 # dpkg -l | grep corosync ii corosync 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework (daemon and modules) ii libcorosync-common4 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework, common library Steps to reproduce: Unsure how to reproduce at this point, as logging is not detailed enough. Will enable debug when possible. Expected results: Impact: corosync has crashed relatively frequently on all three nodes, however unsure if this has occurred in other zones. Environment description: - Operation system: Ubuntu 14.04.2 LTS - 3.13.0-61-generic - Versions of components: # dpkg -l | egrep 'corosync|pacemaker' ii corosync 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework (daemon and modules) ii crmsh 2.1.0-1~u14.04+mos1 all CRM shell for the pacemaker cluster manager ii libcorosync-common4 2.3.4-0u~u14.04+mos1 amd64 Standards-based cluster framework, common library ii pacemaker 1.1.12-0u~u14.04+mos6.1 amd64 HA cluster resource manager ii pacemaker-cli-utils 1.1.12-0u~u14.04+mos6.1 amd64 Command line interface utilities for Pacemaker # uname -r 3.13.0-61-generic - Reference architecture: MOS6.1 - unable to provide more information due to restrictions, but at scale - Network model: Neutron+GRE+vlan - Related projects installed: N/A
2016-07-01 16:16:23 Denis Meltsaykin tags customer-found
2016-07-01 16:20:17 Denis Meltsaykin nominated for series mos/9.x
2016-07-01 16:20:17 Denis Meltsaykin bug task added mos/9.x
2016-07-01 16:20:17 Denis Meltsaykin nominated for series mos/7.0.x
2016-07-01 16:20:17 Denis Meltsaykin bug task added mos/7.0.x
2016-07-01 16:20:17 Denis Meltsaykin nominated for series mos/6.1.x
2016-07-01 16:20:17 Denis Meltsaykin bug task added mos/6.1.x
2016-07-01 16:20:17 Denis Meltsaykin nominated for series mos/8.0.x
2016-07-01 16:20:17 Denis Meltsaykin bug task added mos/8.0.x
2016-07-04 12:18:21 Dina Belova mos/6.1.x: assignee MOS Linux (mos-linux)
2016-07-04 12:18:29 Dina Belova mos/7.0.x: assignee MOS Linux (mos-linux)
2016-07-04 12:18:36 Dina Belova mos/8.0.x: assignee MOS Linux (mos-linux)
2016-07-04 12:18:40 Dina Belova mos/9.x: assignee MOS Linux (mos-linux)
2016-07-04 12:19:05 Dina Belova mos/6.1.x: milestone 6.1-updates
2016-07-04 12:19:07 Dina Belova mos/7.0.x: milestone 7.0-updates
2016-07-04 12:19:10 Dina Belova mos/8.0.x: milestone 8.0-updates
2016-07-04 12:19:14 Dina Belova mos/9.x: milestone 9.1
2016-07-04 12:19:29 Dina Belova mos/6.1.x: importance Undecided High
2016-07-04 12:19:30 Dina Belova mos/7.0.x: importance Undecided High
2016-07-04 12:19:32 Dina Belova mos/8.0.x: importance Undecided High
2016-07-04 12:19:33 Dina Belova mos/9.x: importance Undecided High
2016-07-04 12:19:46 Dina Belova mos/6.1.x: status New Confirmed
2016-07-04 12:19:50 Dina Belova mos/8.0.x: status New Confirmed
2016-07-04 12:19:52 Dina Belova mos/9.x: status New Confirmed
2016-07-04 12:19:54 Dina Belova mos/7.0.x: status New Confirmed
2016-08-25 17:21:51 Dmitry Teselkin mos/9.x: status Confirmed Fix Committed
2016-08-25 17:22:25 Dmitry Teselkin mos/8.0.x: assignee MOS Linux (mos-linux) MOS Maintenance (mos-maintenance)
2016-08-25 17:27:43 Dmitry Teselkin mos/7.0.x: assignee MOS Linux (mos-linux) MOS Maintenance (mos-maintenance)
2016-08-25 17:27:51 Dmitry Teselkin mos/6.1.x: assignee MOS Linux (mos-linux) MOS Maintenance (mos-maintenance)
2016-08-30 15:01:46 Sergii Rizvan mos/8.0.x: assignee MOS Maintenance (mos-maintenance) Sergii Rizvan (srizvan)
2016-08-30 15:01:59 Sergii Rizvan mos/7.0.x: assignee MOS Maintenance (mos-maintenance) Sergii Rizvan (srizvan)
2016-09-02 08:36:10 Sergii Turivnyi tags customer-found customer-found on-verification
2016-09-05 12:19:51 Timur Nurlygayanov mos/9.x: status Fix Committed Fix Released
2016-09-09 10:16:39 Sergii Rizvan mos/7.0.x: status Confirmed In Progress
2016-09-09 10:16:41 Sergii Rizvan mos/8.0.x: status Confirmed In Progress
2016-09-12 10:24:17 Sergii Rizvan mos/7.0.x: milestone 7.0-updates 7.0-mu-6
2016-09-12 10:24:22 Sergii Rizvan mos/8.0.x: milestone 8.0-updates 8.0-mu-4
2016-09-12 10:33:04 Sergii Rizvan mos/8.0.x: status In Progress Fix Committed
2016-09-12 10:33:07 Sergii Rizvan mos/7.0.x: status In Progress Fix Committed
2016-09-15 16:25:43 Sergii Rizvan mos/7.0.x: status Fix Committed In Progress
2016-09-15 16:25:45 Sergii Rizvan mos/8.0.x: status Fix Committed In Progress
2016-10-18 06:38:05 Denis Meltsaykin mos/7.0.x: status In Progress Fix Committed
2016-10-18 11:08:05 Sergii Rizvan mos/8.0.x: status In Progress Fix Committed
2016-10-18 12:10:47 Sergii Rizvan mos/6.1.x: status Confirmed Won't Fix
2016-10-20 17:43:58 TatyanaGladysheva mos/7.0.x: status Fix Committed Fix Released
2017-02-14 07:57:10 TatyanaGladysheva mos/8.0.x: status Fix Committed Fix Released