Activity log for bug #1920944

Date Who What changed Old value New value Message
2021-03-23 15:31:19 Andrea Righi bug added bug
2021-03-23 15:31:38 Andrea Righi nominated for series Ubuntu Focal
2021-03-23 15:31:38 Andrea Righi bug task added linux-aws (Ubuntu Focal)
2021-03-23 16:14:14 Andrea Righi description [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch: "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 [Test plan] Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging. [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 [Test plan] Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging.
2021-05-18 15:17:06 Andrea Righi description [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 [Test plan] Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33 The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging. [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ [Test plan] Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging.
2021-05-20 13:23:11 Andrea Righi nominated for series Ubuntu Groovy
2021-05-20 13:23:11 Andrea Righi bug task added linux-aws (Ubuntu Groovy)
2021-05-20 13:23:11 Andrea Righi nominated for series Ubuntu Hirsute
2021-05-20 13:23:11 Andrea Righi bug task added linux-aws (Ubuntu Hirsute)
2021-05-20 13:23:27 Andrea Righi bug task added linux (Ubuntu)
2021-05-20 13:28:12 Andrea Righi description [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ [Test plan] Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging. [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with hypervisor and when we restore from hibernation we must properly tear down all these features to make sure hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel. For this reason it is safe to apply this patch set also to the other generic kernels and not just AWS. [Test plan] This can be easily tested on AWS (but it should be reproduced by hibernating any kvm instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ For the other kernels, simply apply this patch set. The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging.
2021-05-20 13:28:19 Andrea Righi summary aws: proper fix for c5.18xlarge hibernation issues properly tear down KVM PV features on hibernate
2021-05-20 13:30:09 Ubuntu Kernel Bot linux (Ubuntu): status New Incomplete
2021-05-20 13:30:14 Ubuntu Kernel Bot linux (Ubuntu Focal): status New Incomplete
2021-05-20 13:30:17 Ubuntu Kernel Bot linux (Ubuntu Groovy): status New Incomplete
2021-05-20 13:30:19 Ubuntu Kernel Bot linux (Ubuntu Hirsute): status New Incomplete
2021-05-20 13:33:42 Andrea Righi summary properly tear down KVM PV features on hibernate kvm: properly tear down PV features on hibernate
2021-05-20 13:36:05 Andrea Righi description [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with hypervisor and when we restore from hibernation we must properly tear down all these features to make sure hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel. For this reason it is safe to apply this patch set also to the other generic kernels and not just AWS. [Test plan] This can be easily tested on AWS (but it should be reproduced by hibernating any kvm instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ For the other kernels, simply apply this patch set. The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hotplugging. [Impact] In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:   "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" It looks like we can replace this workaround with a proper fix, by applying this patch: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with hypervisor and when we restore from hibernation we must properly tear down all these features to make sure hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel. For this reason it is safe to apply this patch set also to all the generic kernels and not just AWS. [Test plan] This can be easily tested on AWS (but it should be reproduced by hibernating any kvm instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen. [Fix] On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with: http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/ For the other kernels, simply apply this patch set. The fix has been tested extensively in the AWS infrastructure with positive results. [Regression potential] This new code introduced by the fix can be executed also when a CPU is put offline, so we may see potential regressions in the KVM CPU hot-plugging.
2021-05-20 16:04:13 Kleber Sacilotto de Souza linux (Ubuntu Hirsute): status Incomplete In Progress
2021-05-20 16:04:15 Kleber Sacilotto de Souza linux (Ubuntu Groovy): status Incomplete In Progress
2021-05-20 16:04:19 Kleber Sacilotto de Souza linux (Ubuntu Focal): status Incomplete In Progress
2021-05-26 17:32:29 Kleber Sacilotto de Souza linux (Ubuntu Focal): status In Progress Fix Committed
2021-05-26 17:33:58 Kleber Sacilotto de Souza linux (Ubuntu Groovy): status In Progress Fix Committed
2021-05-26 17:35:31 Kleber Sacilotto de Souza linux (Ubuntu Hirsute): status In Progress Fix Committed
2021-05-28 23:47:55 Kelsey Steele linux-aws (Ubuntu Focal): status New Fix Committed
2021-06-02 19:58:39 Ubuntu Kernel Bot tags verification-needed-hirsute
2021-06-03 03:30:44 Ubuntu Kernel Bot tags verification-needed-hirsute verification-needed-focal verification-needed-hirsute
2021-06-05 17:22:56 Ubuntu Kernel Bot tags verification-needed-focal verification-needed-hirsute verification-needed-focal verification-needed-groovy verification-needed-hirsute
2021-06-21 23:22:10 Launchpad Janitor linux (Ubuntu): status Incomplete Fix Released
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-24586
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-24587
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-24588
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-26139
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-26141
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-26145
2021-06-21 23:22:10 Launchpad Janitor cve linked 2020-26147
2021-06-21 23:22:10 Launchpad Janitor cve linked 2021-20288
2021-06-21 23:22:10 Launchpad Janitor cve linked 2021-33200
2021-06-21 23:22:10 Launchpad Janitor cve linked 2021-3489
2021-06-21 23:22:10 Launchpad Janitor cve linked 2021-3490
2021-06-22 16:01:10 Launchpad Janitor linux (Ubuntu Hirsute): status Fix Committed Fix Released
2021-06-22 16:01:38 Launchpad Janitor linux-aws (Ubuntu Hirsute): status New Fix Released
2021-06-22 16:03:26 Launchpad Janitor linux (Ubuntu Groovy): status Fix Committed Fix Released
2021-06-22 16:03:26 Launchpad Janitor cve linked 2021-23133
2021-06-22 16:03:26 Launchpad Janitor cve linked 2021-31440
2021-06-22 16:03:47 Launchpad Janitor linux-aws (Ubuntu Groovy): status New Fix Released
2021-06-22 16:05:32 Launchpad Janitor linux (Ubuntu Focal): status Fix Committed Fix Released
2021-06-22 16:05:56 Launchpad Janitor linux-aws (Ubuntu Focal): status Fix Committed Fix Released
2021-06-24 18:14:22 Launchpad Janitor linux-aws (Ubuntu): status New Fix Released