2021-03-23 15:31:19 |
Andrea Righi |
bug |
|
|
added bug |
2021-03-23 15:31:38 |
Andrea Righi |
nominated for series |
|
Ubuntu Focal |
|
2021-03-23 15:31:38 |
Andrea Righi |
bug task added |
|
linux-aws (Ubuntu Focal) |
|
2021-03-23 16:14:14 |
Andrea Righi |
description |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
[Test plan]
Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
[Test plan]
Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
|
2021-05-18 15:17:06 |
Andrea Righi |
description |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
[Test plan]
Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
https://lore.kernel.org/kvm/87sg4t7vqy.fsf@vitty.brq.redhat.com/T/#m7533e1d1e551bff425da029fd401bd87935edc33
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
[Test plan]
Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
|
2021-05-20 13:23:11 |
Andrea Righi |
nominated for series |
|
Ubuntu Groovy |
|
2021-05-20 13:23:11 |
Andrea Righi |
bug task added |
|
linux-aws (Ubuntu Groovy) |
|
2021-05-20 13:23:11 |
Andrea Righi |
nominated for series |
|
Ubuntu Hirsute |
|
2021-05-20 13:23:11 |
Andrea Righi |
bug task added |
|
linux-aws (Ubuntu Hirsute) |
|
2021-05-20 13:23:27 |
Andrea Righi |
bug task added |
|
linux (Ubuntu) |
|
2021-05-20 13:28:12 |
Andrea Righi |
description |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
[Test plan]
Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
Replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with the hypervisor, and when we restore from hibernation we must properly tear down all these features to make sure the hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel.
For this reason it is safe to apply this patch set also to the other generic kernels and not just AWS.
[Test plan]
This can be easily tested on AWS (but it should be reproducible by hibernating any KVM instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, then trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
For the other kernels, simply apply this patch set.
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
|
2021-05-20 13:28:19 |
Andrea Righi |
summary |
aws: proper fix for c5.18xlarge hibernation issues |
properly tear down KVM PV features on hibernate |
|
2021-05-20 13:30:09 |
Ubuntu Kernel Bot |
linux (Ubuntu): status |
New |
Incomplete |
|
2021-05-20 13:30:14 |
Ubuntu Kernel Bot |
linux (Ubuntu Focal): status |
New |
Incomplete |
|
2021-05-20 13:30:17 |
Ubuntu Kernel Bot |
linux (Ubuntu Groovy): status |
New |
Incomplete |
|
2021-05-20 13:30:19 |
Ubuntu Kernel Bot |
linux (Ubuntu Hirsute): status |
New |
Incomplete |
|
2021-05-20 13:33:42 |
Andrea Righi |
summary |
properly tear down KVM PV features on hibernate |
kvm: properly tear down PV features on hibernate |
|
2021-05-20 13:36:05 |
Andrea Righi |
description |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with the hypervisor, and when we restore from hibernation we must properly tear down all these features to make sure the hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel.
For this reason it is safe to apply this patch set also to the other generic kernels and not just AWS.
[Test plan]
This can be easily tested on AWS (but it should be reproducible by hibernating any KVM instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, then trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
For the other kernels, simply apply this patch set.
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hotplugging. |
[Impact]
In LP: #1918694 we applied a fix and a workaround to solve the hibernation issues on c5.18xlarge. The workaround was in the form of a SAUCE patch:
"UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot"
It looks like we can replace this workaround with a proper fix, by applying this patch:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
This is required because various PV features (Async PF, PV EOI, steal time) work through memory shared with the hypervisor, and when we restore from hibernation we must properly tear down all these features to make sure the hypervisor doesn't write to stale locations after we jump to the previously hibernated kernel.
For this reason it is safe to apply this patch set also to all the generic kernels and not just AWS.
[Test plan]
This can be easily tested on AWS (but it should be reproducible by hibernating any KVM instance with multiple CPUs). Create a c5.18xlarge instance, run the memory stress test script (the same test script that we are using to stress test hibernation), trigger the hibernate event, then trigger the resume event. Repeat a couple of times and the problem is very likely to happen.
[Fix]
On the AWS kernel replace "UBUNTU: SAUCE: aws: kvm: double the size of hv_clock_boot" with:
http://next.patchew.org/Linux/20210414123544.1060604-1-vkuznets@redhat.com/
For the other kernels, simply apply this patch set.
The fix has been tested extensively in the AWS infrastructure with positive results.
[Regression potential]
The new code introduced by the fix can also be executed when a CPU is put offline, so we may see regressions in KVM CPU hot-plugging. |
|
2021-05-20 16:04:13 |
Kleber Sacilotto de Souza |
linux (Ubuntu Hirsute): status |
Incomplete |
In Progress |
|
2021-05-20 16:04:15 |
Kleber Sacilotto de Souza |
linux (Ubuntu Groovy): status |
Incomplete |
In Progress |
|
2021-05-20 16:04:19 |
Kleber Sacilotto de Souza |
linux (Ubuntu Focal): status |
Incomplete |
In Progress |
|
2021-05-26 17:32:29 |
Kleber Sacilotto de Souza |
linux (Ubuntu Focal): status |
In Progress |
Fix Committed |
|
2021-05-26 17:33:58 |
Kleber Sacilotto de Souza |
linux (Ubuntu Groovy): status |
In Progress |
Fix Committed |
|
2021-05-26 17:35:31 |
Kleber Sacilotto de Souza |
linux (Ubuntu Hirsute): status |
In Progress |
Fix Committed |
|
2021-05-28 23:47:55 |
Kelsey Steele |
linux-aws (Ubuntu Focal): status |
New |
Fix Committed |
|
2021-06-02 19:58:39 |
Ubuntu Kernel Bot |
tags |
|
verification-needed-hirsute |
|
2021-06-03 03:30:44 |
Ubuntu Kernel Bot |
tags |
verification-needed-hirsute |
verification-needed-focal verification-needed-hirsute |
|
2021-06-05 17:22:56 |
Ubuntu Kernel Bot |
tags |
verification-needed-focal verification-needed-hirsute |
verification-needed-focal verification-needed-groovy verification-needed-hirsute |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
linux (Ubuntu): status |
Incomplete |
Fix Released |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-24586 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-24587 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-24588 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-26139 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-26141 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-26145 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2020-26147 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2021-20288 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2021-33200 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2021-3489 |
|
2021-06-21 23:22:10 |
Launchpad Janitor |
cve linked |
|
2021-3490 |
|
2021-06-22 16:01:10 |
Launchpad Janitor |
linux (Ubuntu Hirsute): status |
Fix Committed |
Fix Released |
|
2021-06-22 16:01:38 |
Launchpad Janitor |
linux-aws (Ubuntu Hirsute): status |
New |
Fix Released |
|
2021-06-22 16:03:26 |
Launchpad Janitor |
linux (Ubuntu Groovy): status |
Fix Committed |
Fix Released |
|
2021-06-22 16:03:26 |
Launchpad Janitor |
cve linked |
|
2021-23133 |
|
2021-06-22 16:03:26 |
Launchpad Janitor |
cve linked |
|
2021-31440 |
|
2021-06-22 16:03:47 |
Launchpad Janitor |
linux-aws (Ubuntu Groovy): status |
New |
Fix Released |
|
2021-06-22 16:05:32 |
Launchpad Janitor |
linux (Ubuntu Focal): status |
Fix Committed |
Fix Released |
|
2021-06-22 16:05:56 |
Launchpad Janitor |
linux-aws (Ubuntu Focal): status |
Fix Committed |
Fix Released |
|
2021-06-24 18:14:22 |
Launchpad Janitor |
linux-aws (Ubuntu): status |
New |
Fix Released |
|
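The hibernate/resume loop from the [Test plan] in the description could be sketched roughly as follows. This is only an illustration, not the script used for the actual verification: it assumes `ec2` behaves like a boto3 EC2 client, that the instance was launched with hibernation enabled, and that the memory stress test script (not specified in this bug) is already running inside the guest. The function name `hibernate_resume_cycles` and its parameters are hypothetical.

```python
from typing import Any


def hibernate_resume_cycles(ec2: Any, instance_id: str, cycles: int = 3) -> int:
    """Hibernate and resume one instance `cycles` times; return cycles completed.

    Assumption: `ec2` offers the same methods as boto3.client("ec2").
    """
    ids = [instance_id]
    completed = 0
    for _ in range(cycles):
        # Hibernate=True requests hibernation instead of a plain power-off.
        ec2.stop_instances(InstanceIds=ids, Hibernate=True)
        ec2.get_waiter("instance_stopped").wait(InstanceIds=ids)
        # Starting a hibernated instance exercises the resume path.
        ec2.start_instances(InstanceIds=ids)
        ec2.get_waiter("instance_running").wait(InstanceIds=ids)
        completed += 1
    return completed
```

Reaching the "running" state only shows that EC2 started the instance again; whether the guest kernel actually survived the resume still has to be checked separately, for example by logging in and inspecting the kernel log.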