Random crashes related to CPU virtualization - have a sure way to replicate the crashes

Bug #1982965 reported by Ip Soon
This bug affects 1 person
Affects: linux (Ubuntu)
Status: Confirmed
Importance: Undecided
Assigned to: Unassigned
Milestone: (none listed)

Bug Description

Problem description: Linux suffers random crashes, especially during video-intensive operations.

Hardware configurations:

CPU: AMD Ryzen 7 5700G with Radeon Graphics
Motherboard: Gigabyte B550M D3H
BIOS version: F14, then F15c; both exhibited the same behavior. F15c seems slightly more resilient to the crashes, meaning glmark2 runs a little longer before the system crashes.

OS: Linux Mint 20.3 with several Ubuntu kernels from the 5.11, 5.13, and 5.15 series. Because of the CPU and motherboard, I have to use non-packaged kernels for both graphics and the network interface to work.

Key findings:

Once CPU virtualization is turned on, the system starts to crash randomly. On this Gigabyte motherboard, hardware virtualization is called SVM mode. All crashes occurred with SVM mode enabled.

A crash here means the desktop suddenly disappears during operation, leaving only a black screen with no keyboard or mouse response. I had to power-cycle the computer to resume operation. I tried to save the syslog for each boot that ended in a crash.

At first I tried to sync the file system when rebooting the computer, but the computer booted up without the file system having been synced.

Over at least three months of these crashes, I tried upgrading the kernel from 5.11 to 5.15; none of the versions helped.
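
When comparing crashing and non-crashing boots, it may help to record whether the CPU reports SVM on each boot. The following is a minimal sketch that looks for the `svm` entry in the `flags` line of `/proc/cpuinfo`. The helper names are hypothetical, and it is an assumption that this flag tracks the BIOS setting: on some kernels it may only show that the CPU supports AMD-V, so the result should be checked against the firmware menu.

```python
from typing import Set


def cpu_flags(cpuinfo_text: str) -> Set[str]:
    """Return the feature flags from the first 'flags' line of /proc/cpuinfo text."""
    for line in cpuinfo_text.splitlines():
        key, sep, value = line.partition(":")
        if sep and key.strip() == "flags":
            return set(value.split())
    return set()


def advertises_svm(cpuinfo_text: str) -> bool:
    """True if the 'svm' flag is listed (assumed here to indicate AMD-V is visible)."""
    return "svm" in cpu_flags(cpuinfo_text)


# Example use on a live system (not executed here):
#   from pathlib import Path
#   print(advertises_svm(Path("/proc/cpuinfo").read_text()))
```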

Methods of replicating the crashes:

1. Turn on SVM mode in the BIOS.
2. Run glmark2 to benchmark graphics performance.
3. The system crashes every time before the benchmark completes.
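
Because the machine hangs hard and the file system may not be synced, the last lines of output are easily lost. The steps above could be wrapped in a small script that writes every line of benchmark output to disk immediately. This is only a sketch: `run_and_log` and the log file name are hypothetical, and glmark2 may buffer its own output when writing to a pipe, so the last line in the log is not guaranteed to be the exact point of the crash.

```python
import os
import subprocess
import time


def run_and_log(cmd, log_path):
    """Run cmd, append each output line to log_path, and force it to disk."""
    with open(log_path, "a", encoding="utf-8") as log:
        proc = subprocess.Popen(
            cmd,
            stdout=subprocess.PIPE,
            stderr=subprocess.STDOUT,  # keep error output in the same log
            text=True,
        )
        for line in proc.stdout:
            stamp = time.strftime("%Y-%m-%d %H:%M:%S")
            log.write(f"{stamp} {line}")
            log.flush()
            os.fsync(log.fileno())  # ask the kernel to write the data out now
        return proc.wait()


# Example use for step 2 (not executed here; the log path is an assumption):
#   run_and_log(["glmark2"], "glmark2-svm-on.log")
```

After a forced power cycle, the last timestamp in the log gives a rough time of the hang that can be matched against the syslog.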

To me, the syslogs don't show any meaningful information about the crashes. The following link contains some of the syslogs gathered during these crashes, especially from the boots that crashed shortly after startup.

https://drive.google.com/drive/folders/1cYbQPdfPSC_6A879xBWRs1jIY7R5ylMU?usp=sharing
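
For anyone trying to gather the same kind of logs after a crash, the kernel messages of the previous boot can usually be read back with `journalctl`, provided the journal is stored persistently (an assumption about the system's journald configuration). The sketch below only builds the command line; the function name and defaults are hypothetical.

```python
import subprocess


def previous_boot_log_cmd(boots_back=1, kernel_only=True, priority=None):
    """Build a journalctl command that shows the log of an earlier boot."""
    if boots_back < 1:
        raise ValueError("boots_back must be 1 or greater")
    cmd = ["journalctl", "--no-pager", "-b", f"-{boots_back}"]
    if kernel_only:
        cmd.append("-k")  # kernel messages only
    if priority:
        cmd += ["-p", priority]  # e.g. "warning" for warnings and worse
    return cmd


# Example use after rebooting from a crash (not executed here):
#   out = subprocess.run(previous_boot_log_cmd(priority="warning"),
#                        capture_output=True, text=True).stdout
```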

The most important finding is that once SVM mode is turned off (disabled) on the motherboard, I no longer experience these random crashes, and the glmark2 benchmark finishes without any issues. All kernel versions from 5.11 to 5.15 then run stably, as expected.

The real question to me is how CPU virtualization is supported in the kernel and why turning it on can provoke the crashes. The BIOS might need to be involved in a satisfactory solution to this problem; what I can offer is a sure way to replicate the crashes so that others can investigate the issue further.

That's all I can report here. If you need any further information, please feel free to let me know.

Thanks for your attention.

Ip Soon (ipsmile)
summary: - Random crashes without any meaningful error logs related to CPU
- virtualization but a way to replicate the crashes
+ Random crashes related to CPU virtualization - have a sure way to
+ replicate the crashes
description: updated
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1982965

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Ip Soon (ipsmile)
description: updated
Ip Soon (ipsmile) wrote : apport information

Attached files: AlsaInfo.txt, CRDA.txt, CurrentDmesg.txt, Lspci.txt, Lspci-vt.txt, Lsusb.txt, Lsusb-t.txt, Lsusb-v.txt, ProcCpuinfo.txt, ProcCpuinfoMinimal.txt, ProcEnviron.txt, ProcInterrupts.txt, ProcModules.txt, PulseList.txt, UdevDb.txt, WifiSyslog.txt, acpidump.txt, modified.conffile..etc.apport.crashdb.conf.txt

tags: added: apport-collected una
description: updated

Ip Soon (ipsmile) wrote :

I ran apport-collect 1982965, but my computer was not configured with SVM enabled at the time.

If anyone is interested, I can enable SVM mode and run it again.

As I stated, when SVM is enabled on the motherboard, meaning CPU virtualization is on, the random crashes appear.

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
description: updated