Huawei Kunpeng 920 arm64 machine: KVM guests frequently crash

Bug #1869763 reported by J0n47
This bug affects 1 person
Affects          Status    Importance   Assigned to   Milestone
kunpeng920       Invalid   Undecided    Unassigned
linux (Ubuntu)   Invalid   Undecided    Unassigned

Bug Description

Hardware spec: arm64, 64 cores, 64 GB RAM

OS & kernel version: Ubuntu 18.04.4 LTS, 4.15.0-91-generic

`virt-host-validate qemu` output:
QEMU: Checking if device /dev/kvm exists : PASS
QEMU: Checking if device /dev/kvm is accessible : PASS
QEMU: Checking if device /dev/vhost-net exists : PASS
QEMU: Checking if device /dev/net/tun exists : PASS
QEMU: Checking for cgroup 'memory' controller support : PASS
QEMU: Checking for cgroup 'memory' controller mount-point : PASS
QEMU: Checking for cgroup 'cpu' controller support : PASS
QEMU: Checking for cgroup 'cpu' controller mount-point : PASS
QEMU: Checking for cgroup 'cpuacct' controller support : PASS
QEMU: Checking for cgroup 'cpuacct' controller mount-point : PASS
QEMU: Checking for cgroup 'cpuset' controller support : PASS
QEMU: Checking for cgroup 'cpuset' controller mount-point : PASS
QEMU: Checking for cgroup 'devices' controller support : PASS
QEMU: Checking for cgroup 'devices' controller mount-point : PASS
QEMU: Checking for cgroup 'blkio' controller support : PASS
QEMU: Checking for cgroup 'blkio' controller mount-point : PASS
WARN (Unknown if this platform has IOMMU support)

libvirt version: 4.0.0-1ubuntu8.14

qemu version: 1:2.11+dfsg-1ubuntu7.23 arm64

guest vm kernel version: 4.15.0-64-generic aarch64

dmesg log:
kvm [49132]: Unexpected L2 read permission error

libvirt log:
ubuntu libvirtd: 2020-03-20 03:04.474+0000: 42934: warning : qemuDomainObjTaint:5602 : Domain id=38 name='vm-157' uuid=3d447d79-c2a1-4351-b607-6698a2cd6c5f is tainted: host-cpu

When the "Unexpected L2 read permission error" message occurs, one of the guest machines enters the "paused" state and has to be reset with `virsh reset PAUSED_VM_NAME` and then started again. These guest VM hangs/crashes occur very frequently, sometimes several times per hour.
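
The manual recovery step described above could be scripted. The following is only a minimal sketch, assuming `virsh` is on the PATH and that `virsh list --state-paused --name` prints one paused domain name per line. The helper names `parse_domain_names` and `reset_paused_domains` are hypothetical. Whether a further `virsh resume` or `virsh start` is needed after the reset is not confirmed by this report, so the sketch only issues the reset, and it defaults to a dry run.

```python
import subprocess
from typing import List


def parse_domain_names(virsh_output: str) -> List[str]:
    """Return the non-empty, stripped lines of `virsh list --name` style output."""
    return [line.strip() for line in virsh_output.splitlines() if line.strip()]


def reset_paused_domains(dry_run: bool = True) -> List[str]:
    """Reset every paused domain; return the names that were (or would be) reset."""
    # Ask libvirt for the names of all domains currently in the paused state.
    result = subprocess.run(
        ["virsh", "list", "--state-paused", "--name"],
        capture_output=True,
        text=True,
        check=True,
    )
    names = parse_domain_names(result.stdout)
    for name in names:
        if dry_run:
            # Only show what would be done (assumption: the operator reviews this first).
            print(f"would run: virsh reset {name}")
        else:
            subprocess.run(["virsh", "reset", name], check=True)
    return names
```

Calling `reset_paused_domains()` prints the commands without running them; `reset_paused_domains(dry_run=False)` actually resets the paused guests. This only automates the workaround and does nothing about the underlying fault.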

Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1869763/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
J0n47 (361904373-9)
affects: ubuntu → linux-meta (Ubuntu)
Juhani Numminen (jsonic)
affects: linux-meta (Ubuntu) → linux (Ubuntu)
Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1869763

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
tags: added: bionic
J0n47 (361904373-9) wrote :

After switching to the HWE kernel (linux-image-generic-hwe-18.04 5.3.0.45.101, arm64), the issue has not occurred again within 1 day.

Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
liuliu (beyond2) wrote :

Hello, what is the status of this issue now?

Does it reproduce with the kernel version "5.3.0.45.101"?

Fred Kimmy (kongzizaixian) wrote :

https://lists.cs.columbia.edu/pipermail/kvmarm/2020-September/042543.html

This patch will solve this bug. Can you test it?

Changed in linux (Ubuntu):
status: Expired → New
Changed in linux (Ubuntu):
status: New → Incomplete
Changed in kunpeng920:
status: New → Incomplete
Ike Panhc (ikepanhc) wrote :

According to the description, this issue happens with Ubuntu 18.04.4 LTS, 4.15.0-91-generic, and cannot be reproduced with the HWE kernel. Are any qemu options needed to reproduce it?

I am going to create a KVM guest on a kunpeng920 machine running the Ubuntu 18.04 GA kernel and see if I can reproduce it.

Ike Panhc (ikepanhc) wrote :

I cannot reproduce this issue with the bionic-update kernel, and the patch mentioned in #6 has been merged into ubuntu-bionic since 4.15.0-125.128. I believe this issue has been fixed and am going to close this bug. Please re-open it if this issue is still bothering you with the bionic-update kernel.
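
To check whether a given bionic 4.15 host kernel is at or past the 4.15.0-125.128 release mentioned above, something like the following sketch may help. It assumes the usual `4.15.0-<ABI>-<flavour>` release string form (as in `4.15.0-91-generic`). The function name `has_fix` and the constant `FIRST_FIXED_ABI` are hypothetical. For any other kernel series, including the HWE kernels, this report gives no threshold, so the function returns `None` rather than guessing.

```python
import re
from typing import Optional

# ABI number of the first bionic 4.15 kernel stated in this thread
# to contain the patch (4.15.0-125.128).
FIRST_FIXED_ABI = 125


def has_fix(kernel_release: str) -> Optional[bool]:
    """Return True/False for 4.15.0-N releases, None for any other series."""
    match = re.match(r"^4\.15\.0-(\d+)(?:[.-]|$)", kernel_release)
    if match is None:
        # Not a bionic GA 4.15 kernel: no threshold is known from this report.
        return None
    return int(match.group(1)) >= FIRST_FIXED_ABI
```

On the host, `has_fix(platform.release())` (after `import platform`) would apply the check to the running kernel.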

Changed in linux (Ubuntu):
status: Incomplete → Invalid
Changed in kunpeng920:
status: Incomplete → Invalid