System suddenly reboots on high load with 4.13.0-26-generic

Bug #1743735 reported by Marco Kleiss
Affects             Status   Importance  Assigned to  Milestone
linux-hwe (Ubuntu)  Invalid  Undecided   Unassigned

Bug Description

Hi, I administer a system that regularly runs under high load overnight. Since kernel 4.13.0-26-generic, it reboots suddenly shortly after the load starts. After switching back to kernel 4.10.0-42-generic, everything works fine. I suspect earlier 4.13.0 versions worked too.
The system runs Ubuntu 16.04.3 LTS on an AMD EPYC 7351.
Regards, Marco

# lsb_release -rd
Description: Ubuntu 16.04.3 LTS
Release: 16.04

# apt-cache policy linux-image-4.13.0-26-generic
linux-image-4.13.0-26-generic:
  Installed: 4.13.0-26.29~16.04.2
  Candidate: 4.13.0-26.29~16.04.2
  Version table:
 *** 4.13.0-26.29~16.04.2 500
        500 http://de.archive.ubuntu.com/ubuntu xenial-updates/main amd64 Packages
        500 http://security.ubuntu.com/ubuntu xenial-security/main amd64 Packages
        100 /var/lib/dpkg/status

Marco Kleiss (mkleiss) wrote :

I noticed that after a crash the following entries appear in dmesg:
# dmesg |grep BERT
[ 0.000000] ACPI: BERT 0x00000000DA1DB7C0 000030 (v01 AMD AMD BERT 00000001 AMD 00000001)
[ 2.457831] BERT: [Firmware Bug]: Invalid error record.

So I now think this is a hardware failure rather than a kernel bug. I am changing the status from New to Invalid (hoping that is the correct state).
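
For anyone repeating this check: BERT is the ACPI Boot Error Record Table, which firmware uses to pass error records from the previous boot to the kernel. The following is a minimal sketch, not a definitive diagnostic. The function name bert_messages is hypothetical, and it assumes that the driver's own messages carry the "BERT: " prefix seen in the output above, while the "ACPI: BERT 0x..." line only lists the table and can be ignored.

```shell
#!/bin/sh
# Hypothetical helper: read kernel log text on stdin and print only the
# messages emitted by the BERT driver itself.
# Assumption: driver messages contain the literal prefix "BERT: ", whereas
# the ACPI table listing line reads "ACPI: BERT 0x..." (no colon after BERT).
bert_messages() {
    grep -F 'BERT: '
}

# Usage after a crash and reboot (may require root on some systems):
#   dmesg | bert_messages
```

An empty result only means no BERT driver messages were logged for the current boot; it does not rule out a hardware fault.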

Changed in linux-hwe (Ubuntu):
status: New → Invalid