Comment 5 for bug 1787904

Revision history for this message
Krister Swenson (thekswenson) wrote : Re: [Bug 1787904] Re: [radeon] machine crash after GPU reset

It seems to occur intermittently...
  I'm never sure what the conditions are, but it happens once every few
weeks.
I can be in the middle of work and the machine will crash.

Unfortunately, I can't make it happen at will. Is there something you'd
like me to do?

On Wed, Aug 22, 2018 at 6:01 PM Joseph Salisbury <
<email address hidden>> wrote:

> Do you have a way to reproduce this issue, or was it a one time event?
>
> ** Changed in: linux (Ubuntu)
> Importance: Undecided => Medium
>
> ** Changed in: linux (Ubuntu)
> Status: Confirmed => Incomplete
>
> --
> You received this bug notification because you are subscribed to the bug
> report.
> https://bugs.launchpad.net/bugs/1787904
>
> Title:
> [radeon] machine crash after GPU reset
>
> Status in linux package in Ubuntu:
> Incomplete
>
> Bug description:
> I showed up to work today (after being away for a couple of weeks) and
> my machine had crashed.
> I'm not sure why it happened this morning at 7am. My only guess is that
> a janitor came in and touched the mouse which woke the screen up after a
> couple of weeks of sleep.
> (I have been using the machine remotely for computations, which had
> finished yesterday, but the building was closed so I'm guessing this is the
> first time the screen has come on in a while).
>
> The first unusual entries in my kern.log are:
>
> Aug 20 07:02:43 winters kernel: [2218007.668454] radeon 0000:03:00.0:
> ring 0 stalled for more than 10248msec
> Aug 20 07:02:43 winters kernel: [2218007.668465] radeon 0000:03:00.0:
> GPU lockup (current fence id 0x00000000000799d4 last fence id
> 0x00000000000799d6 on ring 0)
>
> They are plentiful, and are mixed with other messages (see attached
> log file) such as:
>
> Aug 20 07:03:00 winters kernel: [2218025.438173] radeon 0000:03:00.0:
> failed VCE resume (-110).
> Aug 20 07:03:01 winters kernel: [2218025.721068] [drm:r600_ring_test
> [radeon]] *ERROR* radeon: ring 0 test failed (scratch(0x850C)=0xCAFEDEAD)
> Aug 20 07:03:01 winters kernel: [2218025.721085] [drm:si_resume
> [radeon]] *ERROR* si startup failed on resume
> Aug 20 07:03:01 winters kernel: [2218025.722887] WARNING: CPU: 18 PID:
> 3715 at
> /build/linux-60XibS/linux-4.15.0/drivers/gpu/drm/radeon/radeon_object.c:84
> radeon_ttm_bo_destroy+0xfb/0x100 [radeon]
> .
> .
> .
> Aug 20 07:03:38 winters kernel: [2218062.876285] [drm:atom_op_jump
> [radeon]] *ERROR* atombios stuck in loop for more than 5secs aborting
> Aug 20 07:03:38 winters kernel: [2218062.876298]
> [drm:atom_execute_table_locked [radeon]] *ERROR* atombios stuck executing
> BBC8 (len 237, WS 0, PS 4) @ 0xBBD6
> Aug 20 07:03:38 winters kernel: [2218062.876309]
> [drm:atom_execute_table_locked [radeon]] *ERROR* atombios stuck executing
> B3EE (len 78, WS 12, PS 8) @ 0xB427
>
> The last message before death appears to be:
>
> Aug 20 07:10:02 winters kernel: [2218447.006362] radeon 0000:03:00.0:
> GPU reset succeeded, trying to resume
>
> ProblemType: Bug
> DistroRelease: Ubuntu 18.04
> Package: xserver-xorg-video-radeon 1:18.0.1-1
> ProcVersionSignature: Ubuntu 4.15.0-32.35-generic 4.15.18
> Uname: Linux 4.15.0-32-generic x86_64
> NonfreeKernelModules: wl
> .tmp.unity_support_test.0:
>
> ApportVersion: 2.20.9-0ubuntu7.2
> Architecture: amd64
> CompizPlugins: No value set for
> `/apps/compiz-1/general/screen0/options/active_plugins'
> CompositorRunning: None
> CurrentDesktop: ubuntu:GNOME
> Date: Mon Aug 20 10:18:42 2018
> DistUpgraded: 2018-05-02 12:54:58,714 DEBUG icon theme changed,
> re-reading
> DistroCodename: bionic
> DistroVariant: ubuntu
> DkmsStatus:
> bcmwl, 6.30.223.271+bdcom, 4.15.0-29-generic, x86_64: installed
> bcmwl, 6.30.223.271+bdcom, 4.15.0-30-generic, x86_64: installed
> bcmwl, 6.30.223.271+bdcom, 4.15.0-32-generic, x86_64: installed
> ExtraDebuggingInterest: Yes
> GraphicsCard:
> Advanced Micro Devices, Inc. [AMD/ATI] Oland GL [FirePro W2100]
> [1002:6608] (prog-if 00 [VGA controller])
> Subsystem: Dell Oland GL [FirePro W2100] [1028:2120]
> InstallationDate: Installed on 2018-01-05 (226 days ago)
> InstallationMedia: Ubuntu 17.04 "Zesty Zapus" - Release amd64 (20170412)
> MachineType: Dell Inc. Precision Tower 7810
> ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-4.15.0-32-generic
> root=UUID=90d43be7-a2f7-4500-8b88-9bd7a549d96d ro quiet splash vt.handoff=1
> SourcePackage: xserver-xorg-video-ati
> UpgradeStatus: Upgraded to bionic on 2018-05-02 (109 days ago)
> dmi.bios.date: 06/25/2018
> dmi.bios.vendor: Dell Inc.
> dmi.bios.version: A27
> dmi.board.name: 0KJCC5
> dmi.board.vendor: Dell Inc.
> dmi.board.version: A00
> dmi.chassis.type: 7
> dmi.chassis.vendor: Dell Inc.
> dmi.modalias:
> dmi:bvnDellInc.:bvrA27:bd06/25/2018:svnDellInc.:pnPrecisionTower7810:pvr:rvnDellInc.:rn0KJCC5:rvrA00:cvnDellInc.:ct7:cvr:
> dmi.product.name: Precision Tower 7810
> dmi.sys.vendor: Dell Inc.
> version.compiz: compiz 1:0.9.13.1+18.04.20180302-0ubuntu1
> version.libdrm2: libdrm2 2.4.91-2
> version.libgl1-mesa-dri: libgl1-mesa-dri 18.0.5-0ubuntu0~18.04.1
> version.libgl1-mesa-glx: libgl1-mesa-glx 18.0.5-0ubuntu0~18.04.1
> version.xserver-xorg-core: xserver-xorg-core 2:1.19.6-1ubuntu4
> version.xserver-xorg-input-evdev: xserver-xorg-input-evdev
> 1:2.10.5-1ubuntu1
> version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:18.0.1-1
> version.xserver-xorg-video-intel: xserver-xorg-video-intel
> 2:2.99.917+git20171229-1
> version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.15-2
> xserver.bootTime: Fri Jan 5 15:01:43 2018
> xserver.configfile: default
> xserver.errors:
>
> xserver.logfile: /var/log/Xorg.0.log
> xserver.version: 2:1.19.3-1ubuntu1.3
> xserver.video_driver: radeon
>
> To manage notifications about this bug go to:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1787904/+subscriptions
>