[TTM] Buffer eviction failed

Bug #1884726 reported by Michele Della Guardia
This bug affects 4 people
Affects: linux (Ubuntu)
Status: Expired
Importance: Undecided
Assigned to: Unassigned

Bug Description

Under some usage conditions on Ubuntu 20.04 the system freezes almost completely, and after a few minutes dmesg shows this message: "[TTM] Buffer eviction failed".
I can always reproduce the problem by using the Street View function of maps.google.com in Firefox (77.0.1).
My kernel version is 5.4.0-38 and my graphics card is an old Nvidia:

01:00.0 VGA compatible controller: NVIDIA Corporation G86M [GeForce 8400M GS] (rev a1)

When I switch to the closed-source proprietary Nvidia driver (version 340.108) I have no issues.
---
ProblemType: Bug
ApportVersion: 2.20.11-0ubuntu27.3
Architecture: amd64
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: michele 1586 F.... pulseaudio
CasperMD5CheckResult: skip
CurrentDesktop: ubuntu:GNOME
DistroRelease: Ubuntu 20.04
HibernationDevice: RESUME=UUID=4e56d43d-0da0-4af9-a55c-035f04c115bf
InstallationDate: Installed on 2017-11-13 (952 days ago)
InstallationMedia: Ubuntu 16.04.3 LTS "Xenial Xerus" - Release amd64 (20170801)
MachineType: Acer Aspire 5720
Package: linux (not installed)
ProcEnviron:
 TERM=xterm-256color
 PATH=(custom, no user)
 XDG_RUNTIME_DIR=<set>
 LANG=it_IT.UTF-8
 SHELL=/bin/bash
ProcFB: 0 nouveaudrmfb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-5.4.0-38-generic root=UUID=28547816-80be-43c8-a86e-d655d7d8ad5e ro quiet splash vt.handoff=7
ProcVersionSignature: Ubuntu 5.4.0-38.42-generic 5.4.44
RelatedPackageVersions:
 linux-restricted-modules-5.4.0-38-generic N/A
 linux-backports-modules-5.4.0-38-generic N/A
 linux-firmware 1.187.1
StagingDrivers: ashmem_linux
Tags: focal staging
Uname: Linux 5.4.0-38-generic x86_64
UpgradeStatus: Upgraded to focal on 2020-04-24 (60 days ago)
UserGroups: adm cdrom dialout dip lpadmin plugdev sambashare sudo tty uucp
_MarkForUpload: True
dmi.bios.date: 11/10/2008
dmi.bios.vendor: Acer
dmi.bios.version: V1.45
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: Nettiling
dmi.board.vendor: Acer
dmi.board.version: V1.45
dmi.chassis.type: 1
dmi.chassis.vendor: Acer
dmi.chassis.version: V1.45
dmi.modalias: dmi:bvnAcer:bvrV1.45:bd11/10/2008:svnAcer:pnAspire5720:pvrV1.45:rvnAcer:rnNettiling:rvrV1.45:cvnAcer:ct1:cvrV1.45:
dmi.product.family: None
dmi.product.name: Aspire 5720
dmi.product.sku: None
dmi.product.version: V1.45
dmi.sys.vendor: Acer
modified.conffile..etc.default.apport: [modified]
mtime.conffile..etc.default.apport: 2019-08-08T12:13:15.951541

Ubuntu Kernel Bot (ubuntu-kernel-bot) wrote : Missing required logs.

This bug is missing log files that will aid in diagnosing the problem. While running an Ubuntu kernel (not a mainline or third-party kernel) please enter the following command in a terminal window:

apport-collect 1884726

and then change the status of the bug to 'Confirmed'.

If, due to the nature of the issue you have encountered, you are unable to run this command, please add a comment stating that fact and change the bug status to 'Confirmed'.

This change has been made by an automated script, maintained by the Ubuntu Kernel Team.

Changed in linux (Ubuntu):
status: New → Incomplete
Michele Della Guardia (micheledellaguardia) wrote : AlsaInfo.txt

apport information

tags: added: apport-collected staging
description: updated
Michele Della Guardia (micheledellaguardia) wrote : CRDA.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : CurrentDmesg.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : IwConfig.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : Lspci.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : Lspci-vt.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : Lsusb.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : Lsusb-t.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : Lsusb-v.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : ProcCpuinfo.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : ProcCpuinfoMinimal.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : ProcInterrupts.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : ProcModules.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : PulseList.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : RfKill.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : UdevDb.txt

apport information

Michele Della Guardia (micheledellaguardia) wrote : WifiSyslog.txt

apport information

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Kai-Heng Feng (kaihengfeng) wrote :

Please test the latest drm-tip kernel:
https://kernel.ubuntu.com/~kernel-ppa/mainline/drm-tip/current/

If the issue persists, please file an upstream bug at
https://gitlab.freedesktop.org/drm/nouveau/issues

Changed in linux (Ubuntu):
status: Confirmed → Incomplete
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
Paddy Launch (p-l) wrote :

Hello, I would like to help but I am not sure what to do.

I could not use the workaround of installing the proprietary driver because it does not work for me (see https://bugs.launchpad.net/ubuntu/+source/nvidia-graphics-drivers-440/+bug/1878946).

What must I do to test the latest drm-tip kernel from
https://kernel.ubuntu.com/~kernel-ppa/mainline/drm-tip/current/ ?

Sorry, I have not done such things before, so I need more advice.

Today I got the error several times, one directly after the other, and the machine froze for about 15 minutes.

Here is the info from the journal:

Mar 18 18:46:35 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:46:50 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:47:20 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:47:50 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:48:20 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:48:51 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:49:36 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:50:21 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:51:37 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:52:07 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:52:22 dob-R510-P510 kernel: [TTM] Buffer eviction failed
Mar 18 18:54:08 dob-R510-P510 kernel: [TTM] Buffer eviction failed
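
The spacing of the messages above can be checked with a short script. This is only a minimal sketch, assuming the journal lines use the "Mon DD HH:MM:SS host kernel: message" layout shown above. The function name eviction_gaps and the placeholder year are illustrative and not part of any existing tool.

```python
from datetime import datetime

MESSAGE = "[TTM] Buffer eviction failed"

def eviction_gaps(lines, year=2000):
    """Return the seconds between consecutive eviction-failure lines.

    `year` is a placeholder assumption: syslog-style timestamps carry
    no year, so one has to be supplied to build a full datetime.
    """
    stamps = []
    for line in lines:
        if MESSAGE not in line:
            continue
        # The first three whitespace-separated fields are month, day, time.
        month, day, clock = line.split()[:3]
        stamps.append(
            datetime.strptime(f"{year} {month} {day} {clock}", "%Y %b %d %H:%M:%S")
        )
    # Pair each timestamp with its successor and take the difference.
    return [int((b - a).total_seconds()) for a, b in zip(stamps, stamps[1:])]

sample = [
    "Mar 18 18:46:35 dob-R510-P510 kernel: [TTM] Buffer eviction failed",
    "Mar 18 18:46:50 dob-R510-P510 kernel: [TTM] Buffer eviction failed",
    "Mar 18 18:47:20 dob-R510-P510 kernel: [TTM] Buffer eviction failed",
]
print(eviction_gaps(sample))
```

Run over the twelve journal lines quoted above, the gaps fall between 15 and 106 seconds, which matches a machine that stays frozen while the message keeps repeating.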

alex-mobigo (alex-mobigo) wrote :

I have a similar issue with LXDE on Ubuntu 18.04.
lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 18.04.6 LTS
Release: 18.04
Codename: bionic

It seems to be related to the nouveau driver when it is used by the Chromium web engine. Chrome, Chromium, and Brave freeze the system (crash report below). On some occasions, if you manage to kill the browser, press CTRL+DEL several times, and wait a few minutes (3 to 10 min), the PC becomes responsive again.

[40802.271418] [TTM] Buffer eviction failed
[40802.289063] ------------[ cut here ]------------
[40802.289067] Trying to vfree() bad address (00000000b4473d8e)
[40802.289086] WARNING: CPU: 1 PID: 12940 at /build/linux-hwe-5.4-rs4SmW/linux-hwe-5.4-5.4.0/mm/vmalloc.c:2240 __vunmap+0x1fe/0x210
[40802.289087] Modules linked in: cdc_mbim cdc_ncm ppp_deflate bsd_comp ppp_async cdc_ether qmi_wwan cdc_wdm option usbnet usb_wwan usbserial mii nls_utf8 isofs binfmt_misc nouveau mxm_wmi wmi video ttm coretemp kvm_intel drm_kms_helper kvm drm input_leds snd_hda_codec_via snd_hda_codec_generic serio_raw ledtrig_audio snd_hda_intel i2c_algo_bit lpc_ich snd_intel_dspcfg fb_sys_fops snd_hda_codec syscopyarea snd_hda_core sysfillrect sysimgblt asus_atk0110 snd_hwdep mac_hid sch_fq_codel tw5864 solo6x10 videobuf2_dma_contig videobuf2_dma_sg videobuf2_memops videobuf2_v4l2 videobuf2_common videodev mc snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device snd_timer snd soundcore parport_pc ppdev lp parport ip_tables x_tables autofs4 uas usb_storage psmouse r8169 pata_acpi pata_jmicron realtek
[40802.289121] CPU: 1 PID: 12940 Comm: chromium-browse Not tainted 5.4.0-55-generic #61~18.04.1-Ubuntu
[40802.289122] Hardware name: System manufacturer System Product Name/P5Q SE PLUS, BIOS 2204 08/28/2009
[40802.289124] RIP: 0010:__vunmap+0x1fe/0x210
[40802.289126] Code: e8 67 85 e2 ff 0f 0b eb e0 31 d2 31 f6 48 c7 c7 ff ff ff ff e8 c3 fc ff ff eb c6 48 89 fe 48 c7 c7 08 76 36 a0 e8 42 85 e2 ff <0f> 0b eb bb 0f 1f 40 00 66 2e 0f 1f 84 00 00 00 00 00 66 66 66 66
[40802.289126] RSP: 0018:ffffa33e00fa38e0 EFLAGS: 00010286
[40802.289128] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000006
[40802.289128] RDX: 0000000000000007 RSI: 0000000000000086 RDI: ffff8e8baba978c0
[40802.289129] RBP: ffffa33e00fa3908 R08: 00000000000004f3 R09: 0000000000000004
[40802.289130] R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000a80
[40802.289130] R13: ffff8e8ba133ba80 R14: 0000251bee76401f R15: 0000000000000018
[40802.289132] FS: 00007fa148d3fc00(0000) GS:ffff8e8baba80000(0000) knlGS:0000000000000000
[40802.289132] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[40802.289133] CR2: 00007fc3c86f7fd0 CR3: 000000009707c000 CR4: 00000000000406e0
[40802.289134] Call Trace:
[40802.289139] vunmap+0x2f/0x40
[40802.289213] nvkm_umem_unmap+0x4e/0x70 [nouveau]
[40802.289233] nvkm_object_dtor+0xa2/0x180 [nouveau]
[40802.289254] nvkm_object_del+0x25/0x90 [nouveau]
[40802.289274] nvkm_ioctl_del+0x8c/0xa0 [nouveau]
[40802.289294] nvkm_ioctl+0x11d/0x280 [nouveau]
[40802.289328] nvkm_client_ioctl+0x12/0x20 [nouveau]
[40802.289347] nvif_object_ioctl+0x47/0x50 [nouveau]
[40802.2893...
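
To see which module the trace passes through, the frames can be pulled out of the log text. This is a minimal sketch, assuming frame lines of the form "[time] symbol+0xoffset/0xsize [module]" as in the trace above. The helper name trace_frames is hypothetical.

```python
import re

# [timestamp] symbol+0xOFFSET/0xSIZE, optionally followed by [module].
# A leading "? " marks an unreliable frame in kernel traces and is skipped.
FRAME = re.compile(
    r"^\[\s*\d+\.\d+\]\s+(?:\?\s+)?([\w.]+)\+0x[0-9a-f]+/0x[0-9a-f]+"
    r"(?:\s+\[(\w+)\])?\s*$"
)

def trace_frames(lines):
    """Return (symbol, module) pairs; module is None for core-kernel frames."""
    frames = []
    for line in lines:
        match = FRAME.match(line.strip())
        if match:
            frames.append((match.group(1), match.group(2)))
    return frames

sample = [
    "[40802.289134] Call Trace:",
    "[40802.289139] vunmap+0x2f/0x40",
    "[40802.289213] nvkm_umem_unmap+0x4e/0x70 [nouveau]",
    "[40802.289233] nvkm_object_dtor+0xa2/0x180 [nouveau]",
]
for symbol, module in trace_frames(sample):
    print(f"{module or 'kernel'}: {symbol}")
```

On the frames visible above, every entry below vunmap is tagged [nouveau], which is consistent with the suspicion that the driver is involved. It does not by itself show where the fault originates.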


alex-mobigo (alex-mobigo) wrote :

Update info:
01:00.0 VGA compatible controller: NVIDIA Corporation G96 [GeForce 9500 GT] (rev a1)

I can run GPU-intensive accelerated workloads (OpenGL / OpenGL ES) without any crashes.
With a browser, however (Chrome, Chromium, Brave, and I have recently read about the TTM crash with Firefox too), the computer can crash after a few minutes of scrolling down a page with lots of ads.

I am not sure whether the bug is in TTM/drm or in the browser's GPU userspace code. However, Chromium crashes right after I log in through lightdm. I was not even aware Chromium had a background thread running.

Just for information, here is how Chromium is rendering the page; it looks like a corrupted page.

alex-mobigo (alex-mobigo) wrote :

I switched to the Nvidia proprietary driver and so far there are no more crashes in the browser. I need more time to fully test with the Nvidia driver.

This is definitely a nouveau driver bug with the kernel:
Linux svn 5.4.0-55-generic #61~18.04.1-Ubuntu SMP Wed Nov 11 14:12:46 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

Attached is the crash_history.
