10de:1056 [Dell Latitude E6520] [NVS 4200M] X freeze, "unhandled status 0x00800000"

Bug #1243557 reported by Rebecca Palmer
14
This bug affects 2 people
Affects Status Importance Assigned to Milestone
Nouveau Xorg driver
Confirmed
High
xserver-xorg-video-nouveau (Ubuntu)
Triaged
Low
Unassigned

Bug Description

With the Nouveau driver selected, my system freezes within a few minutes of boot if on battery, but only on heavy graphics load (flightgear in maximized window) if on mains. The mouse pointer still moves at first, but often freezes later; the keyboard LEDs do not react. Sounds already playing continue until finished, but don't repeat if set to (i.e. applications are frozen). Alt+SysRq works (provided bug 1244312 is fixed first).

The kernel log nearly always contains the line
nouveau ![ PFIFO][0000:01:00.0] unhandled status 0x00800000

and sometimes other errors as well, while the Xorg log often contains "EQ overflow" errors; the full logs are attached.

WORKAROUND: This does not occur with the nvidia-319 driver.

ProblemType: Bug
DistroRelease: Ubuntu 13.10
Package: xserver-xorg-video-nouveau 1:1.0.9-2ubuntu1
ProcVersionSignature: Ubuntu 3.11.0-12.19-generic 3.11.3
Uname: Linux 3.11.0-12-generic x86_64
NonfreeKernelModules: nvidia
.proc.driver.nvidia.gpus.0: Error: [Errno 21] Is a directory: '/proc/driver/nvidia/gpus/0'
.proc.driver.nvidia.registry: Binary: ""
.proc.driver.nvidia.version:
 NVRM version: NVIDIA UNIX x86_64 Kernel Module 319.32 Wed Jun 19 15:51:20 PDT 2013
 GCC version: gcc version 4.8.1 (Ubuntu/Linaro 4.8.1-10ubuntu8)
.tmp.unity.support.test.0:

ApportVersion: 2.12.5-0ubuntu2
Architecture: amd64
CompizPlugins: [core,composite,opengl,compiztoolbox,decor,vpswitch,snap,mousepoll,resize,place,move,wall,grid,regex,imgpng,session,gnomecompat,animation,fade,unitymtgrabhandles,workarounds,scale,expo,ezoom,unityshell]
CompositorRunning: compiz
CompositorUnredirectDriverBlacklist: '(nouveau|Intel).*Mesa 8.0'
CompositorUnredirectFSW: true
Date: Wed Oct 23 08:19:06 2013
DistUpgraded: 2013-10-21 23:03:05,341 DEBUG enabling apt cron job
DistroCodename: saucy
DistroVariant: ubuntu
DkmsStatus: nvidia-319, 319.32, 3.11.0-12-generic, x86_64: installed
EcryptfsInUse: Yes
GraphicsCard:
 NVIDIA Corporation GF119M [NVS 4200M] [10de:1056] (rev a1) (prog-if 00 [VGA controller])
   Subsystem: Dell Device [1028:0494]
InstallationDate: Installed on 2012-01-25 (636 days ago)
InstallationMedia: Ubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012)
JockeyStatus:
 kmod:nvidia_319_updates - NVIDIA binary Xorg driver, kernel module and VDPAU library (Proprietary, Disabled, Not in use)
 kmod:wl - Broadcom STA wireless driver (Proprietary, Disabled, Not in use) [auto-install]
 kmod:nvidia_304 - NVIDIA binary Xorg driver, kernel module and VDPAU library (Proprietary, Disabled, Not in use)
 kmod:nvidia_304_updates - NVIDIA binary Xorg driver, kernel module and VDPAU library (Proprietary, Disabled, Not in use)
 kmod:nvidia_319 - nvidia_319 (Proprietary, Enabled, Not in use)
MachineType: Dell Inc. Latitude E6520
MarkForUpload: True
PlymouthDebug: Error: [Errno 13] Permission denied: '/var/log/plymouth-debug.log'
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.11.0-12-generic root=UUID=b491a34a-6045-4ad4-a2b8-b9ca018c5d41 ro quiet splash
SourcePackage: xserver-xorg-video-nouveau
UpgradeStatus: Upgraded to saucy on 2013-10-21 (1 days ago)
XorgConf:
 Section "Device"
  Identifier "Default Device"
  Option "NoLogo" "True"
 EndSection
dmi.bios.date: 10/18/2011
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A08
dmi.board.name: 0J4TFW
dmi.board.vendor: Dell Inc.
dmi.board.version: A01
dmi.chassis.type: 9
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvrA08:bd10/18/2011:svnDellInc.:pnLatitudeE6520:pvr01:rvnDellInc.:rn0J4TFW:rvrA01:cvnDellInc.:ct9:cvr:
dmi.product.name: Latitude E6520
dmi.product.version: 01
dmi.sys.vendor: Dell Inc.
version.compiz: compiz 1:0.9.10+13.10.20131011-0ubuntu1
version.ia32-libs: ia32-libs N/A
version.libdrm2: libdrm2 2.4.46-1
version.libgl1-mesa-dri: libgl1-mesa-dri 9.2.1-1ubuntu3
version.libgl1-mesa-dri-experimental: libgl1-mesa-dri-experimental N/A
version.libgl1-mesa-glx: libgl1-mesa-glx 9.2.1-1ubuntu3
version.nvidia-graphics-drivers: nvidia-graphics-drivers N/A
version.xserver-xorg-core: xserver-xorg-core 2:1.14.3-3ubuntu2
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev 1:2.7.3-0ubuntu3.1
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:7.2.0-0ubuntu10
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.904-0ubuntu2
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.9-2ubuntu1
xserver.bootTime: Wed Oct 23 07:24:22 2013
xserver.configfile: /etc/X11/xorg.conf
xserver.errors:
 Failed to load /usr/lib/xorg/modules/libglamoregl.so: /usr/lib/xorg/modules/libglamoregl.so: undefined symbol: _glapi_tls_Context
 Failed to load module "glamoregl" (loader failed, 7)
 open /dev/fb0: No such file or directory
xserver.logfile: /var/log/Xorg.0.log
xserver.outputs:

xserver.version: 2:1.14.3-3ubuntu2
---
ApportVersion: 2.12.5-0ubuntu2.1
Architecture: amd64
DistUpgraded: 2013-10-21 23:03:05,341 DEBUG enabling apt cron job
DistroCodename: saucy
DistroRelease: Ubuntu 13.10
DistroVariant: ubuntu
EcryptfsInUse: Yes
GraphicsCard:
 NVIDIA Corporation GF119M [NVS 4200M] [10de:1056] (rev a1) (prog-if 00 [VGA controller])
   Subsystem: Dell Device [1028:0494]
InstallationDate: Installed on 2012-01-25 (640 days ago)
InstallationMedia: Ubuntu 11.10 "Oneiric Ocelot" - Release amd64 (20111012)
MachineType: Dell Inc. Latitude E6520
MarkForUpload: True
Package: xserver-xorg-video-nouveau 1:1.0.9-2ubuntu1
PackageArchitecture: amd64
PlymouthDebug: Error: [Errno 13] Permission denied: u'/var/log/plymouth-debug.log'
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.11.0-12-generic root=UUID=b491a34a-6045-4ad4-a2b8-b9ca018c5d41 ro quiet splash reboot=p vt.handoff=7
ProcVersionSignature: Ubuntu 3.11.0-12.19-generic 3.11.3
Tags: saucy ubuntu
Uname: Linux 3.11.0-12-generic x86_64
UpgradeStatus: Upgraded to saucy on 2013-10-21 (5 days ago)
UserGroups: adm admin cdrom dialout lpadmin plugdev sambashare
XorgConf:
 Section "Device"
  Identifier "Default Device"
  Option "NoLogo" "True"
 EndSection
dmi.bios.date: 10/18/2011
dmi.bios.vendor: Dell Inc.
dmi.bios.version: A08
dmi.board.name: 0J4TFW
dmi.board.vendor: Dell Inc.
dmi.board.version: A01
dmi.chassis.type: 9
dmi.chassis.vendor: Dell Inc.
dmi.modalias: dmi:bvnDellInc.:bvrA08:bd10/18/2011:svnDellInc.:pnLatitudeE6520:pvr01:rvnDellInc.:rn0J4TFW:rvrA01:cvnDellInc.:ct9:cvr:
dmi.product.name: Latitude E6520
dmi.product.version: 01
dmi.sys.vendor: Dell Inc.
version.compiz: compiz 1:0.9.10+13.10.20131011-0ubuntu1
version.ia32-libs: ia32-libs N/A
version.libdrm2: libdrm2 2.4.46-1
version.libgl1-mesa-dri: libgl1-mesa-dri 9.2.1-1ubuntu3
version.libgl1-mesa-dri-experimental: libgl1-mesa-dri-experimental N/A
version.libgl1-mesa-glx: libgl1-mesa-glx 9.2.1-1ubuntu3
version.xserver-xorg-core: xserver-xorg-core 2:1.14.3-3ubuntu2
version.xserver-xorg-input-evdev: xserver-xorg-input-evdev 1:2.7.3-0ubuntu3.1
version.xserver-xorg-video-ati: xserver-xorg-video-ati 1:7.2.0-0ubuntu10
version.xserver-xorg-video-intel: xserver-xorg-video-intel 2:2.99.904-0ubuntu2
version.xserver-xorg-video-nouveau: xserver-xorg-video-nouveau 1:1.0.9-2ubuntu1
xserver.bootTime: Sun Oct 27 12:20:04 2013
xserver.configfile: /etc/X11/xorg.conf
xserver.errors:
 Failed to load module "nvidia" (module does not exist, 0)
 Failed to load module "nvidia" (module does not exist, 0)
xserver.logfile: /var/log/Xorg.0.log
xserver.version: 2:1.14.3-3ubuntu2
xserver.video_driver: nouveau

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :

With the latest upstream kernel (but not changing the userspace parts of the driver), this bug still exists but takes longer to occur.

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : BootDmesg.txt

apport information

tags: added: apport-collected
description: updated
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : BootLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : CurrentDmesg.txt

apport information

description: updated
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : BootDmesg.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : BootLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : CurrentDmesg.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : Dependencies.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : DpkgLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : HookError_source_xserver_xorg_video_nouveau.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : LightdmDisplayLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : LightdmGreeterLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : LightdmGreeterLogOld.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : LightdmLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : Lspci.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : Lsusb.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : MonitorsUser.xml.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : ProcEnviron.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : ProcInterrupts.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : ProcModules.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : UdevDb.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : UdevLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : XorgLog.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : XorgLogOld.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : Xrandr.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : xdpyinfo.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : xserver.devices.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : xserver.outputs.txt

apport information

Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote : Re: [NVS 4200M] X freeze, "unhandled status 0x00800000"
description: updated
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :

This bug seems to have become less frequent since my upstream kernel test: reliably triggering it now requires being on battery and (rather than or) heavy graphics load. As no relevant packages are listed in the apt log as upgraded since then, I suspect this test left a configuration change behind.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

With the Nouveau driver selected, my NVS 4200M/Ubuntu 13.10 system frequently hangs when under heavy graphics load (flightgear usually triggers this within a few minutes) and/or on battery power. This does not happen with the nvidia-319 binary driver.

The mouse pointer continues to move at first, but often freezes later; the keyboard LEDs do not react. Sounds already playing continue until finished, but no new ones start (i.e. applications are frozen). Alt+SysRq works.

The kernel log always has the error
nouveau ![ PFIFO][0000:01:00.0] unhandled status 0x00800000
usually preceded by several instances of the warning
nouveau W[ PFIFO][0000:01:00.0] INTR 0x01000000: 0x00000005
and followed by a wide range of other errors.

This happens with both the default (3.11.0) kernel (Ubuntu bug https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-nouveau/+bug/1243557 ) and the latest git kernel, with the default userspace (libdrm-nouveau 2.4.46/xserver-xorg-video-nouveau 1.0.9/mesa 9.2.1); the attached logs are with the latter (crashes at Nov 15 18:05 and 18:42 in the kernel log, the Xorg log is from the second of these).

With the latest git userspace (xorg/proto/dri3proto, mesa/drm, xcb/libxcb, xorg/lib/libxshmfence, mesa/mesa, xorg/proto/presentproto, xcb/proto, nouveau/xf86-video-nouveau), the system hangs on boot (at a correctly displayed graphical splash screen), with Alt+SysRq not working and nothing recognisable as an error in the logs. I suspect this is due to an incompatible combination of Ubuntu and latest-git (the recommended mesa/drm, nouveau/xf86-video-nouveau, mesa/mesa wouldn't compile on its own, as it needs dri3 which isn't in Ubuntu yet).

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 89286
Xorg log

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

There was a bug introduced in 3.11 (I think) for nvc1,nvd7,nvd9 which is fixed by http://cgit.freedesktop.org/nouveau/linux-2.6/commit/?id=89ad841ffd3eccd06e2a12649f4a5028ecb973b7. I'm not sure what the user-visible effect of the bug is, I suspect it depends on a lot of local configuration settings. I don't know if this will help you, but I believe it's worth trying.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 89287
kernel log

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

> http://cgit.freedesktop.org/nouveau/linux-2.6/commit/?id=89ad841ffd3eccd06e2a12649f4a5028ecb973b7

The git kernel I tested was yesterday's, so would already have that fix.

Changed in nouveau:
importance: Unknown → Medium
status: Unknown → Confirmed
Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 89421
kernel log 2

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 89422
Xorg log 2

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Switching to the 9.2 branch of mesa makes the git userspace start, but it still has the original bug. In the attached logs:

Nov 18 16:29 and Xorg log: git kernel, git userspace (libdrm head, xf86-video-nouveau head, mesa 9.2 branch head)
Nov 18 16:37: Ubuntu kernel, git userspace

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

This bug seemed to become harder to trigger when I installed git-as-of-Oct-26, then easier again when I installed git-as-of-Nov-14, but given that this change seemed to persist after returning to 3.11 (suggesting a left-behind configuration change) and the general randomness of the bug, this would not give a reliable bisection (so the date mismatch doesn't rule out #71662 being the same bug).

In 3.8.0 the same symptoms occur but the log message is
nouveau ![ PFIFO][0000:01:00.0] unhandled status 0x01000000
I suspect (but have not tested) that this change was http://cgit.freedesktop.org/nouveau/linux-2.6/commit/drivers/gpu/drm/nouveau/core/engine/fifo/nvc0.c?id=32256c87ead3edec86bed5023a0ff96a6d907931 ,i.e. this error was what is now the warning.

Is 0x00800000 unhandled because it is an inherently fatal error, or because nobody outside Nvidia knows what it means? https://github.com/envytools/envytools/blob/master/hwdocs/fifo/nvc0-pfifo.rst is a contentless stub.

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

*** Bug 71662 has been marked as a duplicate of this bug. ***

Revision history for this message
penalvch (penalvch) wrote : Re: [NVS 4200M] X freeze, "unhandled status 0x00800000"

Rebecca Palmer, this bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? If so, could you please test for this with the latest development release of Ubuntu? ISO images are available from http://cdimage.ubuntu.com/daily-live/current/ .

If it remains an issue, could you please run the following command in the development release from a Terminal (Applications->Accessories->Terminal), as it will automatically gather and attach updated debug information to this report:

apport-collect -p xserver-xorg-video-nouveau REPLACE-WITH-BUG-NUMBER

Please note, given that the information from the prior release is already available, doing this on a release prior to the development one would not be helpful.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

description: updated
tags: removed: apport-collected
Changed in xserver-xorg-video-nouveau (Ubuntu):
importance: Undecided → Low
status: New → Incomplete
Revision history for this message
Rebecca Palmer (rebecca-palmer) wrote :

It's worse in Trusty, occurring immediately on heavy graphics load and within a few minutes even in ordinary use.

The attached is the output of apport-bug xserver-xorg-video-nouveau --save (apport-collect isn't installed by default) after rebooting, and contains more instances of the warning than were normal in 13.10, but not the final error (I suspect it wasn't synced to disk).

It also displayed a "send problem report?" on reboot, which I OK'd but suspect is essentially the same information.

penalvch (penalvch)
tags: added: trusty
tags: added: bios-outdated-a19
summary: - [NVS 4200M] X freeze, "unhandled status 0x00800000"
+ 10de:1056 [Dell Latitude E6520] [NVS 4200M] X freeze, "unhandled status
+ 0x00800000"
description: updated
Changed in xserver-xorg-video-nouveau (Ubuntu):
status: Incomplete → Triaged
Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 91782
Ubuntu 14.04 (Linux 3.13.0) log

This bug is easier to trigger in Ubuntu Trusty (kernel 3.13.0, libdrm-nouveau2 2.4.50, xserver-xorg-video-nouveau 1.0.10, mesa 10.0.1), occurring immediately on heavy graphics load and within a few minutes even in ordinary use.

The attached log contains more instances of the warning than were normal in 13.10, but not the final error (I suspect it wasn't synced to disk).

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

Out of curiousity, does using blob pgraph fw help with this issue?

You can extract it yourself by using the instructions at http://nouveau.freedesktop.org/wiki/NVC0_Firmware/ or you can try using my script (https://raw2.github.com/imirkin/re-vp2/master/extract_firmware.py) with the 325.15 blob, although I'm not _100%_ sure that it produces the correct graph fw for nvd9; that's a very recently-added feature.

Note that you'll need to add nouveau.config=NvGrUseFW=1 to your kernel cmdline, and make sure that the fw is reachable when the nouveau module loads.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 91842
kernel log 3.13rc7 and 3.11+firmware

The 3.13-rc7 upstream kernel in Ubuntu 13.10 also sometimes crashes, but not as often as the near-identical kernel in Trusty; this might mean the regression is in userspace, or might be the same persistence (presumably a left-behind configuration change from 3.12) we saw earlier.

The blob firmware (in the standard 3.11 kernel) turns the login screen blank, with no recognisable error in the log.

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

(In reply to comment #12)
> The blob firmware (in the standard 3.11 kernel) turns the login screen
> blank, with no recognisable error in the log.

Was this with mmiotrace'd firmware or with firmware extracted using my script? If the latter, try moving nvd7_fuc*[cd] over nvd9_fuc*[cd] -- perhaps they're swapped. Also 3.11 contained a bug for nvd7/nvd9 (and nvc1) which was fixed by 89ad841ffd3e and backported to 3.12.x iirc. Although I'm not sure if that's important when the blob fw is used.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Your script, and kernel 3.13-rc7 with either the "nvd9" or "nvd7" firmware also gives a blank screen.

Revision history for this message
In , Kel-p-doran (kel-p-doran) wrote :

Jan 10 20:09:20 lap14 kernel: [ 1.610566] nouveau [ PGRAPH][0000:01:00.0] using external firmware
Jan 10 20:09:20 lap14 kernel: [ 1.611187] nouveau E[ PGRAPH][0000:01:00.0] failed to load fuc409c
Jan 10 20:09:20 lap14 kernel: [ 1.611190] nouveau E[ DEVICE][0000:01:00.0] failed to create 0x1800d915, -22
Jan 10 20:09:20 lap14 kernel: [ 1.611193] nouveau E[ DRM] failed to create 0x80000080, -22

You must have named the fuc files wrong or they were not in the initrd, if nouveau fails to load the firmware like this it simply bails out and the driver won't finish loading, hence the blank screen.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 92127
kernel log 3.13rc+firmware

What does "not in the initrd" mean? The files were in /lib/firmware/nouveau with the names the script gave them (nvd9_fuc*).

The errors with kernel 3.13-rc7 (attached) are similar:
Jan 13 22:47:57 lap14 kernel: [ 1.653189] nouveau [ PGRAPH][0000:01:00.0] using external firmware
Jan 13 22:47:57 lap14 kernel: [ 1.653203] nouveau 0000:01:00.0: Direct firmware load failed with error -2
Jan 13 22:47:57 lap14 kernel: [ 1.653204] nouveau 0000:01:00.0: Falling back to user helper
Jan 13 22:47:57 lap14 kernel: [ 1.653447] nouveau 0000:01:00.0: Direct firmware load failed with error -2
Jan 13 22:47:57 lap14 kernel: [ 1.653452] nouveau 0000:01:00.0: Falling back to user helper
Jan 13 22:47:57 lap14 kernel: [ 1.653651] nouveau E[ PGRAPH][0000:01:00.0] failed to load fuc409c
Jan 13 22:47:57 lap14 kernel: [ 1.653658] nouveau E[ DEVICE][0000:01:00.0] failed to create 0x1800d916, -22
Jan 13 22:47:57 lap14 kernel: [ 1.653663] nouveau E[ DRM] failed to create 0x80000080, -22
Jan 13 22:47:57 lap14 kernel: [ 1.654258] nouveau: probe of 0000:01:00.0 failed with error -22

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

(In reply to comment #16)
> Created attachment 92127 [details]
> kernel log 3.13rc+firmware
>
> What does "not in the initrd" mean? The files were in /lib/firmware/nouveau
> with the names the script gave them (nvd9_fuc*).

But was this /lib/firmware/nouveau available when the nouveau module loaded? E.g. if it loads from initrd, this needs to be in the initrd. If nouveau is built-in, the firmware files need to be baked into the kernel (I think).

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

It still doesn't find the firmware after updating the initramfs (sudo update-initramfs -u -k all), whether it is placed at /lib/firmware/nouveau/nvd9_fuc*, /lib/firmware/nouveau/fuc*, /lib/firmware/<kernel_version>/nouveau/nvd9_fuc* or /lib/firmware/<kernel_version>/nouveau/fuc*. (The script uses nvd9_fuc* but http://nouveau.freedesktop.org/wiki/NVC0_Firmware/ says just fuc* for pre-NVE0 cards.)

My 3.13-rc7 test kernel was compiled with the procedure in https://wiki.ubuntu.com/KernelTeam/GitKernelBuild; the .config lines containing "nouveau" are
CONFIG_DRM_NOUVEAU=m
CONFIG_NOUVEAU_DEBUG=5
CONFIG_NOUVEAU_DEBUG_DEFAULT=3
CONFIG_DRM_NOUVEAU_BACKLIGHT=y
and kernel/drivers/gpu/drm/nouveau/nouveau.ko is on the modules.order list, not the modules.builtin list.

Revision history for this message
In , Ilia Mirkin (imirkin) wrote :

(In reply to comment #18)
> It still doesn't find the firmware after updating the initramfs (sudo
> update-initramfs -u -k all), whether it is placed at
> /lib/firmware/nouveau/nvd9_fuc*, /lib/firmware/nouveau/fuc*,
> /lib/firmware/<kernel_version>/nouveau/nvd9_fuc* or
> /lib/firmware/<kernel_version>/nouveau/fuc*. (The script uses nvd9_fuc* but
> http://nouveau.freedesktop.org/wiki/NVC0_Firmware/ says just fuc* for
> pre-NVE0 cards.)

Erm, that's a lie. I've been meaning to fix it. I'm like 99.999% sure it always has to be nvXX_fucYYY[cd]. I'll double-check it again before updating the wiki. Putting it in /lib/firmware/nouveau is sufficient -- adding in the kernel version also works, but you'd normally just do that for firmware that was kernel version dependent, which this isn't.

>
> My 3.13-rc7 test kernel was compiled with the procedure in
> https://wiki.ubuntu.com/KernelTeam/GitKernelBuild; the .config lines
> containing "nouveau" are
> CONFIG_DRM_NOUVEAU=m
> CONFIG_NOUVEAU_DEBUG=5
> CONFIG_NOUVEAU_DEBUG_DEFAULT=3
> CONFIG_DRM_NOUVEAU_BACKLIGHT=y
> and kernel/drivers/gpu/drm/nouveau/nouveau.ko is on the modules.order list,
> not the modules.builtin list.

Don't know what to say. It just uses request_firmware() so it's whatever the kernel's normal mechanism for loading firmware.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 96610
kernel log 3.14rc-Mar-26

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 96611
Xorg log 3.14rc-Mar-26

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

This bug still exists in 3.14rc commit f217c44ebd41ce7369d2df07622b2839479183b0 (26 Mar Linus' tree, Ubuntu userspace; as the nouveau/master branch hasn't been used for 5 months, should we stop suggesting that people test with it?).

Is there anything else I can do to help? I will probably only have this machine for a few more months.

Revision history for this message
In , Rebecca Palmer (rebecca-palmer) wrote :

Created attachment 101990
kernel log 3.15rc-Jun-27

This bug still exists in drm-nouveau-next (commit 242a42eadfc17448a0d5b2ffc0cb191c8b51971a) with Ubuntu 14.04 userspace. The error message has changed to "E[ PFIFO][0000:01:00.0] INTR 0x00800000", and some of the "INTR 0x01000000: 0x00000005" warnings now come _after_ it.

In Ubuntu 14.04 (not 13.10) with either this git kernel or its standard 3.13, there is also a hang on resume from suspend (https://bugs.launchpad.net/ubuntu/+source/xserver-xorg-video-nouveau/+bug/1333417), but it is not clear whether this is a driver or BIOS problem.

In the attached, the original bug is at 08:09:47 and 08:20:06, the resume failure (which can log "GPU lockup", "failed to idle chanel 0xcccc0001 [Xorg[1185]]", or nothing) is at 08:14:53.

Revision history for this message
In , Adrian Bacircea (adrian-bacircea) wrote :

Hi, sorry for barging in but I'm also hitting this on the same graphics card (NVS 4200M) on a Dell Latitude E6520 running Arch Linux.
I'm trying to use PRIME for a VA-API + VDPAU setup and also for some games but as soon as I start I get this message in dmesg :
kernel: nouveau E[ PFIFO][0000:01:00.0] INTR 0x00800000
kernel: nouveau W[ PFIFO][0000:01:00.0] INTR 0x01000000: 0x00000005
I'm running the 3.15.6-1-ARCH kernel with the following packages:
intel-dri-git 10.3.0_devel.64000-1
lib32-intel-dri-git 10.3.0_devel.64007-1
lib32-mesa-git 10.3.0_devel.64007-1
lib32-mesa-libgl-git 10.3.0_devel.64007-1
lib32-nouveau-dri-git 10.3.0_devel.64007-1
lib32-opencl-mesa-git 10.3.0_devel.64007-1
lib32-svga-dri-git 10.3.0_devel.64007-1
libdrm-git 2.4.54.19.gc0b34dc-1
mesa-git 10.3.0_devel.64000-1
mesa-libgl-git 10.3.0_devel.64000-1
nouveau-dri-git 10.3.0_devel.64000-1
opencl-mesa-git 10.3.0_devel.64000-1
xf86-video-nouveau-git 1.0.10.34.gedd1608-1
If I stop the application the uses the NVIDIA card as soon as I see the message, everything is OK. But if I continue, X freezes and the only way I can regain control is by a cold reboot.

Revision history for this message
In , Adrian Bacircea (adrian-bacircea) wrote :

Created attachment 103302
system log

Revision history for this message
In , Adrian Bacircea (adrian-bacircea) wrote :

Created attachment 103307
system log nouveau kernel linux-3.16 branch

I've compiled the kernel on the linux-3.16 branch from http://cgit.freedesktop.org/nouveau/linux-2.6/ and X still freezes but now it recovers temporarily.
After it I can close the program that caused the freeze and then I can't use the nouveau card anymore (glxgears would cause another temporary freeze followed by a black window).
After a while, though, everything started to become unresponsive and I had to cold reboot to do anything.

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

I'm having the same problem on a Dell Latitude e6420 having an NVIDIA Corporation GF119M [NVS 4200M] GPU with the following software installed:
xf86-video-nouveau 1.0.11+31+g1ff13a9-1
mesa 11.0.5-1
xorg-server 1.18.0-3

The error I'm getting on dmesg is the following:
[ 215.375729] nouveau E[ PFIFO][0000:01:00.0] INTR 0x00800000

I've attached the complete Xorg log under the name "Xorg-21-11-15.log".

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

Created attachment 120001
Xorg-21-11-15 - Nouveau Xorg failure

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

Downgrading libdrm from 2.4.65 to 2.4.64, the kernel to 4.2.3 and xorg-server to 1.17.2 hasn't changed the behaviour. If there is anything else I should try, leave a message.

Revision history for this message
In , Bhyoram (bhyoram) wrote :

I experience the same issue.
I have noticed that it only occurs if plasma is built with gles2 support (on gentoo, requiring the while QT+KDE stack to be built with gles2).

A very similar freeze occurs with enlightenment/wayland, so I assume it's a bug in nouveau.

Changed in nouveau:
importance: Medium → High
Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

Created attachment 120290
dmesg log after the freeze occured.

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

Created attachment 120994
Added a more a dmesg debug log for the problem.

The freeze occurs after this error:
[ 409.260043] nouveau E[ PFIFO][0000:01:00.0] SCHED_ERROR [ CTXSW_TIMEOUT ]

As it can be seen imediately after the freeze, this error is spammed on dmesg every 5 seconds.

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

I've tested this problem again with the latest nouveau driver and the latest devel kernel kernel and is still reproducible.

Revision history for this message
In , Răpițeanu Viorel-Cătălin (rapiteanu-catalin) wrote :

Created attachment 121445
Full dmesg log with kernel 4.4.1

To post a comment you must log in.