Random kernel Oops

Bug #998895 reported by Andy Low
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Expired
Medium
Unassigned

Bug Description

Kernel Oops occurs on new PC about once per day. It was worse with previous BIOS and with proprietary nvidia graphics driver, but still occurs with latest BIOS and nouveau. Last time it happened while doing stress and gtkperf at the same time, but this time I was using thunderbird normally - I had just clicked on an email and the text console came up with kernel log. Mouse still moved, but keyboard appeared dead - but it had processed alt-sysrq-1, alt-sysrq-t. kern.log is now about 250Mbytes.

It runs 3 full passes of memtest86 with no errors. After BIOS update on Thursday, I loaded BIOS defaults and have not changed any settings since then. Max cpu temperatures even when running stress are 42deg.

Fresh install of Ubuntu 12.04

ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: linux-image-3.2.0-24-generic-pae 3.2.0-24.37
ProcVersionSignature: Ubuntu 3.2.0-24.37-generic-pae 3.2.14
Uname: Linux 3.2.0-24-generic-pae i686
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24.
AplayDevices: aplay: device_list:252: no soundcards found...
ApportVersion: 2.0.1-0ubuntu7
Architecture: i386
ArecordDevices: arecord: device_list:252: no soundcards found...
AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path', '/dev/snd/controlC1', '/dev/snd/hwC1D0', '/dev/snd/hwC1D1', '/dev/snd/hwC1D2', '/dev/snd/hwC1D3', '/dev/snd/pcmC1D3p', '/dev/snd/pcmC1D7p', '/dev/snd/pcmC1D8p', '/dev/snd/pcmC1D9p', '/dev/snd/controlC0', '/dev/snd/hwC0D0', '/dev/snd/pcmC0D0c', '/dev/snd/pcmC0D0p', '/dev/snd/pcmC0D1p', '/dev/snd/pcmC0D2c', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1:
CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found.
Date: Sun May 13 22:15:04 2012
HibernationDevice: RESUME=UUID=e27a62ef-26df-4400-817d-a6ae9a293a88
InstallationMedia: Ubuntu 12.04 LTS "Precise Pangolin" - Release i386 (20120423)
IwConfig:
 lo no wireless extensions.

 eth0 no wireless extensions.
MachineType: System manufacturer System Product Name
ProcEnviron:
 LANGUAGE=en_GB:en
 TERM=xterm
 PATH=(custom, user)
 LANG=en_GB.UTF-8
 SHELL=/bin/bash
ProcFB: 0 nouveaufb
ProcKernelCmdLine: BOOT_IMAGE=/boot/vmlinuz-3.2.0-24-generic-pae root=UUID=0db90db2-ebe4-46ea-8c53-f752230491a4 ro quiet splash vt.handoff=7
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: No PulseAudio daemon running, or not running as session daemon.
RelatedPackageVersions:
 linux-restricted-modules-3.2.0-24-generic-pae N/A
 linux-backports-modules-3.2.0-24-generic-pae N/A
 linux-firmware 1.79
RfKill:

SourcePackage: linux
StagingDrivers: mei
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 05/03/2012
dmi.bios.vendor: American Megatrends Inc.
dmi.bios.version: 1016
dmi.board.asset.tag: To be filled by O.E.M.
dmi.board.name: P8Z77-M PRO
dmi.board.vendor: ASUSTeK COMPUTER INC.
dmi.board.version: Rev 1.xx
dmi.chassis.asset.tag: Asset-1234567890
dmi.chassis.type: 3
dmi.chassis.vendor: Chassis Manufacture
dmi.chassis.version: Chassis Version
dmi.modalias: dmi:bvnAmericanMegatrendsInc.:bvr1016:bd05/03/2012:svnSystemmanufacturer:pnSystemProductName:pvrSystemVersion:rvnASUSTeKCOMPUTERINC.:rnP8Z77-MPRO:rvrRev1.xx:cvnChassisManufacture:ct3:cvrChassisVersion:
dmi.product.name: System Product Name
dmi.product.version: System Version
dmi.sys.vendor: System manufacturer

Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :
Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :

Added attachment of result of

cat kern.log |grep -A 40 -B 3 BUG

Brad Figg (brad-figg)
Changed in linux (Ubuntu):
status: New → Confirmed
Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :

Added result of:

cat kern.log |grep -v "0000:01:00.0: PMFB0_SUBP" including full result of alt-sysrq-1, alt-sysrq-t

The stack trace at 10:01 was just a test that alt-sysrq-1, alt-sysrq-t works. The trace following the BUG at 22:04:04 is the one to look at.

The rest of the 250Mbytes of kern.log was full of pairs of lines like this:

May 13 08:52:04 (none) kernel: [ 280.103501] [drm] nouveau 0000:01:00.0: PMFB0_SUBP0: 0x037f0000
May 13 08:52:04 (none) kernel: [ 280.103503] [drm] nouveau 0000:01:00.0: PMFB0_SUBP1: 0x037f0010

These lines come when the mouse crosses edges of (some) windows - eg maximuses firefox to unity launcher and thousands come when gtkperf or googleearth are active.

Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

Would it be possible for you to test the latest upstream kernel? Refer to https://wiki.ubuntu.com/KernelMainlineBuilds . Please test the latest v3.4kernel[1] (Not a kernel in the daily directory). Once you've tested the upstream kernel, please remove the 'needs-upstream-testing' tag(Only that one tag, please leave the other tags). This can be done by clicking on the yellow pencil icon next to the tag located at the bottom of the bug description and deleting the 'needs-upstream-testing' text.

If this bug is fixed in the mainline kernel, please add the following tag 'kernel-fixed-upstream'.

If the mainline kernel does not fix this bug, please add the tag: 'kernel-bug-exists-upstream'.

If you are unable to test the mainline kernel, for example it will not boot, please add the tag: 'kernel-unable-to-test-upstream'.
Once testing of the upstream kernel is complete, please mark this bug as "Confirmed".

Thanks in advance.

http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.4-rc7-precise/

Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :

So I have been running it on 3.4.0-030400rc6-generic-pae today with no failures yet. I counldn't get rc7 as the files were not in http://kernel.ubuntu.com/~kernel-ppa/mainline/v3.4-rc7-precise/

Yesterday I ran it on the 64 bit kernel from Ubuntu 11.10 - also with no failures.

One day is not long enough to say that it is fixed. I will keep you informed.

Thanks for your support!!

tags: added: kernel-bug-exists-upstream
Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :

It crashed again after 3 days using 3.4.0-030400rc6-generic-pae. Similar in that it was when clicking on a email in thunderbird. I will post the kern.log when I get to the machine - it is my neighbour's (I could see it over ssh last night, but could not get it to scp to here).

I will put back the disk with 64 bit Ubuntu 11.10.

Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :
Revision history for this message
Andy Low (bj7u6139zd-andy-jjcftv6wld) wrote :

Uploaded kern.log_20120519 including boot at May 19 14:18:44 then crash at May 19 19:20:23

Changed in linux (Ubuntu):
status: Incomplete → Confirmed
Revision history for this message
Joseph Salisbury (jsalisbury) wrote :

This issue appears to be an upstream bug, since you tested the latest upstream kernel. Would it be possible for you to open an upstream bug report at bugzilla.kernel.org [1]? That will allow the upstream Developers to examine the issue, and may provide a quicker resolution to the bug.

If you are comfortable with opening a bug upstream, It would be great if you can report back the upstream bug number in this bug report. That will allow us to link this bug to the upstream report.

[1] https://wiki.ubuntu.com/Bugs/Upstream/kernel

Changed in linux (Ubuntu):
status: Confirmed → Triaged
Revision history for this message
penalvch (penalvch) wrote :

Andy Low, as per http://www.asus.com/Motherboards/P8Z77M_PRO/#support an update is available for your BIOS (2105). If you update to this following https://help.ubuntu.com/community/BiosUpdate , does it change anything?

If not, could you please both specify what happened, and provide the output of the following terminal command:
sudo dmidecode -s bios-version && sudo dmidecode -s bios-release-date

Please note your current BIOS is already in the Bug Description, so posting this on the old BIOS would not be helpful.

For more on BIOS updates and linux, please see https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette .

Thank you for your understanding.

tags: added: bios-outdated-2105
Changed in linux (Ubuntu):
status: Triaged → Incomplete
Revision history for this message
Launchpad Janitor (janitor) wrote :

[Expired for linux (Ubuntu) because there has been no activity for 60 days.]

Changed in linux (Ubuntu):
status: Incomplete → Expired
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.