ips-adjust - BUG: unable to handle kernel NULL pointer dereference at (null)

Bug #632430 reported by Filippo Carletti
24
This bug affects 4 people
Affects Status Importance Assigned to Milestone
linux (Ubuntu)
Fix Released
Undecided
Unassigned

Bug Description

Oops during boot-up.
Immediately before oops, the following line is logged:
intel ips 0000:00:1f.6: CPU power or thermal limit exceeded

ProblemType: KernelOops
DistroRelease: Ubuntu 10.10
Package: linux-image-2.6.35-19-generic 2.6.35-19.28
Regression: Yes
Reproducible: Yes
ProcVersionSignature: Ubuntu 2.6.35-20.29-generic 2.6.35.4
Uname: Linux 2.6.35-20-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.23.
Annotation: Your system might become unstable now and might need to be restarted.
Architecture: amd64
ArecordDevices:
 **** List of CAPTURE Hardware Devices ****
 card 0: Intel [HDA Intel], device 0: ALC275 Analog [ALC275 Analog]
   Subdevices: 1/1
   Subdevice #0: subdevice #0
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: filippo 1765 F.... pulseaudio
 /dev/snd/pcmC0D0p: filippo 1765 F...m pulseaudio
CRDA: Error: [Errno 2] No such file or directory
Card0.Amixer.info:
 Card hw:0 'Intel'/'HDA Intel at 0xdb400000 irq 49'
   Mixer name : 'Intel IbexPeak HDMI'
   Components : 'HDA:10ec0275,104d4100,00100005 HDA:80862804,104d4100,00100000'
   Controls : 19
   Simple ctrls : 9
Date: Mon Sep 6 22:49:24 2010
Failure: oops
HibernationDevice: RESUME=UUID=cfb5b782-c971-4cdb-96bb-abe17076731f
InstallationMedia: Ubuntu 10.04 LTS "Lucid Lynx" - Release amd64 (20100429)
MachineType: Sony Corporation VPCS11G7E
ProcCmdLine: BOOT_IMAGE=/boot/vmlinuz-2.6.35-20-generic root=UUID=48bd86da-d44c-4f65-826c-eb99d593bc47 ro quiet splash
RelatedPackageVersions: linux-firmware 1.38
SourcePackage: linux
Title: BUG: unable to handle kernel NULL pointer dereference at (null)
dmi.bios.date: 01/15/2010
dmi.bios.vendor: INSYDE
dmi.bios.version: R0281Q3
dmi.board.asset.tag: N/A
dmi.board.name: VAIO
dmi.board.vendor: Sony Corporation
dmi.board.version: N/A
dmi.chassis.asset.tag: N/A
dmi.chassis.type: 10
dmi.chassis.vendor: Sony Corporation
dmi.chassis.version: N/A
dmi.modalias: dmi:bvnINSYDE:bvrR0281Q3:bd01/15/2010:svnSonyCorporation:pnVPCS11G7E:pvrC1041HEG:rvnSonyCorporation:rnVAIO:rvrN/A:cvnSonyCorporation:ct10:cvrN/A:
dmi.product.name: VPCS11G7E
dmi.product.version: C1041HEG
dmi.sys.vendor: Sony Corporation

Revision history for this message
Filippo Carletti (filippo-carletti) wrote :
Revision history for this message
Filippo Carletti (filippo-carletti) wrote :

Same oops using latest Linus kernel: 2.6.36-rc4.

Revision history for this message
Derek (bugs-m8y) wrote :

I think I'm experiencing the same issue on my system. Sony VPC-B11KGX.

Hardware seems identical to his.

Attached dmesg output.

Also, I appear to have no sound. I'm guessing that might be related...

Revision history for this message
Martin Stjernholm (msub) wrote :

I noticed the same oops (or one with identical stack, at least) when coming out of hibernation. It did not affect the sound in my case, though, and I haven't noticed any other malfunction, at least not any that I currently would pin on this. HP Envy 14.

Maybe this is relevant: http://permalink.gmane.org/gmane.linux.kernel/1036685

Revision history for this message
Derek (bugs-m8y) wrote :

Yeah, sound was an unrelated bug:
https://launchpad.net/bugs/546769

Also I filed: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/640154

For the oops, and am blacklisting that module for now.

Revision history for this message
Dave Gilbert (ubuntu-treblig) wrote :

So this bug is on a VAIO/VPCS11G7E, bugs 640154 and bug 646215 are both on VPCB11CGX and Bug 653591 is on
an Aspire 5820TG - all look like the same thing to me (646215 is 32bit but gives the same line from the ips_adjust+0x293 it has)
comes out as intel_ipc.c line 553 for me:

     c00: ff 53 70 callq *0x70(%rbx)
     c03: 84 c0 test %al,%al
     c05: eb 01 jmp c08 <ips_adjust+0x348>
     c07: 90 nop
     c08: 0f 85 12 ff ff ff jne b20 <ips_adjust+0x260>

        if (!ips->gpu_lower())
                ips->gpu_turbo_enabled = false;

so is gpu_lower NULL somehow?

I can see in the boot on this machine is the message:

[ 11.358121] intel ips 0000:00:1f.6: failed to get i915 symbols, graphics turbo disabled

which is the message that gets printed just after gpu_turbo_enabled is set to false; so I guess something must
be setting gpu_turbo_enabled = true later? Maybe update_turbo_limits or ips_irq_handler which look like they set it possibly from bios/hardware without taking account if it had got dropped due to symbols earlier?

Dave

Revision history for this message
Andy Whitcroft (apw) wrote :

This looks like it might be a bug in the ips_intel driver. I have attempted to patch up the bug and have applied the patch to some kernels. Perhaps those of you who can reproduce this issue could try the kernels at the URL below and report back. The kernels will be at the URL below once they finish synching:

    http://people.canonical.com/~apw/lp632430-maverick/

Please report any testing back here. Thanks.

Revision history for this message
Filippo Carletti (filippo-carletti) wrote :

The patched kernel solves the Oops issue.
Also, the "CPU power or thermal limit exceeded" boot-time message disappeared.

Revision history for this message
Martin Stjernholm (msub) wrote :

The oops is gone for me as well. Thanks.

Revision history for this message
Derek (bugs-m8y) wrote :

Fixed for me as well, I suppose that confirms the bug I was asked to file is a dupe.

Revision history for this message
Derek (bugs-m8y) wrote :

BTW, latest kernel update wiped out your custom kernel, so I'm back to blacklisting ips-adjust again.

Brad Figg (brad-figg)
tags: added: acpi-brightness
Andy Whitcroft (apw)
Changed in linux (Ubuntu):
status: New → Fix Released
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.