system halted while idled for a long time.
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
Linux | Fix Released | Medium | | |
linux (Ubuntu) | | Medium | Jesse Sung | |
Precise | | Undecided | Jesse Sung | |
Quantal | | Medium | Jesse Sung | |
Bug Description
system: Lenovo ThinkCentre S510, 201108-8941
The display can't function normally. See the attached image (Photo 12-5-18 17 38 38.jpg).
It's hard to reproduce; still trying to find the root cause.
ProblemType: Bug
DistroRelease: Ubuntu 12.04
Package: linux-image-
ProcVersionSign
Uname: Linux 3.2.0-24-generic x86_64
AlsaVersion: Advanced Linux Sound Architecture Driver Version 1.0.24.
ApportVersion: 2.0.1-0ubuntu5
Architecture: amd64
AudioDevicesInUse:
USER PID ACCESS COMMAND
/dev/snd/
/dev/snd/
CRDA: Error: command ['iw', 'reg', 'get'] failed with exit code 1: nl80211 not found.
Card0.Amixer.info:
Card hw:0 'PCH'/'HDA Intel PCH at 0xfe500000 irq 46'
Mixer name : 'Intel CougarPoint HDMI'
Components : 'HDA:10ec0269,
Controls : 21
Simple ctrls : 9
Card1.Amixer.info:
Card hw:1 'Camera'/'Vimicro Corp. Integrated Camera at usb-0000:
Mixer name : 'USB Mixer'
Components : 'USB0ac8:c448'
Controls : 2
Simple ctrls : 1
Card1.Amixer.
Simple mixer control 'Mic',0
Capabilities: cvolume cvolume-joined cswitch cswitch-joined penum
Capture channels: Mono
Limits: Capture 0 - 48
Mono: Capture 38 [79%] [11.00dB] [on]
CurrentDmesg:
[ 14.048567] ADDRCONF(
[ 24.820147] eth0: no IPv6 routers present
[ 28.780122] audit_printk_skb: 36 callbacks suppressed
[ 28.780124] type=1400 audit(133758182
Date: Mon May 21 02:38:11 2012
HibernationDevice: RESUME=
InstallationMedia: Ubuntu 12.04 LTS "Precise Pangolin" - Release amd64 (20120425)
IwConfig:
lo no wireless extensions.
eth0 no wireless extensions.
MachineType: LENOVO 098765432112345
ProcEnviron:
TERM=xterm
PATH=(custom, no user)
LANG=en_US.UTF-8
SHELL=/bin/bash
ProcFB: 0 inteldrmfb
ProcKernelCmdLine: BOOT_IMAGE=
RelatedPackageV
linux-
linux-
linux-firmware 1.79
RfKill:
SourcePackage: linux
StagingDrivers: mei rts5139
UpgradeStatus: No upgrade log present (probably fresh install)
dmi.bios.date: 06/22/2011
dmi.bios.vendor: LENOVO
dmi.bios.version: 9PKT20AUS
dmi.board.
dmi.board.name: To be filled by O.E.M.
dmi.board.vendor: LENOVO
dmi.board.version: To be filled by O.E.M.
dmi.chassis.type: 3
dmi.chassis.vendor: To Be Filled By O.E.M.
dmi.chassis.
dmi.modalias: dmi:bvnLENOVO:
dmi.product.name: 098765432112345
dmi.product.
dmi.sys.vendor: LENOVO
Changed in linux (Ubuntu): | |
status: | New → Confirmed |
description: | updated |
summary: |
- system halted whilen resuming from suspend + system halted while resuming from suspend |
Changed in linux (Ubuntu): | |
importance: | Undecided → Medium |
Changed in linux (Ubuntu): | |
assignee: | Anthony Wong (anthonywong) → Jesse Sung (wenchien) |
summary: |
- system halted while resuming from suspend + system halted while idled for a long time. |
description: | updated |
Changed in linux (Ubuntu): | |
status: | Confirmed → In Progress |
Jesse Sung (wenchien) wrote : | #2 |
Jesse Sung (wenchien) wrote : | #3 |
After enabling RC6, S510 halts after a 30-minute idle.
|
#7 |
Ref: https:/
Hi,
Lenovo ThinkCentre S510 (SandyBridge i5-2500S) may hang after idling for some period of time. This may be related to rc6, since if I add i915.i915_
This issue also happens on kernel 3.4.
A screenshot when system hangs:
https:/
If there's any other information needed, please kindly let me know.
Thank you.
Jesse Sung (wenchien) wrote : | #4 |
Tested with mainline build kernel 3.4 in kernel-ppa, this issue still exists.
Please attach dmesg with drm.debug=0xe added to your kernel commandline. Also, how dead is the system? I.e. does network/ssh still work, does the magic SysRq to reboot still work, or is it a true hard-hang? And can you try to wire up netconsole so that we could have a peek at the last breaths of the system before it goes down?
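For readers unfamiliar with the drm.debug value requested above: it is a bitmask of debug categories. The sketch below only decomposes 0xe into its set bits. The labels for 0x01, 0x02 and 0x04 follow the DRM core's core/driver/kms categories; the meaning of 0x08 has differed between kernel versions, so it is deliberately left unnamed here. The dictionary and function names are illustrative, not part of any kernel interface.

```python
# Illustrative labels for the drm.debug category bits.
# Assumption: 0x08 is left generic because its meaning is version-dependent.
DRM_DEBUG_BITS = {
    0x01: "core",
    0x02: "driver",
    0x04: "kms",
    0x08: "bit 3 (version-dependent)",
}


def decode_drm_debug(mask):
    """Return the labels of every category bit that is set in mask."""
    return [label for bit, label in sorted(DRM_DEBUG_BITS.items()) if mask & bit]


# 0xe = 0b1110: every category above except "core".
print(decode_drm_debug(0xE))
```

In other words, 0xe asks for driver-level and modesetting messages while leaving out the noisier core messages.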
|
#9 |
Created attachment 63062
netconsole output
Hi Daniel,
Please find the attached file for netconsole output.
When it hangs, neither SysRq magic nor network/ssh works. From ssh terminal I can tell that it died after 57 minutes, but the last entry in the log is at 1787.315652, so there's no log when the system goes down.
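The gap described above can be estimated with a small sketch. It assumes the 57 minutes were counted from roughly the same starting point as the kernel timestamps (boot), which the report does not state, and the sample log text is hypothetical except for the final timestamp.

```python
import re

# Matches the "[ seconds.micros]" prefix the kernel puts on log lines.
TIMESTAMP_RE = re.compile(r"^\[\s*(\d+\.\d+)\]")


def last_timestamp(log_text):
    """Return the largest kernel timestamp (in seconds) in log_text, or None."""
    stamps = []
    for line in log_text.splitlines():
        match = TIMESTAMP_RE.match(line)
        if match:
            stamps.append(float(match.group(1)))
    return max(stamps) if stamps else None


# Hypothetical excerpt; only the value 1787.315652 comes from the report.
sample = (
    "[   24.820147] eth0: no IPv6 routers present\n"
    "[ 1787.315652] (last line of the netconsole capture)\n"
)

hang_after_s = 57 * 60  # time of death as seen from the ssh terminal
silent_s = hang_after_s - last_timestamp(sample)
print(f"log was silent for about {silent_s / 60:.1f} minutes before the hang")
```

Under that assumption the kernel logged nothing for roughly the last half of the run, which is consistent with "there's no log when the system goes down".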
Can you also attach dmesg so that we have all the interesting lines from boot-up with drm.debug=0xe, too?
|
#11 |
Created attachment 63064
dmesg
dmesg is attached.
Thank you.
|
#12 |
Hi Daniel,
Is there anything I can do to get more info about this issue?
I'm running a bit low on ideas, but one thing would be to stop all drm clients (i.e. X) and check whether it still hangs. We still need to load the drm/i915.ko driver, because only when we load and enable rc6 can the cpu die actually reach the lowest power state, i.e. I want to check whether this might be an issue outside of the gpu, only brought to light due to the low power state.
|
#14 |
Daniel,
Tried with a normal boot, and stopped all X related processes. System hangs after 15 hours.
|
#15 |
Ok, what happens if the i915 is never loaded at all? Try something like adding
blacklist i915 to modprobe.conf, or append i915.noload to your kernel commandline.
|
#16 |
Hi Chris,
After adding i915 to the blacklist and using text mode, the system runs without any problem and has an uptime of "2 days, 19:15" so far.
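Before trusting a long uptime from a blacklist test like this one, it helps to confirm that the module really is absent. A minimal sketch, assuming the standard /proc/modules layout in which the module name is the first field of each line; the function names and the sample text are illustrative.

```python
def module_loaded(name, modules_text):
    """True if name is the first field of any line of /proc/modules-style text."""
    for line in modules_text.splitlines():
        fields = line.split()
        if fields and fields[0] == name:
            return True
    return False


def i915_loaded(proc_modules="/proc/modules"):
    """Check the running system; only meaningful on Linux."""
    with open(proc_modules) as handle:
        return module_loaded("i915", handle.read())


# Hypothetical /proc/modules excerpt used for illustration.
sample = (
    "i915 473240 3 - Live 0xffffffffa0078000\n"
    "drm_kms_helper 46978 1 i915, Live 0xffffffffa0055000\n"
)
print(module_loaded("i915", sample))
```

On the test machine, `i915_loaded()` should return False for the whole duration of the blacklist run.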
tags: | added: quantal |
Jesse Sung (wenchien) wrote : | #6 |
Please try http://
Thank you.
|
#17 |
Created attachment 65942
disable rc6 for some models
Hi Daniel and Chris,
Since there's another snb machine that does not work well when rc6 is enabled ( https:/
|
#18 |
By any chance, does this patch help?
https:/
Changed in linux: | |
importance: | Unknown → Medium |
status: | Unknown → In Progress |
tags: |
added: blocks-hwcert removed: blocks-hwcerts |
|
#19 |
Hi Ben,
No, this patch does not help. System hangs after 2 days and 6 hours.
|
#20 |
Hi,
What do you think of the patch in #c10? Should I send it to the mailing list as well?
I think it'd be much better to figure out the root cause and fix it - since likely these rc6 issues don't have anything to do with these models specifically, we just haven't figured out yet what the real problem is.
|
#22 |
Hi Daniel,
Then I guess it's better to have a new bug entry for lp1008867. :)
https:/
I'll create one later.
Also, please could you suggest what I can do to get useful info for finding out the root cause?
Thank you.
Changed in linux: | |
status: | In Progress → Confirmed |
Changed in linux (Ubuntu): | |
status: | In Progress → Confirmed |
Changed in linux (Ubuntu Quantal): | |
status: | Confirmed → Fix Committed |
Changed in linux (Ubuntu Precise): | |
status: | New → Fix Committed |
Luis Henriques (henrix) wrote : | #23 |
This bug is awaiting verification that the kernel for Precise in -proposed solves the problem (3.2.0-32.51). Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification-
If verification is not done by one week from today, this fix will be dropped from the source code, and this bug will be closed.
See https:/
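When verifying a -proposed kernel, a quick sanity check is whether the running kernel's ABI number is at least that of the fixed package. The sketch below assumes the usual Ubuntu layout, `upstream-ABI-flavour` for `uname -r` and `upstream-ABI.upload` for the package version, and compares only within one upstream series. The function names are illustrative, and a matching ABI does not prove the exact upload (the `.51` part) is installed.

```python
import re


def kernel_abi(version):
    """Split '3.2.0-24-generic' or '3.2.0-32.51' into ((3, 2, 0), 24)."""
    match = re.match(r"^(\d+)\.(\d+)\.(\d+)-(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognised kernel version: {version!r}")
    major, minor, patch, abi = (int(part) for part in match.groups())
    return (major, minor, patch), abi


def abi_at_least(running, fixed):
    """True if running has the same upstream base as fixed and an ABI >= fixed's."""
    run_base, run_abi = kernel_abi(running)
    fix_base, fix_abi = kernel_abi(fixed)
    if run_base != fix_base:
        raise ValueError("versions belong to different upstream series")
    return run_abi >= fix_abi


# The kernel in the original report versus the Precise kernel in -proposed.
print(abi_at_least("3.2.0-24-generic", "3.2.0-32.51"))
```

For the versions quoted in this bug, the originally reported kernel (ABI 24) predates the fixed one (ABI 32), so the check prints False.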
tags: | added: verification-needed-precise |
Launchpad Janitor (janitor) wrote : | #24 |
This bug was fixed in the package linux - 3.5.0-16.24
---------------
linux (3.5.0-16.24) quantal-proposed; urgency=low
[ Andy Whitcroft ]
* SAUCE: ata_piix: add a disable_driver option
- LP: #994870
[ Christian König ]
* (pre-stable) drm/radeon: make 64bit fences more robust v3 (3.5 stable)
- LP: #1029582
[ David Henningsson ]
* SAUCE: ALSA: hda - use both input paths on Conexant auto parser
- LP: #1037642
* SAUCE: ALSA: hda - fix control names for multiple speaker out on
IDT/STAC
- LP: #1046734
[ Herton Ronaldo Krzesinski ]
* SAUCE: ALSA: hda/via - don't report presence on HPs with no presence
support
- LP: #1052499
* SAUCE: ext4: fix crash when accessing /proc/mounts concurrently
- LP: #1053019
* SAUCE: ALSA: hda/realtek - Fix detection of ALC271X codec
- LP: #1006690
[ Kyle Fazzari ]
* SAUCE: input: Cypress PS/2 Trackpad fix disabling tap-to-click
- LP: #1048816
[ Leann Ogasawara ]
* [Config] Disable CONFIG_DRM_AST
- LP: #1053290
[ Stefan Bader ]
* [Config] Disable the Cirrus QEMU drm driver
- LP: #1038055
[ Upstream Kernel Changes ]
* Revert "KVM: VMX: Fix KVM_SET_SREGS with big real mode segments"
- LP: #1045027
* x86, efi: Handover Protocol
* drm/i915: HDMI - Clear Audio Enable bit for Hot Plug
- LP: #1056729
* UBUNTU SAUCE: apparmor: fix IRQ stack overflow
- LP: #1056078
* drm/nouveau: fix booting with plymouth + dumb support
- LP: #1043518
* ALSA: hda - Add DeviceID for Haswell HDA
- LP: #1057698
* ALSA: hda - add Haswell HDMI codec id
- LP: #1057698
* ALSA: hda - Fix driver type of Haswell controller to AZX_DRIVER_SCH
- LP: #1057698
* ALSA: hda_intel: Add Device IDs for Intel Lynx Point-LP PCH
- LP: #1011438, #1057698
[ Wang Xingchao ]
* SAUCE: ALSA: hda - Add another pci id for Haswell board
- LP: #1057698
[ Wen-chien Jesse Sung ]
* SAUCE: drm/i915: Explicitly disable RC6 for certain models
- LP: #1002170, #1008867
-- Leann Ogasawara <email address hidden> Thu, 27 Sep 2012 13:55:52 -0700
Changed in linux (Ubuntu Quantal): | |
status: | Fix Committed → Fix Released |
Changed in linux (Ubuntu Precise): | |
assignee: | nobody → Jesse Sung (wenchien) |
TienFu Chen (ctf) wrote : | #25 |
Bug is fixed with kernel 3.5.0-16.25 on Quantal and 3.2.0-32.51 on Precise.
TienFu Chen (ctf) wrote : | #26 |
Continuing from comment #25: the test ran for over 12 hours.
Luis Henriques (henrix) wrote : | #27 |
As per comments #25 and #26 (and IRC chat), I'm tagging this bug as verified in Precise.
tags: |
added: verification-done-precise removed: verification-needed-precise |
Adam Conrad (adconrad) wrote : Update Released | #28 |
The verification of this Stable Release Update has completed successfully and the package has now been released to -updates. Subsequently, the Ubuntu Stable Release Updates Team is being unsubscribed and will not receive messages about this bug report. In the event that you encounter a regression using the package from -updates please report a new bug using ubuntu-bug and tag the bug report regression-update so we can easily find any regressions.
Launchpad Janitor (janitor) wrote : | #29 |
This bug was fixed in the package linux - 3.2.0-32.51
---------------
linux (3.2.0-32.51) precise-proposed; urgency=low
[ Luis Henriques ]
* Release Tracking Bug
- LP: #1056036
[ Keng-Yu Lin ]
* SAUCE: Intel xhci: Only switch the switchable ports
- LP: #1034814
[ Kyle Fazzari ]
* SAUCE: input: Cypress PS/2 Trackpad fix disabling tap-to-click
- LP: #1048816
[ Seth Forshee ]
* SAUCE: Input: synaptics - Adjust threshold for treating position values
as negative
- LP: #1046512
[ Stefan Bader ]
* Revert "SAUCE: Force xsave off on older Xen hypervisors"
- LP: #1044550
[ Upstream Kernel Changes ]
* Revert "HID: wiimote: fix invalid power_supply_powers call"
- LP: #1048605
* Revert "drm/radeon: fix bo creation retry path"
- LP: #1049899
* HID: wiimote: fix invalid power_supply_powers call
- LP: #1048605
* HID: add ASUS AIO keyboard model AK1D
- LP: #1027789, #1049899
* nfs: tear down caches in nfs_init_
- LP: #1049899
* NFS: Use kcalloc() when allocating arrays
- LP: #1049899
* NFSv4.1 fix page number calculation bug for filelayout decode buffers
- LP: #1049899
* fix page number calculation bug for block layout decode buffer
- LP: #1049899
* pnfs: defer release of pages in layoutget
- LP: #1049899
* ext4: avoid kmemcheck complaint from reading uninitialized memory
- LP: #1049899
* fuse: verify all ioctl retry iov elements
- LP: #1049899
* Bluetooth: Fix legacy pairing with some devices
- LP: #1049899
* xhci: Increase reset timeout for Renesas 720201 host.
- LP: #1049899
* xhci: Add Etron XHCI_TRUST_
- LP: #1049899
* USB: ftdi_sio: Add VID/PID for Kondo Serial USB
- LP: #1049899
* USB: option: Add Vodafone/Huawei K5005 support
- LP: #1049899
* USB: add USB_VENDOR_
- LP: #1049899
* USB: support the new interfaces of Huawei Data Card devices in option
driver
- LP: #1049899
* usb: serial: mos7840: Fixup mos7840_
- LP: #1049899
* usb: gadget: u_ether: fix kworker 100% CPU issue with still used
interfaces in eth_stop
- LP: #1049899
* ARM: 7483/1: vfp: only advertise VFPv4 in hwcaps if CONFIG_VFPv3 is
enabled
- LP: #1049899
* ARM: 7488/1: mm: use 5 bits for swapfile type encoding
- LP: #1049899
* ARM: 7489/1: errata: fix workaround for erratum #720789 on UP systems
- LP: #1049899
* drm/i915: ignore eDP bpc settings from vbt
- LP: #1049899
* ALSA: hda - fix Copyright debug message
- LP: #1049899
* sched: fix divide by zero at {thread_
- LP: #1049899
* ath9k: fix decrypt_error initialization in ath_rx_tasklet()
- LP: #1049899
* drm/nvd0/disp: mask off high 16 bit of negative cursor x-coordinate
- LP: #1049899
* drm/i915: reorder edp disabling to fix ivb MacBook Air
- LP: #1049899
* audit: don't free_chunk() after fsnotify_add_mark()
- LP: #1049899
* audit: fix refcounting in audit-tree
- LP: #1049899
* vfs: canonicalize create mode in build_open_flags()
- LP: #1049899
* PCI: EHCI: Fix crash d...
Changed in linux (Ubuntu Precise): | |
status: | Fix Committed → Fix Released |
|
#30 |
*** Bug 53626 has been marked as a duplicate of this bug. ***
Created attachment 71524
implement Hiz w/a for msaa
Kernel patch, please test.
Also: Is this an SNB GT1? Please spec the exact model and pci id of the VGA device.
Please test and provide the requested info.
|
#34 |
Hello, I have a similar issue. My system occasionally hangs with the same screen corruption, plays audio in a loop for about a second and then the laptop fan revs up.
It happens more often when playing some flash videos. Sometimes the system hangs twice a day, sometimes after a week. I'm going to try disabling rc6 after the next crash.
It's a MSI CR640 Sandy Bridge laptop with i3-2310M and:
00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller [8086:0116] (rev 09)
Thank you
Changed in linux: | |
status: | Confirmed → Incomplete |
|
#35 |
It is an Intel(R) Core(TM) i5-2500S CPU @ 2.70GHz, and the VGA device is
00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation Core Processor Family Integrated Graphics Controller [8086:0102] (rev 09) (prog-if 00 [VGA controller])
Subsystem: Lenovo Device [17aa:307b]
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
Latency: 0
Interrupt: pin A routed to IRQ 45
Region 0: Memory at fe000000 (64-bit, non-prefetchable) [size=4M]
Region 2: Memory at d0000000 (64-bit, prefetchable) [size=256M]
Region 4: I/O ports at f000 [size=64]
Expansion ROM at <unassigned> [disabled]
Capabilities: <access denied>
Kernel driver in use: i915
Kernel modules: i915
Output of lspci and the content of cpuinfo can be found at
https:/
https:/
I'll test the patch next week and report the result.
Thank you.
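The PCI ID requested earlier in the thread is the `[vendor:device]` pair that `lspci -nn` prints in brackets. A minimal sketch for pulling it out of such a line; the function name is illustrative, and the two sample lines are the ones quoted in this report.

```python
import re

# Matches a bracketed vendor:device pair such as [8086:0102].
# The class code, e.g. [0300], has no colon and is therefore skipped.
PCI_ID_RE = re.compile(r"\[([0-9a-fA-F]{4}):([0-9a-fA-F]{4})\]")


def pci_id(lspci_line):
    """Return (vendor, device) as lowercase hex strings, or None if absent."""
    match = PCI_ID_RE.search(lspci_line)
    if match is None:
        return None
    return match.group(1).lower(), match.group(2).lower()


s510 = (
    "00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation "
    "Core Processor Family Integrated Graphics Controller [8086:0102] (rev 09) "
    "(prog-if 00 [VGA controller])"
)
cr640 = (
    "00:02.0 VGA compatible controller [0300]: Intel Corporation 2nd Generation "
    "Core Processor Family Integrated Graphics Controller [8086:0116] (rev 09)"
)
print(pci_id(s510), pci_id(cr640))
```

The two machines share the vendor ID 8086 but report different device IDs, so they are different variants of the graphics controller.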
|
#36 |
The patch in comment 17 should be the right one. With it applied, the system is still alive after two days.
Thanks!
Awesome that this works out. Patch is merged into 3.8-rc2 as
commit 4283908ef7f11a7
Author: Daniel Vetter <email address hidden>
Date: Fri Dec 14 23:38:28 2012 +0100
drm/i915: Implement WaDisableHiZPla
I'm writing the mail to the stable kernel team right now so that it gets applied to older kernels. Thanks for reporting this issue.
Changed in linux: | |
status: | Incomplete → Fix Released |
S510 with 12.04 may halt in less than 24 hours. But if we disable RC6, it works quite well. Now that ctf has enabled RC6 again, let's see how it goes this time.