System hangs randomly

Bug #1388645 reported by EricDP
12
This bug affects 2 people
Affects Status Importance Assigned to Milestone
linux-lts-trusty (Ubuntu)
Invalid
Low
Unassigned

Bug Description

I recently upgraded from 12.04 to 14.04. Afterwards, my system started hanging randomly, sometimes after just a couple hours of use, and the longest it went was about 32 hours. It's a full on freeze, not just X: Ctrl+Alt+Fx does nothing. SysRq+REISUB does nothing. I've tried tailing syslog and watching dmesg and there is nothing interesting logged near the hang.

I ran memtest86+ two full passes, no errors.

Assuming maybe there was a problem with the installation or upgrade process, I tried booting from a 14.04.1 USB (tried both 32-bit and 64-bit), and both hang after a few hours. I also tried stepping back, using a 12.10 USB that I found and it also hangs.

Assuming it was related to newer kernel, I then tried to boot from a brand new 12.04.5 USB (using the latest 3.13.0-32-generic kernel), and the system did *NOT* hang... I ran it for five full days with no problems before I rebooted to continue troubleshooting the problem -- so the newest 12.04 still seems good and the latest kernel seems good. But something in 12.10 and beyond is bad.

I installed a crash-dump kernel and it doesn't seem to work - still hangs and doesn't load the kernel.

I attempted to find help in the ubuntu forums, without much luck: http://ubuntuforums.org/showthread.php?t=2249927

Output of lspci:

00:00.0 Host bridge: Intel Corporation 82Q963/Q965 Memory Controller Hub (rev 02)
00:01.0 PCI bridge: Intel Corporation 82Q963/Q965 PCI Express Root Port (rev 02)
00:03.0 Communication controller: Intel Corporation 82Q963/Q965 HECI Controller (rev 02)
00:19.0 Ethernet controller: Intel Corporation 82566DM Gigabit Network Connection (rev 02)
00:1a.0 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #4 (rev 02)
00:1a.1 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #5 (rev 02)
00:1a.7 USB controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #2 (rev 02)
00:1c.0 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 1 (rev 02)
00:1c.1 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 2 (rev 02)
00:1c.2 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 3 (rev 02)
00:1c.3 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 4 (rev 02)
00:1c.4 PCI bridge: Intel Corporation 82801H (ICH8 Family) PCI Express Port 5 (rev 02)
00:1d.0 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #1 (rev 02)
00:1d.1 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #2 (rev 02)
00:1d.2 USB controller: Intel Corporation 82801H (ICH8 Family) USB UHCI Controller #3 (rev 02)
00:1d.7 USB controller: Intel Corporation 82801H (ICH8 Family) USB2 EHCI Controller #1 (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev f2)
00:1f.0 ISA bridge: Intel Corporation 82801HO (ICH8DO) LPC Interface Controller (rev 02)
00:1f.2 SATA controller: Intel Corporation 82801HR/HO/HH (ICH8R/DO/DH) 6 port SATA Controller [AHCI mode] (rev 02)
00:1f.3 SMBus: Intel Corporation 82801H (ICH8 Family) SMBus Controller (rev 02)
01:00.0 VGA compatible controller: NVIDIA Corporation G73 [GeForce 7300 GT] (rev a1)
07:00.0 Ethernet controller: Qualcomm Atheros AR5212/AR5213 Wireless Network Adapter (rev 01)
07:01.0 Multimedia audio controller: Ensoniq 5880B [AudioPCI] (rev 02)
07:03.0 FireWire (IEEE 1394): Texas Instruments TSB43AB22A IEEE-1394a-2000 Controller (PHY/Link) [iOHCI-Lynx]

output of /proc/cpuinfo:

processor : 0
vendor_id : GenuineIntel
cpu family : 6
model : 15
model name : Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz
stepping : 6
microcode : 0xcb
cpu MHz : 1596.000
cache size : 4096 KB
physical id : 0
siblings : 2
core id : 0
cpu cores : 2
apicid : 0
initial apicid : 0
fdiv_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 10
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc arch_perfmon pebs bts aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm dtherm tpr_shadow
bogomips : 4794.89
clflush size : 64
cache_alignment : 64
address sizes : 36 bits physical, 48 bits virtual
power management:

processor : 1
vendor_id : GenuineIntel
cpu family : 6
model : 15
model name : Intel(R) Core(TM)2 CPU 6600 @ 2.40GHz
stepping : 6
microcode : 0xcb
cpu MHz : 1596.000
cache size : 4096 KB
physical id : 0
siblings : 2
core id : 1
cpu cores : 2
apicid : 1
initial apicid : 1
fdiv_bug : no
f00f_bug : no
coma_bug : no
fpu : yes
fpu_exception : yes
cpuid level : 10
wp : yes
flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc arch_perfmon pebs bts aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm dtherm tpr_shadow
bogomips : 4794.89
clflush size : 64
cache_alignment : 64
address sizes : 36 bits physical, 48 bits virtual
power management:
---
ApportVersion: 2.14.1-0ubuntu3.5
Architecture: i386
AudioDevicesInUse:
 USER PID ACCESS COMMAND
 /dev/snd/controlC0: eric 3109 F.... pulseaudio
 /dev/snd/controlC1: eric 3109 F.... pulseaudio
CurrentDesktop: Unity
DistroRelease: Ubuntu 14.04
HibernationDevice: RESUME=UUID=b4985d2b-0b32-495e-90c6-4acaa1805b45
NonfreeKernelModules: nvidia
Package: linux (not installed)
ProcFB:

ProcKernelCmdLine: root=UUID=93d1b75d-d714-4a99-8759-d53956a1c447 ro quiet splash crashkernel=384M-2G:64M,2G-:128M
ProcVersionSignature: Ubuntu 3.13.0-39.66-generic 3.13.11.8
PulseList: Error: command ['pacmd', 'list'] failed with exit code 1: Daemon not responding.
RelatedPackageVersions:
 linux-restricted-modules-3.13.0-39-generic N/A
 linux-backports-modules-3.13.0-39-generic N/A
 linux-firmware 1.127.7
RfKill:
 0: phy0: Wireless LAN
  Soft blocked: no
  Hard blocked: no
Tags: trusty
Uname: Linux 3.13.0-39-generic i686
UpgradeStatus: Upgraded to trusty on 2014-09-09 (55 days ago)
UserGroups:

WpaSupplicantLog:

_MarkForUpload: True
dmi.bios.date: 06/18/2008
dmi.bios.vendor: Intel Corp.
dmi.bios.version: CO96510J.86A.6077.2008.0618.0057
dmi.board.asset.tag: Base Board Asset Tag
dmi.board.name: DQ965GF
dmi.board.vendor: Intel Corporation
dmi.board.version: AAD41676-305
dmi.chassis.type: 3
dmi.modalias: dmi:bvnIntelCorp.:bvrCO96510J.86A.6077.2008.0618.0057:bd06/18/2008:svn:pn:pvr:rvnIntelCorporation:rnDQ965GF:rvrAAD41676-305:cvn:ct3:cvr:

Revision history for this message
EricDP (eric-arcticbears) wrote :
Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1388645/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Revision history for this message
EricDP (eric-arcticbears) wrote :

Assuming kernel bug as it happens somewhat randomly and is not associated with anything I'm doing (or not).

affects: ubuntu → linux-lts-trusty (Ubuntu)
tags: added: apport-collected trusty
description: updated
Revision history for this message
EricDP (eric-arcticbears) wrote : AlsaInfo.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : BootDmesg.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : CRDA.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : CurrentDmesg.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : IwConfig.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : Lspci.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : Lsusb.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : ProcCpuinfo.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : ProcEnviron.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : ProcInterrupts.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : ProcModules.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : UdevDb.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : UdevLog.txt

apport information

Revision history for this message
EricDP (eric-arcticbears) wrote : WifiSyslog.txt

apport information

Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in linux-lts-trusty (Ubuntu):
status: New → Confirmed
Revision history for this message
EricDP (eric-arcticbears) wrote :

Anything I can do to help move this forward? Any more info required? Any investigating I could do? Thanks for any help!

Revision history for this message
EricDP (eric-arcticbears) wrote :

Understanding the Linux Mint is derived from Ubuntu, I tried Mint 17.1. It also hangs.

Understanding that Ubuntu is derived from Debian (though it roots may be far back), I tried Debian 7.8.0. It does *NOT* hang.

Revision history for this message
EricDP (eric-arcticbears) wrote :

Perhaps a clue: after the upgrade, I was also finding my wifi was a little flaky. Last week I disabled wifi and unloaded the ath5k drivers and other related drivers. The system has been up 10 days now, which is the longest uptime since this problem started.

From dmesg:

ath5k: phy0: Atheros AR2414 chip found (MAC: 0x79, PHY: 0x45)

Could this be the culprit?

Revision history for this message
penalvch (penalvch) wrote :

EricDP, thank you for reporting this and helping make Ubuntu better.

As per https://downloadcenter.intel.com/product/22642/Intel-Desktop-Board-DQ965GF an update to your computer's buggy and outdated BIOS is available (6100). If you update to this following https://help.ubuntu.com/community/BIOSUpdate does it change anything?

For more on BIOS updates and linux, please see https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette .

Please note your current BIOS is already in the Bug Description, so posting this on the old BIOS would not be helpful. As well, you don't have to create a new bug report.

Once the BIOS is updated, if the problem is still reproducible:
1) Please provide the output of the following terminal command (not perform an apport-collect):
sudo dmidecode -s bios-version && sudo dmidecode -s bios-release-date
2) Please make a comment specifically advising on if there was an improvement or not.
3) Please mark this report Status Confirmed. If it's not reproducible, please mark this as Invalid.

Thank you for your understanding.

tags: added: bios-outdated-6100
Changed in linux-lts-trusty (Ubuntu):
importance: Undecided → Low
status: Confirmed → Incomplete
Revision history for this message
EricDP (eric-arcticbears) wrote :

BIOS updated... good find. Thanks for that.

# dmidecode -s bios-version
CO96510J.86A.6100.2009.1004.2331
# dmidecode -s bios-release-date
10/04/2009

I've re-enabled wifi. Let's see how long it goes. If, after two weeks, all seems well, I'll mark it resolved. Fingers crossed...

Revision history for this message
EricDP (eric-arcticbears) wrote :

Up 17 days with wifi enabled... no issues. I'm going to say that fixed it. Thanks!!

Changed in linux-lts-trusty (Ubuntu):
status: Incomplete → Invalid
To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.