watchdog detected hard lockup on cpu 1

Bug #1004313 reported by Orange-juice
52
This bug affects 8 people
Affects Status Importance Assigned to Milestone
openSUSE
Won't Fix
Critical
linux (Ubuntu)
Incomplete
Low
Unassigned

Bug Description

On the latest kernel (3.1 and up?) waking up from sleep may fail with the message: "watchdog detected hard lockup on cpu 1"

Ubuntu 12.04 x64

Tags: bot-comment
Revision history for this message
Ubuntu Foundations Team Bug Bot (crichton) wrote :

Thank you for taking the time to report this bug and helping to make Ubuntu better. It seems that your bug report is not filed about a specific source package though, rather it is just filed against Ubuntu in general. It is important that bug reports be filed about source packages so that people interested in the package can find the bugs about it. You can find some hints about determining what package your bug might be about at https://wiki.ubuntu.com/Bugs/FindRightPackage. You might also ask for help in the #ubuntu-bugs irc channel on Freenode.

To change the source package that this bug is filed about visit https://bugs.launchpad.net/ubuntu/+bug/1004313/+editstatus and add the package name in the text box next to the word Package.

[This is an automated message. I apologize if it reached you inappropriately; please just reply to this message indicating so.]

tags: added: bot-comment
Revision history for this message
Launchpad Janitor (janitor) wrote :

Status changed to 'Confirmed' because the bug affects multiple users.

Changed in ubuntu:
status: New → Confirmed
Revision history for this message
Nils Geylen (nilsgeylen) wrote :

This is a kernel 3.1 (and up?) bug, not Ubuntu-specific.

Changed in ubuntu:
status: Confirmed → Opinion
status: Opinion → Confirmed
Revision history for this message
Attila Lendvai (attila-lendvai) wrote :

i think it's a me, too.

it happens about every 5th time when my laptop tries to wake up from sleep. it got introduced somewhat recently, because i keep my debian testing more or less up to date, and it started to happen about a month ago.

on debian testing, on a Dell Latitude E6320:
Linux xxx 3.2.0-3-amd64 #1 SMP Thu Jun 28 09:07:26 UTC 2012 x86_64 GNU/Linux

below are some pieces of backtraces that i copied by hand. unfortunately the screen gets updated every few seconds with different backtraces (of different CPU's?), so i could only copy portions of the backtraces. a newline below means its from a different backtrace.

"watchdog detected hard lockup"
warn_slowpath_fmt

power_supply_am_i_supplied

native_flush_tlb_others
...
eventfd_ctx_read
try_to_wake_up

wq_worker_sleeping

acpi_ns_evaluate

arch_read_lock
led_trigger_event

menu_select
cpuidle_idle_call

CPU0: Intel(R) Core(TM) i7-2620M CPU @ 2.70GHz stepping 07
Performance Events: PEBS fmt1+, SandyBridge events, Intel PMU driver.
PEBS disabled due to CPU errata.
... version: 3
... bit width: 48
... generic registers: 4
... value mask: 0000ffffffffffff
... max period: 000000007fffffff
... fixed-purpose events: 3
... event mask: 000000070000000f
NMI watchdog enabled, takes one hw-pmu counter.
Booting Node 0, Processors #1
NMI watchdog enabled, takes one hw-pmu counter.
 #2
NMI watchdog enabled, takes one hw-pmu counter.
 #3
NMI watchdog enabled, takes one hw-pmu counter.
Brought up 4 CPUs
Total of 4 processors activated (21550.18 BogoMIPS).

description: updated
Revision history for this message
In , A-smi (a-smi) wrote :

User-Agent: Mozilla/5.0 (X11; Linux i686; rv:15.0) Gecko/20100101 Firefox/15.0.1

This problem occurred after I opened gmail chat talk plugin configuraton to verify web camera setting. Crash occurs randomly when I click on "verify settings"

I didn't try any other applications.

Neither of the log files contain useful information at the time of the event.

Please advise what should I install/configure to actually capture some information about the event.

Reproducible: Sometimes

Steps to Reproduce:
1. In firefox open gmail settins
2. Select Chat tab
3. Select "verify your settings"
Actual Results:
System freezes. One time it produced a console output (attached as an image)

Expected Results:
Should see a vidoe in a small frame etc, without a crash.

System #1 configuration:
Motherboard - Gigabyte GA-Z68AP-D3
CPU - i7-2600
RAM - 16Gb (CORSAIR DOMINATOR, 4x4 CMP8GX3M2A1600C9), using XMP profile
HDD - OCZ Vertex 4
Video - ATI FirePro 2460 (using a stock driver)
Webcamera - 04f2:b029 Chicony Electronics Co., Ltd 1.3M UVC

Revision history for this message
In , A-smi (a-smi) wrote :

Created an attachment (id=508553)
Screenshot of the event.

Revision history for this message
In , A-smi (a-smi) wrote :

Currently using kernel 3.4.13-1 and have a kdump ready for review. It is large.
Let me know if it is ok to attach it here.

Changed in opensuse:
importance: Unknown → Critical
status: Unknown → Confirmed
Revision history for this message
Gregaloch (scott-scottmcgregor) wrote :

This started with LibraOffice. If it helps, my system is HP Pavilion Elite HPE-101f, AY597AA-ABA, AMD Phenom II X4 925, H-RS880-uATX (Aloe) motherboard, ATI Radeon HD4350, 1Tb internal HDD, 2Tb internal HDD, 8Gb RAM, Running exclusively Ubuntu 12.04 -64 (boots on first 1Tb HDD), Two Monitors
LibraWriter (Libraoffice) chrashes each time I try to load it. I update & upgrade every few days via terminal. For several days I’ve tried to load LibraWriter. The last 3 docs went into Recovery, ten “Finished” and a flash- then nothing. (ergo reproducible consistently). I updated LibraOffice Suite completely. No cigar. I restarted the system and the errors below caused it to hang on “shutdown”.
WARNING: at /build/buildd/linux-3.2.0/kernel/watchdog.c:241 watchdog_overflow_callback+0x9a/0xc0
Pid: 23200, comm: killall5 Tainted: P D W 0 3.2.0-37-generic #58-Ubuntu
Watchdog detected hard LOCKUP on cpu 1 (and 2, 3, and 4)
Call Traces included (a couple of many): warn_slowpath_common+0x7f/0xc0; …overflow_callback+0x9a/0xc0; system_call_fastpath+0x16/0x1b
 I'll report back if I find anything. Thank you for your assistance,
Scott

Revision history for this message
In , A-smi (a-smi) wrote :

The problem appears to be related to a faulty motherboard by Gigabyte.

Changed in opensuse:
status: Confirmed → Won't Fix
Revision history for this message
penalvch (penalvch) wrote :

Orange-juice, this bug was reported a while ago and there hasn't been any activity in it recently. We were wondering if this is still an issue? If so, could you please test for this with the latest development release of Ubuntu? ISO images are available from http://cdimage.ubuntu.com/daily-live/current/ .

If it remains an issue, could you please run the following command in the development release from a Terminal (Applications->Accessories->Terminal), as it will automatically gather and attach updated debug information to this report:

apport-collect -p linux <replace-with-bug-number>

Also, could you please test the latest upstream kernel available (not the daily folder) following https://wiki.ubuntu.com/KernelMainlineBuilds ? It will allow additional upstream developers to examine the issue. Once you've tested the upstream kernel, please comment on which kernel version specifically you tested. If this bug is fixed in the mainline kernel, please add the following tags:
kernel-fixed-upstream
kernel-fixed-upstream-VERSION-NUMBER

where VERSION-NUMBER is the version number of the kernel you tested. For example:
kernel-fixed-upstream-v3.13-rc3

This can be done by clicking on the yellow circle with a black pencil icon next to the word Tags located at the bottom of the bug description. As well, please remove the tag:
needs-upstream-testing

If the mainline kernel does not fix this bug, please add the following tags:
kernel-bug-exists-upstream
kernel-bug-exists-upstream-VERSION-NUMBER

As well, please remove the tag:
needs-upstream-testing

Once testing of the upstream kernel is complete, please mark this bug's Status as Confirmed. Please let us know your results. Thank you for your understanding.

affects: ubuntu → linux (Ubuntu)
Changed in linux (Ubuntu):
importance: Undecided → Medium
status: Confirmed → Incomplete
Revision history for this message
Jaime Pérez (jaime-91) wrote :

We are having that error with Ubuntu 14.04.

bug 1275116

Revision history for this message
penalvch (penalvch) wrote :

Jaime Pérez, thank you for your comment. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into a Ubuntu repository kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message
Journeyman (jman-f) wrote :

Happening here too after 14.04 update

Revision history for this message
penalvch (penalvch) wrote :

Journeyman, thank you for your comment. So your hardware and problem may be tracked, could you please file a new report with Ubuntu by executing the following in a terminal while booted into a Ubuntu repository kernel (not a mainline one) via:
ubuntu-bug linux

For more on this, please read the official Ubuntu documentation:
Ubuntu Bug Control and Ubuntu Bug Squad: https://wiki.ubuntu.com/Bugs/BestPractices#X.2BAC8-Reporting.Focus_on_One_Issue
Ubuntu Kernel Team: https://wiki.ubuntu.com/KernelTeam/KernelTeamBugPolicies#Filing_Kernel_Bug_reports
Ubuntu Community: https://help.ubuntu.com/community/ReportingBugs#Bug_reporting_etiquette

When opening up the new report, please feel free to subscribe me to it.

Thank you for your understanding.

Helpful bug reporting tips:
https://wiki.ubuntu.com/ReportingBugs

Revision history for this message
TK (biotech) wrote :

Experiencing this problem too on 14.04.2, kernel 3.16.0-36.

Just filed a new report to gather hardware info.

Revision history for this message
Roman Gritsulyak (rtg-mail) wrote :
Download full text (5.3 KiB)

uname -a
Linux rtg-Aspire-V3-571G 3.16.0-45-generic #60~14.04.1-Ubuntu SMP Fri Jul 24 21:16:23 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

kern.log:

 ------------[ cut here ]------------
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188271] WARNING: CPU: 7 PID: 58 at /build/linux-lts-utopic-AIjdaQ/linux-lts-utopic-3.16.0/kernel/watchdog.c:265 watchdog_overflow_callback+0x9c/0xd0()
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188273] Watchdog detected hard LOCKUP on cpu 7
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188274] Modules linked in: ctr ccm pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) bnep rfcomm nls_iso8859_1 ath3k btusb bluetooth 6lowpan_iphc uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_core v4l2_common videodev media snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm crct10dif_pclmul arc4 crc32_pclmul snd_hda_intel snd_hda_controller snd_hda_codec ath9k snd_hwdep snd_pcm ath9k_common ghash_clmulni_intel ath9k_hw aesni_intel snd_seq_midi snd_seq_midi_event aes_x86_64 lrw snd_rawmidi gf128mul glue_helper ablk_helper ath cryptd snd_seq snd_seq_device mac80211 nouveau snd_timer serio_raw i915 cfg80211 mxm_wmi ttm drm_kms_helper lpc_ich snd drm mei_me mei soundcore i2c_algo_bit shpchp joydev parport_pc ppdev lp parport acer_wmi sparse_keymap wmi mac_hid video hid_generic usbhid hid psmouse ahci tg3 libahci sdhci_pci ptp sdhci pps_core
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188334] CPU: 7 PID: 58 Comm: migration/7 Tainted: G D W OE 3.16.0-45-generic #60~14.04.1-Ubuntu
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188336] Hardware name: Acer Aspire V3-571G/VA50_HC_CR, BIOS V2.11 12/25/2012
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188338] 0000000000000009 ffff88045f3c5c00 ffffffff81765ca1 ffff88045f3c5c48
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188340] ffff88045f3c5c38 ffffffff8106de3d ffff8804489d8000 0000000000000000
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188343] ffff88045f3c5d60 0000000000000000 ffff88045f3c5ef8 ffff88045f3c5c98
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188346] Call Trace:
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188347] <NMI> [<ffffffff81765ca1>] dump_stack+0x45/0x56
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188355] [<ffffffff8106de3d>] warn_slowpath_common+0x7d/0xa0
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188358] [<ffffffff8106deac>] warn_slowpath_fmt+0x4c/0x50
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188361] [<ffffffff8111c42c>] watchdog_overflow_callback+0x9c/0xd0
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188365] [<ffffffff81158a7d>] __perf_event_overflow+0x8d/0x230
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188370] [<ffffffff81029e18>] ? x86_perf_event_set_period+0xe8/0x150
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188372] [<ffffffff81159524>] perf_event_overflow+0x14/0x20
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188374] [<ffffffff8103136d>] intel_pmu_handle_irq+0x1ed/0x3e0
Aug 1 00:26:26 rtg-Aspire-V3-571G kernel: [ 4925.188377] [<ffffffff81028b1b>] perf...

Read more...

Revision history for this message
penalvch (penalvch) wrote :

Roman Gritsulyak, it will help immensely if you filed a new report via a terminal:
ubuntu-bug linux

Please feel free to subscribe me to it.

Revision history for this message
Roman Gritsulyak (rtg-mail) wrote :

Hello Christopher,
the problem no longer reproduces after installing non-free nvidia driver, so no new reports.

I have 2 video cards laptop - nvidia is connected to external display.

After substituting noveau driver by nvidia problem no longer reproduces.

Revision history for this message
Jaime Pérez (jaime-91) wrote :

I set to invalid as Roman Gritsulyak said it seem to be fixed

Changed in linux (Ubuntu):
status: Incomplete → Invalid
Revision history for this message
penalvch (penalvch) wrote :

Jaime Pérez, thanks helping towards this.

However, the original reporter is Orange-juice. Hence, it's not considered Invalid given Roman Gritsulyak (not the original reporter) noted it's fixed for him and his hardware.

Changed in linux (Ubuntu):
importance: Medium → Low
status: Invalid → Incomplete
Revision history for this message
Jaime Pérez (jaime-91) wrote :

Oh, ok. My fault

Revision history for this message
whenselm (whenselm) wrote :

I have same problem with Elitebook 2530p- not waking up from standby any more- I have to remove battery. After reboot I get a long kernel trace stating the cpu1 hard lockup. The problem exists since a few weeks

Revision history for this message
penalvch (penalvch) wrote :

whenselm, it will help immensely if you filed a new report via a terminal:
ubuntu-bug linux

Please feel free to subscribe me to it.

Revision history for this message
marcin (mpekalski) wrote :
Download full text (5.2 KiB)

I have a similar issue on Ubuntu 15.10. While running make runtest for opencl-caffe (https://github.com/amd/OpenCL-caffe)

$ uname -a
Linux thesun 4.2.0-22-generic #27-Ubuntu SMP Thu Dec 17 22:57:08 UTC 2015 x86_64 x86_64 x86_64 GNU/Linux

Dec 26 01:36:38 thesun kernel: [ 926.843143] WARNING: CPU: 1 PID: 2792 at /build/linux-cRemOf/linux-4.2.0/kernel/watchdog.c:311 watchdog_overflow_callback+0x79/0xa0()
Dec 26 01:36:38 thesun kernel: [ 926.843144] Watchdog detected hard LOCKUP on cpu 1
Dec 26 01:36:38 thesun kernel: [ 926.843144] Modules linked in: rfcomm bnep fglrx(POE) uas intel_rapl usb_storage iosf_mbi x86_pkg_temp_thermal intel_powerclamp arc4 ath9k ath9k_common ath9k_hw hid_generic joydev ath input_leds mac80211 coretemp snd_hda_codec_realtek kvm_intel snd_hda_codec_hdmi snd_hda_codec_generic eeepc_wmi asus_wmi sparse_keymap kvm snd_hda_intel crct10dif_pclmul snd_hda_codec cfg80211 snd_hda_core crc32_pclmul snd_hwdep snd_pcm ath3k btusb snd_seq_midi snd_seq_midi_event snd_rawmidi snd_seq snd_seq_device btrtl btbcm btintel snd_timer snd aesni_intel bluetooth aes_x86_64 lrw amd_iommu_v2 gf128mul soundcore glue_helper ablk_helper mei_me mei lpc_ich cryptd shpchp serio_raw tpm_infineon mac_hid parport_pc ppdev lp parport autofs4 hid_microsoft usbhid hid mxm_wmi psmouse e1000e ahci libahci ptp pps_core wmi video
Dec 26 01:36:38 thesun kernel: [ 926.843169] CPU: 1 PID: 2792 Comm: test.testbin Tainted: P W OE 4.2.0-22-generic #27-Ubuntu
Dec 26 01:36:38 thesun kernel: [ 926.843170] Hardware name: ASUS All Series/Z87-PRO, BIOS 2103 08/18/2014
Dec 26 01:36:38 thesun kernel: [ 926.843171] 0000000000000000 000000000e4dc345 ffff88082ec45aa0 ffffffff817e94c9
Dec 26 01:36:38 thesun kernel: [ 926.843172] 0000000000000000 ffff88082ec45af8 ffff88082ec45ae0 ffffffff8107b3d6
Dec 26 01:36:38 thesun kernel: [ 926.843173] 0000000000000000 ffff88080afb8000 0000000000000000 ffff88082ec45c00
Dec 26 01:36:38 thesun kernel: [ 926.843174] Call Trace:
Dec 26 01:36:38 thesun kernel: [ 926.843175] <NMI> [<ffffffff817e94c9>] dump_stack+0x45/0x57
Dec 26 01:36:38 thesun kernel: [ 926.843180] [<ffffffff8107b3d6>] warn_slowpath_common+0x86/0xc0
Dec 26 01:36:38 thesun kernel: [ 926.843181] [<ffffffff8107b465>] warn_slowpath_fmt+0x55/0x70
Dec 26 01:36:38 thesun kernel: [ 926.843183] [<ffffffff811329d9>] watchdog_overflow_callback+0x79/0xa0
Dec 26 01:36:38 thesun kernel: [ 926.843185] [<ffffffff81177cc0>] __perf_event_overflow+0x90/0x1c0
Dec 26 01:36:38 thesun kernel: [ 926.843186] [<ffffffff811788c4>] perf_event_overflow+0x14/0x20
Dec 26 01:36:38 thesun kernel: [ 926.843188] [<ffffffff81034521>] intel_pmu_handle_irq+0x1e1/0x460
Dec 26 01:36:38 thesun kernel: [ 926.843190] [<ffffffff8102acf6>] perf_event_nmi_handler+0x26/0x40
Dec 26 01:36:38 thesun kernel: [ 926.843192] [<ffffffff810185f3>] nmi_handle+0x83/0x120
Dec 26 01:36:38 thesun kernel: [ 926.843193] [<ffffffff81018b62>] default_do_nmi+0x42/0x100
Dec 26 01:36:38 thesun kernel: [ 926.843194] [<ffffffff81018d0a>] do_nmi+0xea/0x140
Dec 26 01:36:38 thesun kernel: [ 926.843195] [<ffffffff817f2591>] end_repeat_nmi+0x1a/0x1e
Dec 26 01:36:38 thesun kernel: [ 926.843227] [<ffffff...

Read more...

Revision history for this message
penalvch (penalvch) wrote :

marcin, it will help immensely if you filed a new report via a terminal:
ubuntu-bug linux

Please feel free to subscribe me to it.

For more on why this is helpful, please see https://wiki.ubuntu.com/ReportingBugs.

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.