BUG: soft lockup - CPU#0 stuck for 11s!

Bug #240071 reported by Rovano on 2008-06-14
This bug affects 10 people
Affects          Status        Importance  Assigned to  Milestone
linux (Ubuntu)   Fix Released  Undecided   Unassigned
Hardy            Invalid       Undecided   Unassigned

Bug Description

Binary package hint: tremulous

Clean installation of Ubuntu 8.04 with all updates as of 2008-06-13,
except for the following updates, which were not applied:
deskbar-applet (2.22.1-0ubuntu1) to 2.22.2.1-0ubuntu1
xserver-xorg-core (2:1.4.1~git20080131-1ubuntu9) to 2:1.4.1~git20080131-1ubuntu9.2

Bug:
BUG: soft lockup - CPU#0 stuck for 11s!
« posted: 12.06.2008, 23:58:58:44 »
Jun 13 00:45:07 rovano kernel: [ 4771.433061] BUG: soft lockup - CPU#0 stuck for 11s! [tremulous:6105]
Jun 13 00:45:07 rovano kernel: [ 4771.433067] Pid: 6105, comm: tremulous Tainted: P (2.6.24-18-generic #1)
Jun 13 00:45:07 rovano kernel: [ 4771.433071] EIP: 0060:[<f8d094be>] EFLAGS: 00200206 CPU: 0
Jun 13 00:45:07 rovano kernel: [ 4771.433148] EIP is at _ZN4Asic18isTimeStampExpiredE14_LARGE_INTEGERj+0x2e/0x150 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433152] EAX: 000e1a85 EBX: 000e1a85 ECX: 00000000 EDX: f8a06000
Jun 13 00:45:07 rovano kernel: [ 4771.433154] ESI: 00000000 EDI: f9066020 EBP: f7837d10 ESP: f7837cf8
Jun 13 00:45:07 rovano kernel: [ 4771.433157] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Jun 13 00:45:07 rovano kernel: [ 4771.433160] CR0: 80050033 CR2: a08e7000 CR3: 37cad000 CR4: 00000690
Jun 13 00:45:07 rovano kernel: [ 4771.433163] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Jun 13 00:45:07 rovano kernel: [ 4771.433166] DR6: ffff0ff0 DR7: 00000400
Jun 13 00:45:07 rovano kernel: [ 4771.433181] [<f8d09de0>] _ZN4Asic22ElapsedTS_PollingUntil19ConditionSuccessfulEv+0x30/0x80 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433270] [<f8d096e8>] _ZN4Asic9WaitUntil15WaitForCompleteEv+0x38/0xf0 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433362] [<f8d08776>] _ZN4Asic19PM4ElapsedTimeStampERK23PM4_TS_INTERRUPT_PARAMSj14_LARGE_INTEGER+0x166/0x200 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433468] [<f8ceb4b6>] QSSubmitList+0x146/0x150 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433559] [<f8cfd030>] _Z19uQSTimeStampRetiredjjj14_LARGE_INTEGER+0xf0/0x100 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433650] [<f8cfa42f>] _Z8uCWDDEQCjjjPvjS_+0x2cf/0x11a0 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433740] [<f8ceadf4>] CMMQS_uCWDDEQC+0x34/0x40 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433822] [<f8cb1e48>] firegl_cmmqs_CWDDE_32+0x238/0x340 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433898] [sys_quotactl+0x526/0x680] sys_quotactl+0x526/0x680
Jun 13 00:45:07 rovano kernel: [ 4771.433902] [<f8cb0e06>] firegl_cmmqs_CWDDE32+0x76/0x110 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.433973] [lock_timer_base+0x27/0x60] lock_timer_base+0x27/0x60
Jun 13 00:45:07 rovano kernel: [ 4771.433978] [<f8ad2510>] VBoxDrvLinuxGipTimer+0x0/0x100 [vboxdrv]
Jun 13 00:45:07 rovano kernel: [ 4771.433994] [<f8cb0d90>] firegl_cmmqs_CWDDE32+0x0/0x110 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.434064] [<f8ca34be>] firegl_ioctl+0x19e/0x220 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.434128] [sys_quotactl+0x526/0x680] sys_quotactl+0x526/0x680
Jun 13 00:45:07 rovano kernel: [ 4771.434139] [run_timer_softirq+0x17e/0x1e0] run_timer_softirq+0x17e/0x1e0
Jun 13 00:45:07 rovano kernel: [ 4771.434150] [sys_quotactl+0x526/0x680] sys_quotactl+0x526/0x680
Jun 13 00:45:07 rovano kernel: [ 4771.434155] [<f8c9885c>] ip_firegl_ioctl+0x1c/0x30 [fglrx]
Jun 13 00:45:07 rovano kernel: [ 4771.434220] [sys_quotactl+0x526/0x680] sys_quotactl+0x526/0x680
Jun 13 00:45:07 rovano kernel: [ 4771.434227] [do_ioctl+0x78/0x90] do_ioctl+0x78/0x90
Jun 13 00:45:07 rovano kernel: [ 4771.434235] [vfs_ioctl+0x22e/0x2b0] vfs_ioctl+0x22e/0x2b0
Jun 13 00:45:07 rovano kernel: [ 4771.434245] [sys_ioctl+0x56/0x70] sys_ioctl+0x56/0x70
Jun 13 00:45:07 rovano kernel: [ 4771.434253] [sysenter_past_esp+0x6b/0xa9] sysenter_past_esp+0x6b/0xa9
Jun 13 00:45:07 rovano kernel: [ 4771.434258] [sys_quotactl+0x526/0x680] sys_quotactl+0x526/0x680
Jun 13 00:45:07 rovano kernel: [ 4771.434276] =======================

Ubuntu crashes completely; a hard reset is required.
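
For anyone comparing traces across reports like the one above, here is a minimal sketch for pulling the key fields out of the watchdog line. It only assumes the line format visible in the logs in this report; the function name `find_lockups` and the sample list are illustrative, not part of any existing tool.

```python
import re

# Matches the watchdog line format seen in this report, e.g.
# "BUG: soft lockup - CPU#0 stuck for 11s! [tremulous:6105]"
LOCKUP_RE = re.compile(
    r"BUG: soft lockup - CPU#(?P<cpu>\d+) stuck for (?P<secs>\d+)s! "
    r"\[(?P<comm>[^\]]+):(?P<pid>\d+)\]"
)


def find_lockups(lines):
    """Return (cpu, seconds, command, pid) for each soft lockup line."""
    hits = []
    for line in lines:
        m = LOCKUP_RE.search(line)
        if m:
            hits.append((int(m["cpu"]), int(m["secs"]), m["comm"], int(m["pid"])))
    return hits


# Two lines copied from the log above; only the first is a watchdog line.
sample = [
    "Jun 13 00:45:07 rovano kernel: [ 4771.433061] BUG: soft lockup - CPU#0 stuck for 11s! [tremulous:6105]",
    "Jun 13 00:45:07 rovano kernel: [ 4771.433067] Pid: 6105, comm: tremulous Tainted: P (2.6.24-18-generic #1)",
]
print(find_lockups(sample))
```

On a live system the same function could be fed the lines of a kernel log file; which process names show up most often may hint at whether a particular driver is involved.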

Mark Grandi (markgrandi) wrote :

I marked this as confirmed, as I get the same problem. I noticed it appears right after I close my laptop lid, then open it, log in, and start doing some stuff; after a minute the computer locks up entirely, and I can't even use Alt+SysRq+REISUB.

It just spams this a bunch of times in my kernel.log; I can post the entire thing if needed:

Jul 3 22:26:00 Australis kernel: [ 123.873686] =======================
Jul 3 22:26:12 Australis kernel: [ 128.587949] BUG: soft lockup - CPU#0 stuck for 11s! [hald-runner:9384]
Jul 3 22:26:12 Australis kernel: [ 128.587953]
Jul 3 22:26:12 Australis kernel: [ 128.587956] Pid: 9384, comm: hald-runner Tainted: P D (2.6.24-19-generic #1)
Jul 3 22:26:12 Australis kernel: [ 128.587960] EIP: 0060:[forcedeth:_spin_lock+0x7/0x90] EFLAGS: 00000286 CPU: 0
Jul 3 22:26:12 Australis kernel: [ 128.587963] EIP is at _spin_lock+0x7/0x10
Jul 3 22:26:12 Australis kernel: [ 128.587965] EAX: c0418e80 EBX: f66489b4 ECX: 00000000 EDX: f50e4000
Jul 3 22:26:12 Australis kernel: [ 128.587968] ESI: c19fb960 EDI: 00000033 EBP: 00000000 ESP: f50e5f20
Jul 3 22:26:12 Australis kernel: [ 128.587970] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Jul 3 22:26:12 Australis kernel: [ 128.587972] CR0: 8005003b CR2: 0809be20 CR3: 37d83000 CR4: 00000690
Jul 3 22:26:12 Australis kernel: [ 128.587975] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Jul 3 22:26:12 Australis kernel: [ 128.587977] DR6: ffff0ff0 DR7: 00000400
Jul 3 22:26:12 Australis kernel: [ 128.587980] [kmap_high+0x13/0x1b0] kmap_high+0x13/0x1b0
Jul 3 22:26:12 Australis kernel: [ 128.588001] [copy_strings+0x109/0x190] copy_strings+0x109/0x190
Jul 3 22:26:12 Australis kernel: [ 128.588015] [copy_strings_kernel+0x21/0x40] copy_strings_kernel+0x21/0x40
Jul 3 22:26:12 Australis kernel: [ 128.588022] [do_execve+0x13d/0x1d0] do_execve+0x13d/0x1d0
Jul 3 22:26:12 Australis kernel: [ 128.588032] [sys_execve+0x2f/0x80] sys_execve+0x2f/0x80
Jul 3 22:26:12 Australis kernel: [ 128.588038] [sysenter_past_esp+0x6b/0xa9] sysenter_past_esp+0x6b/0xa9
Jul 3 22:26:12 Australis kernel: [ 128.588057] =======================

Changed in linux:
status: New → Confirmed

forall (forall-stalowka) wrote :

Pid: 3753, comm: postgres Not tainted (2.6.24-19-xen #1)
Jul 16 15:20:26 zeus kernel: [26239.210572] BUG: soft lockup - CPU#0 stuck for 11s! [postgres:3753]
Jul 16 15:20:26 zeus kernel: [26239.210635]
Jul 16 15:20:26 zeus kernel: [26239.210637] Pid: 3753, comm: postgres Not tainted (2.6.24-19-xen #1)
Jul 16 15:20:26 zeus kernel: [26239.210639] EIP: 0061:[dm_mod:_spin_lock+0x7/0x10] EFLAGS: 00000286 CPU: 0
Jul 16 15:20:26 zeus kernel: [26239.210646] EIP is at _spin_lock+0x7/0x10
Jul 16 15:20:26 zeus kernel: [26239.210648] EAX: c1dce98c EBX: 00000000 ECX: 2414c000 EDX: c1dce980
Jul 16 15:20:26 zeus kernel: [26239.210650] ESI: 00000000 EDI: ec397d90 EBP: 0002414c ESP: e4691ee0
Jul 16 15:20:26 zeus kernel: [26239.210651] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0069
Jul 16 15:20:26 zeus kernel: [26239.210655] CR0: 8005003b CR2: b7b640a0 CR3: 018d2000 CR4: 00000660
Jul 16 15:20:26 zeus kernel: [26239.210658] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Jul 16 15:20:26 zeus kernel: [26239.210660] DR6: ffff0ff0 DR7: 00000400
Jul 16 15:20:26 zeus kernel: [26239.210661] [_pin_lock+0xfd/0x190] _pin_lock+0xfd/0x190
Jul 16 15:20:26 zeus kernel: [26239.210677] [mm_unpin+0x1a/0x30] mm_unpin+0x1a/0x30
Jul 16 15:20:26 zeus kernel: [26239.210681] [arch_exit_mmap+0x81/0x190] arch_exit_mmap+0x81/0x190
Jul 16 15:20:26 zeus kernel: [26239.210684] [do_page_fault+0x366/0xe90] do_page_fault+0x366/0xe90
Jul 16 15:20:26 zeus kernel: [26239.210688] [exit_mmap+0x17/0x100] exit_mmap+0x17/0x100
Jul 16 15:20:26 zeus kernel: [26239.210692] [usbcore:down_read+0x8/0x9a0] down_read+0x8/0x20
Jul 16 15:20:26 zeus kernel: [26239.210695] [mmput+0x23/0x80] mmput+0x23/0x80
Jul 16 15:20:26 zeus kernel: [26239.210698] [do_exit+0x165/0x8b0] do_exit+0x165/0x8b0
Jul 16 15:20:26 zeus kernel: [26239.210702] [vfs_write+0x11e/0x170] vfs_write+0x11e/0x170
Jul 16 15:20:26 zeus kernel: [26239.210707] [do_group_exit+0x2a/0xa0] do_group_exit+0x2a/0xa0
Jul 16 15:20:26 zeus kernel: [26239.210710] [syscall_call+0x7/0x0b] syscall_call+0x7/0xb
Jul 16 15:20:26 zeus kernel: [26239.210714] [vcc_getsockopt+0x150/0x170] vcc_getsockopt+0x150/0x170
Jul 16 15:20:26 zeus kernel: [26239.210719] =======================
Jul 16 15:20:38 zeus kernel: [26251.024575] BUG: soft lockup - CPU#0 stuck for 11s! [postgres:3753]

Neil Jeffery (neilneil2000) wrote :

I get the same problem running an Intel Atom board.

Mine says:

BUG: soft lockup - CPU#0 stuck for 11s! [kacpid:44]

I noticed that the box is still pingable once it has crashed, but I cannot ssh in. If anyone wants further details, I am happy to post any output you deem relevant.

IMBatman (malvagio44) wrote :

[1003.301883]BUG: soft lockup - CPU#0 stuck for 11s! [init:1]
[1015.089290]BUG: soft lockup - CPU#0 stuck for 11s! [init:1]
[1026.880689]BUG: soft lockup - CPU#0 stuck for 11s! [init:1]
[1038.67.................
..............................................
I tried "Ubuntu 8.04.01, kernel 2.6.24-19-generic (recovery mode)",
but it pauses and freezes. I do have a dual-OS setup as a backup.
It just keeps printing "BUG: soft lockup". What does it mean, and how do I fix it?
I hope it doesn't mean I have to reinstall the OS. :( Is there any way to use the command line, and if so, what should I do?
I'm a newbie, so I really don't know what to do!

gerbalblaste (gerbalblaste) wrote :

I also have a similar (if not identical) bug. It occurs under heavy load and has been ongoing for several versions. The lockup is always preceded by an identical NetworkManager error.

Jul 24 14:52:20 gerbal-laptop NetworkManager: <info> Error getting killswitch power: org.freedesktop.DBus.Error.NoReply - Did not receive a reply. Possible causes include: the remote application did not send a reply, the message bus security policy blocked the reply, the reply timeout expired, or the network connection was broken.
Jul 24 14:52:26 gerbal-laptop kernel: [13162.463932] BUG: soft lockup - CPU#1 stuck for 11s! [Xorg:5801]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.463942]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.463946] Pid: 5801, comm: Xorg Tainted: P (2.6.24-19-generic #1)
Jul 24 14:52:26 gerbal-laptop kernel: [13162.463951] EIP: 0060:[<f904e633>] EFLAGS: 00203283 CPU: 1
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464109] EIP is at _ZN4Asic16Is_WPTR_equ_RPTR19ConditionSuccessfulEv+0x33/0x50 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464115] EAX: f8e8c080 EBX: eefe7d98 ECX: f95caf00 EDX: f93ba020
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464120] ESI: f8e8c084 EDI: eefe7d98 EBP: eefe7d00 ESP: eefe7ce8
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464125] DS: 007b ES: 007b FS: 00d8 GS: 0033 SS: 0068
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464129] CR0: 8005003b CR2: b695c000 CR3: 1f8dc000 CR4: 00000690
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464134] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464139] DR6: ffff0ff0 DR7: 00000400
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464170] [<f904d6e8>] _ZN4Asic9WaitUntil15WaitForCompleteEv+0x38/0xf0 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464336] [<f9051786>] _ZN6AsicR616ASICIdleInternalEN4Asic15idle_WaitMethodE+0x96/0x1f0 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464549] [<f902f4b6>] QSSubmitList+0x146/0x150 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464698] [<f904bf9c>] _ZN4Asic7PM4idleENS_15idle_WaitMethodE+0x4c/0x80 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464858] [unix_stream_recvmsg+0x25d/0x550] unix_stream_recvmsg+0x25d/0x550
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464868] [<c0125f20>] default_wake_function+0x0/0x10
Jul 24 14:52:26 gerbal-laptop kernel: [13162.464887] [<f9046645>] _ZN15QS_PRIVATE_CORE7PM4idleEN4Asic15idle_WaitMethodE+0x35/0x70 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465051] [<f9036321>] _ZN10QS_PRIVATE11synchronizeEv+0x31/0x40 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465204] [<f902f4d7>] QSSynchronize+0x17/0x20 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465352] [<f9036e69>] _Z14uQSSynchronizej+0x19/0x20 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465504] [<f903e49f>] _Z8uCWDDEQCjjjPvjS_+0x33f/0x11a0 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465670] [<f902edf4>] CMMQS_uCWDDEQC+0x34/0x40 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465822] [<f8ff5e48>] firegl_cmmqs_CWDDE_32+0x238/0x340 [fglrx]
Jul 24 14:52:26 gerbal-laptop kernel: [13162.465951] [sys_quotactl+0x4e6/0x680] sys_q...


Vladislav Muravyev (rex-lux) wrote :

Same bug on an Acer Aspire 5585 WXMi, with generic image 2.6.24-19.17 on x86/x86_64 (using the standard GRUB boot parameters, without the quiet and splash parameters). Sometimes it freezes: the mouse can still move, but the clock is stopped.

syslog:

Jul 28 17:39:01 om-laptop /USR/SBIN/CRON[4610]: (root) CMD ( [ -x /usr/lib/php5/maxlifetime ] && [ -d /var/lib/php5 ] && find /var/lib/php5/ -type f -cmin +$(/usr/lib/php5/maxlifetime) -print0 | xargs -r -0 rm)
Jul 28 17:47:30 om-laptop kernel: [23321.634344] NVRM: Xid (0001:00): 12, COCOD 00000002 beef4401 00000044 00001400 beef1901
Jul 28 17:47:30 om-laptop kernel: [23321.638418] NVRM: Xid (0001:00): 9, Channel 00000020 Instance 00000000 Intr 00100000
Jul 28 17:47:33 om-laptop kernel: [23324.932876] NVRM: Xid (0001:00): 12, COCOD 0000001e 31415930 00000019 00000dcc 00efebe7
Jul 28 17:47:33 om-laptop kernel: [23324.941019] NVRM: Xid (0001:00): 36, L1 -> L0
Jul 28 17:47:43 om-laptop kernel: [23334.612936] BUG: soft lockup - CPU#0 stuck for 11s! [ata/0:1574]

The Ubuntu Kernel Team is planning to move to the 2.6.27 kernel for the upcoming Intrepid Ibex 8.10 release. As a result, the kernel team would appreciate it if you could please test this newer 2.6.27 Ubuntu kernel. There are two ways you should be able to test:

1) If you are comfortable installing packages on your own, the linux-image-2.6.27-* package is currently available for you to install and test.

--or--

2) The upcoming Alpha5 for Intrepid Ibex 8.10 will contain this newer 2.6.27 Ubuntu kernel. Alpha5 is set to be released Thursday Sept 4. Please watch http://www.ubuntu.com/testing for Alpha5 to be announced. You should then be able to test via a LiveCD.

Please let us know immediately if this newer 2.6.27 kernel resolves the bug reported here or if the issue remains. More importantly, please open a new bug report for each new bug/regression introduced by the 2.6.27 kernel and tag the bug report with 'linux-2.6.27'. Also, please specifically note if the issue does or does not appear in the 2.6.26 kernel. Thanks again, we really appreciate your help and feedback.

Hi,

I have installed this kernel on my server and enabled the on board LAN that was causing the problem.

I haven't yet attempted to use the port for connectivity, but I did notice that the drop counter was racking up very quickly (millions per second). This is without any cable plugged in. I then configured the port in /etc/network/interfaces and the same behaviour continued.

So far I have not seen any crashing though.

I will try and establish connectivity on this port in the near future and let you know the results, however with the number of drops increasing so rapidly with no connection I am not holding out too much hope.
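
For anyone wanting to watch the same drop counters, here is a rough sketch that reads the receive "drop" column from text in the /proc/net/dev format. The interface name `eth1` and all numbers in the sample are made up for illustration; only the column layout (bytes, packets, errs, drop, ...) is taken from the kernel's own header lines.

```python
def rx_drops(proc_net_dev_text):
    """Map interface name -> receive 'drop' counter from /proc/net/dev text."""
    drops = {}
    for line in proc_net_dev_text.splitlines()[2:]:  # skip the two header lines
        if ":" not in line:
            continue
        name, stats = line.split(":", 1)
        fields = stats.split()
        # Receive columns are: bytes packets errs drop fifo frame ...
        drops[name.strip()] = int(fields[3])
    return drops


# Hypothetical sample in the /proc/net/dev layout (values are invented).
SAMPLE = """\
Inter-|   Receive                                                |  Transmit
 face |bytes    packets errs drop fifo frame compressed multicast|bytes    packets errs drop fifo colls carrier compressed
    lo:    1000      10    0    0    0     0          0         0     1000      10    0    0    0     0       0          0
  eth1:       0       0    0 5000000    0     0          0         0        0       0    0    0    0     0       0          0
"""

print(rx_drops(SAMPLE))
```

On a real machine the input would come from reading /proc/net/dev; sampling it twice a second apart and subtracting gives an approximate drops-per-second figure.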


Neil Jeffery (neilneil2000) wrote :

I can now confirm that the bug is fixed in the new kernel!

The port still doesn't work but I guess that is now a driver issue.

Mark Grandi (markgrandi) wrote :

I will test Intrepid on my laptop once a different outstanding bug lets me actually USE Ubuntu (it freezes during bootup).


Same here on a fresh install of hardy/server on a Sun Fire X4600:

Oct 14 11:02:42 jaq kernel: [77822.037534] CPU 1:
Oct 14 11:02:42 jaq kernel: [77822.037536] Modules linked in: des_generic cbc blkcipher binfmt_misc nfsd exportfs autofs4 rpcsec_gss_krb5 auth_rpcgss iptable_filter ip_tables x_tables xfs ipv6 ac nfs lockd nfs_acl sunrpc parport_pc lp parport loop joydev jedec_probe cfi_probe gen_probe mtd i2c_nforce2 chipreg pcspkr serio_raw map_funcs evdev button psmouse k8temp i2c_core shpchp pci_hotplug ext3 jbd mbcache sg sr_mod cdrom pata_amd pata_acpi sd_mod usb_storage libusual usbhid hid mptsas mptscsih mptbase ata_generic qla2xxx(F) scsi_transport_fc ehci_hcd scsi_transport_sas ohci_hcd libata scsi_tgt e1000 scsi_mod usbcore thermal processor fan fuse vesafb fbcon tileblit font bitblit softcursor
Oct 14 11:02:42 jaq kernel: [77822.037582] Pid: 8190, comm: hald-addon-stor Tainted: GF M 2.6.24-19-server #1
Oct 14 11:02:42 jaq kernel: [77822.037584] RIP: 0010:[auth_rpcgss:lock_kernel+0x1a/0x40] [auth_rpcgss:lock_kernel+0x1a/0x40] lock_kernel+0x1a/0x40
Oct 14 11:02:42 jaq kernel: [77822.037594] RSP: 0018:ffff8101f2c0bdf0 EFLAGS: 00000286
Oct 14 11:02:42 jaq kernel: [77822.037595] RAX: ffff8101fa1ed7a0 RBX: ffff8103fbc4b900 RCX: 0000000000000000
Oct 14 11:02:42 jaq kernel: [77822.037597] RDX: 0000000000000000 RSI: ffff810145953000 RDI: ffff8103fbc4b900
Oct 14 11:02:42 jaq kernel: [77822.037598] RBP: ffff8103fbc4b900 R08: 0000000000000000 R09: 0000000000000000
Oct 14 11:02:42 jaq kernel: [77822.037600] R10: 0000000000000000 R11: ffffffff8031a350 R12: ffffffff802b754e
Oct 14 11:02:42 jaq kernel: [77822.037601] R13: ffff810145953000 R14: 0000000000008001 R15: fffffffffffffff2
Oct 14 11:02:42 jaq kernel: [77822.037603] FS: 00007fab20284780(0000) GS:ffff8101fae42000(0000) knlGS:00000000abffbb90
Oct 14 11:02:42 jaq kernel: [77822.037605] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Oct 14 11:02:42 jaq kernel: [77822.037606] CR2: 00007f5803717f10 CR3: 00000003fb855000 CR4: 00000000000006e0
Oct 14 11:02:42 jaq kernel: [77822.037608] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Oct 14 11:02:42 jaq kernel: [77822.037609] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Oct 14 11:02:42 jaq kernel: [77822.037611]
Oct 14 11:02:42 jaq kernel: [77822.037611] Call Trace:
Oct 14 11:02:42 jaq kernel: [77822.037617] [do_open+0x43/0x330] do_open+0x43/0x330
Oct 14 11:02:42 jaq kernel: [77822.037625] [blkdev_open+0x0/0x90] blkdev_open+0x0/0x90
Oct 14 11:02:42 jaq kernel: [77822.037627] [blkdev_open+0x3c/0x90] blkdev_open+0x3c/0x90
Oct 14 11:02:42 jaq kernel: [77822.037633] [__dentry_open+0xdb/0x200] __dentry_open+0xdb/0x200
Oct 14 11:02:42 jaq kernel: [77822.037638] [do_filp_open+0x3a/0x50] do_filp_open+0x3a/0x50
Oct 14 11:02:42 jaq kernel: [77822.037645] [get_unused_fd_flags+0x77/0x120] get_unused_fd_flags+0x77/0x120
Oct 14 11:02:42 jaq kernel: [77822.037650] [do_sys_open+0x5a/0xf0] do_sys_open+0x5a/0xf0
Oct 14 11:02:42 jaq kernel: [77822.037655] [system_call+0x7e/0x83] system_call+0x7e/0x83
Oct 14 11:02:42 jaq kernel: [77822.037662]
Oct 14 11:02:42 jaq kernel: [77822.037668] BUG: soft lockup - CPU#14...


Based on the feedback here, it seems this is resolved for Intrepid. I'll mark this Fix Released for Intrepid and open a Hardy nomination. Thanks.

Changed in linux:
status: Confirmed → Fix Released

ced (ubuntu-grumly) wrote :

I am running 2.6.27-9 and I still have the bug!

Linux sql3-server3 2.6.27-9-server #1 SMP Thu Nov 20 22:56:07 UTC 2008 x86_64 GNU/Linux

[696070.321236] BUG: soft lockup - CPU#2 stuck for 61s! [mysqld:14816]
[696070.321252] Modules linked in: 8021q garp stp ipv6 iptable_filter ip_tables x_tables ext3 jbd mbcache parport_pc lp parport loop joydev dcdbas psmouse evdev pcspkr serio_raw usbhid hid iTCO_wdt iTCO_vendor_support button shpchp pci_hotplug i5000_edac edac_core xfs sr_mod cdrom pata_acpi ata_piix ses enclosure sd_mod crc_t10dif sg ata_generic libata dock ehci_hcd uhci_hcd usbcore bnx2 megaraid_sas scsi_mod dm_mirror dm_log dm_snapshot dm_mod thermal processor fan fbcon tileblit font bitblit softcursor fuse
[696070.321252] CPU 2:
[696070.321252] Modules linked in: 8021q garp stp ipv6 iptable_filter ip_tables x_tables ext3 jbd mbcache parport_pc lp parport loop joydev dcdbas psmouse evdev pcspkr serio_raw usbhid hid iTCO_wdt iTCO_vendor_support button shpchp pci_hotplug i5000_edac edac_core xfs sr_mod cdrom pata_acpi ata_piix ses enclosure sd_mod crc_t10dif sg ata_generic libata dock ehci_hcd uhci_hcd usbcore bnx2 megaraid_sas scsi_mod dm_mirror dm_log dm_snapshot dm_mod thermal processor fan fbcon tileblit font bitblit softcursor fuse
[696070.321252] Pid: 14816, comm: mysqld Not tainted 2.6.27-9-server #1
[696070.321252] RIP: 0010:[<ffffffff802abf0c>] [<ffffffff802abf0c>] find_get_pages+0x6c/0x110
[696070.321252] RSP: 0018:ffff8801a315ba78 EFLAGS: 00000246
[696070.321252] RAX: ffff8801a42bb920 RBX: ffff8801a315bab8 RCX: 0000000000000003
[696070.321252] RDX: 0000000000000004 RSI: 0000000000000000 RDI: ffffe200064eb200
[696070.321252] RBP: ffffffff807a5418 R08: ffffe200064ea248 R09: 0000000000000009
[696070.321252] R10: 000000000000000b R11: 000000000000590c R12: ffffffff8024535c
[696070.321252] R13: ffff8801a315bb28 R14: ffffffff803a23ba R15: ffff8801a315b9d8
[696070.321252] FS: 00000000451e1950(0063) GS:ffff88022fc02d00(0000) knlGS:0000000000000000
[696070.321252] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[696070.321252] CR2: 00007f340b2d2000 CR3: 0000000227dfe000 CR4: 00000000000006e0
[696070.321252] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[696070.321252] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
[696070.321252]
[696070.321252] Call Trace:
[696070.321252] [<ffffffff802abee3>] ? find_get_pages+0x43/0x110
[696070.321252] [<ffffffff802b6984>] ? pagevec_lookup+0x24/0x30
[696070.321252] [<ffffffffa021994b>] ? xfs_probe_cluster+0x11b/0x2c0 [xfs]
[696070.321252] [<ffffffffa021a645>] ? xfs_page_state_convert+0x565/0x760 [xfs]
[696070.321252] [<ffffffff803a6930>] ? radix_tree_gang_lookup_tag_slot+0xc0/0xe0
[696070.321252] [<ffffffff8026b46e>] ? up_write+0xe/0x10
[696070.321252] [<ffffffffa021a9a1>] ? xfs_vm_writepage+0x71/0x120 [xfs]
[696070.321252] [<ffffffff802bbc9a>] ? __dec_zone_page_state+0x2a/0x30
[696070.321252] [<ffffffff802b4237>] ? __writepage+0x17/0x50
[696070.321252] [<ffffffff802b5455>] ? write_cache_pages+0x255/0x3f0
[696070.321252] [<ffffffff802b4220>] ? __writepage+0x0/0x50
[696070.321252] [<ffffffffa021dc88>] ? xfs_file_aio_write+0x58/0x60 [xfs]
[696070.321252...


Ross (ross-excess) wrote :

I too am experiencing this problem with my Ubuntu Xen system.
Setting vcpus = 1 in my domU Xen config file seems to have successfully worked around the problem.
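
As a rough illustration of that workaround: classic xm-style domU config files use Python syntax, so the relevant line might look like the fragment below. The file path, guest name, and memory value are hypothetical placeholders; only the `vcpus = 1` setting comes from this comment.

```python
# Hypothetical domU config, e.g. /etc/xen/example-domu.cfg.
# The name and memory values are placeholders for illustration only.
name = "example-domu"
memory = 1024

# Restrict the guest to a single virtual CPU (the workaround described above).
vcpus = 1
```

The guest has to be shut down and started again for a changed config file to take effect.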

The problem appeared immediately after a kernel upgrade and a dom0 distro upgrade from Dapper to Hardy.
Old kernel: 2.6.16.13-xen (compiled by hand), working correctly.
New kernel: 2.6.24-23-xen (Ubuntu Hardy packaged kernel), has the soft lockup problems.

The domU is still Dapper, totally unchanged.

Here are some details about my setup:

It is an Intel Pentium D dual-core cpu
3Gigs of RAM
Intel chipsets and dual Intel onboard NICs
headless, racked server, no graphics drivers

lspci:
00:00.0 Host bridge: Intel Corporation E7230/3000/3010 Memory Controller Hub (rev 81)
00:01.0 PCI bridge: Intel Corporation E7230/3000/3010 PCI Express Root Port (rev 81)
00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1 (rev 01)
00:1c.4 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 5 (rev 01)
00:1c.5 PCI bridge: Intel Corporation 82801GR/GH/GHM (ICH7 Family) PCI Express Port 6 (rev 01)
00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #1 (rev 01)
00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #2 (rev 01)
00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #3 (rev 01)
00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #4 (rev 01)
00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller (rev 01)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev e1)
00:1f.0 ISA bridge: Intel Corporation 82801GB/GR (ICH7 Family) LPC Interface Bridge (rev 01)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 01)
00:1f.2 IDE interface: Intel Corporation 82801GB/GR/GH (ICH7 Family) SATA IDE Controller (rev 01)
00:1f.3 SMBus: Intel Corporation 82801G (ICH7 Family) SMBus Controller (rev 01)
02:00.0 PCI bridge: Intel Corporation 6702PXH PCI Express-to-PCI Bridge A (rev 09)
02:00.1 PIC: Intel Corporation 6700/6702PXH I/OxAPIC Interrupt Controller A (rev 09)
03:01.0 SCSI storage controller: Marvell Technology Group Ltd. MV88SX6081 8-port SATA II PCI-X Controller (rev 09)
04:00.0 Ethernet controller: Intel Corporation 82573E Gigabit Ethernet Controller (Copper) (rev 03)
04:00.3 Serial controller: Intel Corporation Active Management Technology - SOL (rev 03)
04:00.4 IPMI SMIC interface: Intel Corporation 82573E KCS (Active Management) (rev 03)
05:00.0 Ethernet controller: Intel Corporation 82573L Gigabit Ethernet Controller
0a:00.0 VGA compatible controller: ATI Technologies Inc Rage XL (rev 27)
0a:04.0 RAID bus controller: VIA Technologies, Inc. VT6421 IDE RAID Controller (rev 50)

Jan 26 04:49:48 infocentral kernel: [ 5849.193197] BUG: soft lockup - CPU#1 stuck for 11s! [postmaster:9944]
Jan 26 04:49:48 infocentral kernel: [ 5849.193224]
Jan 26 04:49:48 infocentral kernel: [ 5849.193230] Pid: 9944, comm: postmaster Not tainted (2.6.24-23-xen #1)
Jan 26 04:49:48 infocentral kernel: [ 5849.193236] EIP: 0061:[<c03279d7>] EFLAGS: 00000282 CPU: 1
Jan 26 04:49:48 infocentral kerne...


Zoltán Vigh (zool) wrote :

Are there any changes with this error?

Is there a way that this can be backported to the 8.04 LTS release? I currently have to run Intrepid kernels on LTS machines, which isn't ideal. Happy to do the leg work if someone can point me in the right direction.

I support a backport to 8.04 LTS. I have a bunch of servers running it and they lock up randomly, which is *very* annoying, besides striking a blow to the legendary stability of Ubuntu Linux.

A backport of the fix would be very cool.

At the moment Xen isn't really usable with 8.04 if you use the Xen image from the repos.

Tomas Plesek (tomas-plesek) wrote :

Backport for 8.04, please! +1

As Bastian Mäuser wrote, the current Xen kernel of 8.04.3 (2.6.24-24) is virtually unusable. I've been trying to get Ubuntu LTS up and running on my VPS servers; however, this situation prevents me from using it at all, as the servers lock up randomly within a span of 3 days.
I researched the issue for the better part of a day, and it seems that this affects all kernels from version 2.6.18-something up to 2.6.29, although some people reported that upgrading to 2.6.27 or higher solved their problem.

Is there any way to bump this to maintainers of 8.04 LTS?

Dave Aitken (dave-aitken) wrote :

+1 from me as well :)

Hassan El Jacifi (waver) wrote :

Still not fixed on 2.6.24-26.64.

Dec 30 14:19:39 dbh01 kernel: [5808092.087564] BUG: soft lockup - CPU#4 stuck for 11s! [mysqld:26433]

Is it possible to have a fix for this problem?

+1 from me.

I am running 8.04 LTS on a Sun Fire X4140, and this is really not my idea of stable :-(

sn00p (art-silenkov) wrote :

Confirmed on Ubuntu 9.10 under KVM virtualisation.

silenkov@v_mysql03:~$ uname -a
Linux v_mysql06 2.6.31-14-generic #48-Ubuntu SMP Fri Oct 16 14:05:01 UTC 2009 x86_64 GNU/Linux

Kern.log output:

Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] BUG: soft lockup - CPU#2 stuck for 61s! [mysqld:17040]
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] Modules linked in: ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs exportfs reiserfs iptable_filter ip_tables x_tables i2c_piix4 psmouse lp parport serio_raw virtio_balloon virtio_net virtio_blk floppy virtio_pci virtio_ring virtio
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] CPU 2:
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] Modules linked in: ufs qnx4 hfsplus hfs minix ntfs vfat msdos fat jfs xfs exportfs reiserfs iptable_filter ip_tables x_tables i2c_piix4 psmouse lp parport serio_raw virtio_balloon virtio_net virtio_blk floppy virtio_pci virtio_ring virtio
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] Pid: 17040, comm: mysqld Not tainted 2.6.31-14-generic #48-Ubuntu
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] RIP: 0010:[<ffffffff81035257>] [<ffffffff81035257>] kvm_mmu_op+0x47/0x70
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] RSP: 0018:ffff88007c10fc18 EFLAGS: 00000297
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] RAX: 0000000000000000 RBX: ffff88007c10fc48 RCX: 0000000001a3e5b0
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] RDX: 0000000000000000 RSI: 0000000000000020 RDI: ffff880001a3e5b0
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] RBP: ffffffff81012bae R08: 00000000ffffff77 R09: 0000000000000000
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] R10: 0000000000000000 R11: 0000000000000000 R12: ffff88007c10fc48
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] R13: ffffffff81012bae R14: 0000000000000000 R15: 0000000000000002
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] FS: 00007fc401254910(0000) GS:ffff880001a30000(0000) knlGS:0000000000000000
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] CR2: 00007fc3e04351d8 CR3: 000000005acd1000 CR4: 00000000000006e0
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] Call Trace:
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff8103524b>] ? kvm_mmu_op+0x3b/0x70
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810352b4>] ? kvm_leave_lazy_mmu+0x34/0x50
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810f5c07>] ? zap_pte_range+0x187/0x400
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810e5344>] ? release_pages+0x234/0x260
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810dba70>] ? T.768+0x3a0/0x440
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810f6040>] ? unmap_page_range+0x1c0/0x290
Mar 3 05:32:51 v_mysql03 kernel: [217826.790638] [<ffffffff810f6277>] ? unmap_vmas+0x167/0x2f0
Mar 3 05:32:51 v_mys...


Is Canonical about to drop support for Xen completely, now that XenSource has integrated pvops() and domU support is in the mainline?

I was forced to build my own .deb packages for 2.6.32.10, as well as xen-4.0.0 and xen-tools, for Lucid, which lacks dom0 support completely.

It took me about 5 hours to do everything and roll it out (though the packages wouldn't really conform to the official repositories), and so far it runs rock solid. What's so hard about it?

greets,
bastian

Julian Wiedmann (jwiedmann) wrote :

This release has reached end-of-life [0].

[0] https://wiki.ubuntu.com/Releases

Changed in linux (Ubuntu Hardy):
status: New → Invalid