kernel warning of uncore_discovery.c:184 uncore_insert_box_info+0x134/0x350

Bug #2008037 reported by Zhanglei Mao
8
This bug affects 1 person
Affects Status Importance Assigned to Milestone
linux-hwe-5.15 (Ubuntu)
New
Undecided
Unassigned

Bug Description

Kernel taitned 512 ude to warning of below:

#grep taint /var/log/syslog -b2
235407-Feb 20 15:27:14 xfusion kernel: [ 3.779483] WARNING: CPU: 64 PID: 1 at arch/x86/events/intel/uncore_discovery.c:184 uncore_insert_box_info+0x134/0x350
235561-Feb 20 15:27:14 xfusion kernel: [ 3.779495] Modules linked in:
235627:Feb 20 15:27:14 xfusion kernel: [ 3.779499] CPU: 64 PID: 1 Comm: swapper/0 Not tainted 5.15.0-60-generic #66-Ubuntu
235746-Feb 20 15:27:14 xfusion kernel: [ 3.779505] Hardware name: XFUSION 2288 V7/BC15MBSC, BIOS 2.00.20.Btg 02/08/2023
235862-Feb 20 15:27:14 xfusion kernel: [ 3.779509] RIP: 0010:uncore_insert_box_info+0x134/0x350

Revision history for this message
Zhanglei Mao (zhanglei-mao) wrote :

#grep -i 'call trace' /var/log/dmesg -b19
113581-[ 3.779479] kernel: ------------[ cut here ]------------
113641-[ 3.779483] kernel: WARNING: CPU: 64 PID: 1 at arch/x86/events/intel/uncore_discovery.c:184 uncore_insert_box_info+0x134/0x350
113771-[ 3.779495] kernel: Modules linked in:
113813-[ 3.779499] kernel: CPU: 64 PID: 1 Comm: swapper/0 Not tainted 5.15.0-60-generic #66-Ubuntu
113908-[ 3.779505] kernel: Hardware name: XFUSION 2288 V7/BC15MBSC, BIOS 2.00.20.Btg 02/08/2023
114000-[ 3.779509] kernel: RIP: 0010:uncore_insert_box_info+0x134/0x350
114068-[ 3.779515] kernel: Code: c2 01 48 83 c0 04 39 d1 0f 8e c6 01 00 00 49 8b 4c 24 38 8b 0c 01 41 89 0c 07 49 8b 74 24 40 8b 34 06 41 89 34 06 39 f9 75 cf <0f> 0b 4c 89 ff e8 42 95 32 00 4c 89 f7 e8 3a 95 32 00 5b 41 5c 41
114291-[ 3.779521] kernel: RSP: 0000:ff5bc606001fbc98 EFLAGS: 00010246
114358-[ 3.779527] kernel: RAX: 0000000000000008 RBX: 0000000000000000 RCX: 0000000000000003
114447-[ 3.779531] kernel: RDX: 0000000000000002 RSI: 0000000000018000 RDI: 0000000000000003
114536-[ 3.779534] kernel: RBP: ff5bc606001fbcc0 R08: 0000000000000010 R09: ff456694ca656f20
114625-[ 3.779538] kernel: R10: ff456694ca32aec8 R11: ffffffffffffe000 R12: ff4566984965a9c0
114714-[ 3.779541] kernel: R13: ff5bc606001fbcf8 R14: ff456694ca656000 R15: ff456694ca656f20
114803-[ 3.779545] kernel: FS: 0000000000000000(0000) GS:ff4566982f800000(0000) knlGS:0000000000000000
114903-[ 3.779549] kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
114976-[ 3.779553] kernel: CR2: 0000000000000000 CR3: 00000007ba610001 CR4: 0000000000771ee0
115065-[ 3.779556] kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
115154-[ 3.779559] kernel: DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
115243-[ 3.779563] kernel: PKRU: 55555554
115281:[ 3.779565] kernel: Call Trace:
115316-[ 3.779569] kernel: <TASK>
115347-[ 3.779573] kernel: parse_discovery_table.isra.0+0x162/0x1a0
115412-[ 3.779580] kernel: intel_uncore_has_discovery_tables+0x19e/0x270
115482-[ 3.779585] kernel: ? type_pmu_register+0x1c/0x42
115536-[ 3.779593] kernel: intel_uncore_init+0xe3/0x226
115589-[ 3.779597] kernel: ? type_pmu_register+0x42/0x42
115643-[ 3.779601] kernel: do_one_initcall+0x46/0x1e0
115694-[ 3.779608] kernel: do_initcalls+0x12f/0x159
115743-[ 3.779615] kernel: kernel_init_freeable+0x162/0x1b5
115800-[ 3.779622] kernel: ? rest_init+0x100/0x100
115848-[ 3.779630] kernel: kernel_init+0x1b/0x150
115895-[ 3.779636] kernel: ? rest_init+0x100/0x100
115943-[ 3.779641] kernel: ret_from_fork+0x1f/0x30
115991-[ 3.779647] kernel: </TASK>
116023-[ 3.779650] kernel: ---[ end trace 3503eae85cdd4085 ]---

Revision history for this message
Zhanglei Mao (zhanglei-mao) wrote :

This was found at 20.04.1 ga-kernel of 5.15.0-60 which is lasted at the time and on a new intel cpu of Intel(R) Xeon(R) Gold 6438Y+

Revision history for this message
Michael Reed (mreed8855) wrote :

I opened a similar issue that I will close because this is a kernel issue.

https://github.com/canonical/checkbox/issues/312

The following kernel warning causes this warning:
WARNING: CPU: 64 PID: 1 at arch/x86/events/intel/uncore_discovery.c:184 uncore_insert_box_info+0x134/0x350

The kernel warning message is triggered when SPR MCC is used.

Revision history for this message
Michael Reed (mreed8855) wrote :
Download full text (3.3 KiB)

Partial Dmesg log

Jan 4 07:49:51 proven-gnu kernel: [ 4.202465] WARNING: CPU: 64 PID: 1 at arch/x86/events/intel/uncore_discovery.c:184 uncore_insert_box_info+0x134/0x350
Jan 4 07:49:51 proven-gnu kernel: [ 4.202474] Modules linked in:
Jan 4 07:49:51 proven-gnu kernel: [ 4.202478] CPU: 64 PID: 1 Comm: swapper/0 Not tainted 5.15.0-56-generic #62-Ubuntu
Jan 4 07:49:51 proven-gnu kernel: [ 4.202482] Hardware name: Dell Inc. PowerEdge T560/0PWDKY, BIOS 0.2.13 12/09/2022
Jan 4 07:49:51 proven-gnu kernel: [ 4.202485] RIP: 0010:uncore_insert_box_info+0x134/0x350
Jan 4 07:49:51 proven-gnu kernel: [ 4.202488] Code: c2 01 48 83 c0 04 39 d1 0f 8e c6 01 00 00 49 8b 4c 24 38 8b 0c 01 41 89 0c 07 49 8b 74 24 40 8b 34 06 41 89 34 06 39 f9 75 cf <0f> 0b 4c 89 ff e8 52 86 32 00 4c 89 f7 e8 4a 86 32 00 5b 41 5c 41
Jan 4 07:49:51 proven-gnu kernel: [ 4.202494] RSP: 0000:ff58c947c01efc98 EFLAGS: 00010246
Jan 4 07:49:51 proven-gnu kernel: [ 4.202498] RAX: 0000000000000008 RBX: 0000000000000000 RCX: 0000000000000003
Jan 4 07:49:51 proven-gnu kernel: [ 4.202500] RDX: 0000000000000002 RSI: 0000000000018000 RDI: 0000000000000003
Jan 4 07:49:51 proven-gnu kernel: [ 4.202503] RBP: ff58c947c01efcc0 R08: 0000000000000010 R09: ff356d890b9282f0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202505] R10: ff356d988a2e2000 R11: ffffffffffffe000 R12: ff356d9889edc6c0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202507] R13: ff58c947c01efcf8 R14: ff356d890b928db0 R15: ff356d890b9282f0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202510] FS: 0000000000000000(0000) GS:ff356d983fa00000(0000) knlGS:0000000000000000
Jan 4 07:49:51 proven-gnu kernel: [ 4.202513] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 4 07:49:51 proven-gnu kernel: [ 4.202515] CR2: 0000000000000000 CR3: 00000010be610001 CR4: 0000000000771ee0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202518] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
Jan 4 07:49:51 proven-gnu kernel: [ 4.202520] DR3: 0000000000000000 DR6: 00000000fffe07f0 DR7: 0000000000000400
Jan 4 07:49:51 proven-gnu kernel: [ 4.202522] PKRU: 55555554
Jan 4 07:49:51 proven-gnu kernel: [ 4.202524] Call Trace:
Jan 4 07:49:51 proven-gnu kernel: [ 4.202526] <TASK>
Jan 4 07:49:51 proven-gnu kernel: [ 4.202530] parse_discovery_table.isra.0+0x162/0x1a0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202534] intel_uncore_has_discovery_tables+0x19e/0x270
Jan 4 07:49:51 proven-gnu kernel: [ 4.202538] ? type_pmu_register+0x16/0x42
Jan 4 07:49:51 proven-gnu kernel: [ 4.202545] intel_uncore_init+0xe3/0x226
Jan 4 07:49:51 proven-gnu kernel: [ 4.202549] ? type_pmu_register+0x42/0x42
Jan 4 07:49:51 proven-gnu kernel: [ 4.202553] do_one_initcall+0x46/0x1e0
Jan 4 07:49:51 proven-gnu kernel: [ 4.202559] do_initcalls+0x12f/0x159
Jan 4 07:49:51 proven-gnu kernel: [ 4.202564] kernel_init_freeable+0x162/0x1b5
Jan 4 07:49:51 proven-gnu kernel: [ 4.202568] ? rest_init+0x100/0x100
Jan 4 07:49:51 proven-gnu kernel: [ 4.202575] kernel_init+0x1b/0x150
Jan 4 07:49:51 proven-gnu kernel: [ 4.202578] ? rest_init+0x100/0x100
Jan 4 07:49:51 proven-gnu kernel: [ ...

Read more...

Revision history for this message
Michael Reed (mreed8855) wrote :

There appears to be a patch set that fixes this issue but I do not think it has been accepted upstream yet.

https://lore.kernel.org/lkml/167429456532.4906.14087166098724750776.tip-bot2@tip-bot2/T/

If you search for "SPR MCC" the explanation for this issue is under the 3rd occurrence.

Mao, if you are able to apply these patches can you check to see if it fixes the issue?

To post a comment you must log in.
This report contains Public information  
Everyone can see this information.

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.