Ubuntu 18.04 [ WSP DD2.2 with stop4 and stop5 enabled ]: kdump fails to capture dump when smt=2 or off.
Affects | Status | Importance | Assigned to | Milestone |
---|---|---|---|---|
The Ubuntu-power-systems project | Fix Released | High | Canonical Kernel Team | |
linux (Ubuntu) | Fix Released | High | Unassigned | |
Bionic | Fix Released | High | Unassigned | |
Bug Description
---Problem Description---
Ubuntu 18.04 [ WSP DD2.2 with stop4 and stop5 enabled ]: kdump fails to capture dump when smt=2 or off.
---Environment--
Kernel Build: 4.15.0-13-generic
System Name : ltc-wspoon4
Model/Type : P9
Platform : BML
---Steps to reproduce--
1. Configure kdump.
2. Set smt=off
# ppc64_cpu --smt=off
3. Trigger a crash.
echo 1 > /proc/sys/
echo "c" > /proc/sysrq-trigger
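Before triggering the crash in step 3, it can help to confirm that a crash kernel is actually loaded, so that a missing dump is not simply due to kdump being unconfigured. The following is a minimal sketch, not part of the original report. It assumes the standard sysfs file `/sys/kernel/kexec_crash_loaded`; the function name `crash_kernel_loaded` and the `SYSFS_KERNEL_DIR` override are hypothetical and exist only for illustration.

```shell
#!/bin/sh
# Hypothetical helper: report whether a crash (kdump) kernel is loaded.
# SYSFS_KERNEL_DIR is an assumption added so the check can be pointed at a
# test directory; on a real system it defaults to /sys/kernel.
crash_kernel_loaded() {
    dir="${SYSFS_KERNEL_DIR:-/sys/kernel}"
    # Return 2 if the sysfs file is missing or unreadable.
    [ -r "$dir/kexec_crash_loaded" ] || return 2
    # The file contains 1 when a crash kernel has been loaded, 0 otherwise.
    [ "$(cat "$dir/kexec_crash_loaded")" = "1" ]
}
```

A possible use is `crash_kernel_loaded && ppc64_cpu --smt`, which prints the current SMT state only when a crash kernel is loaded.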
---Logs----
root@ltc-wspoon4:~# dpkg -l|grep kexec
ii kexec-tools 1:2.0.16-1ubuntu1 ppc64el tools to support fast kexec reboots
root@ltc-wspoon4:~# makedumpfile -v
makedumpfile: version 1.6.3 (released on 29 Jun 2018)
lzo enabled
snappy disabled
[ 285.519832] [c000001fe2d83de0] [c0000000003d1898] SyS_write+
[ 285.519926] [c000001fe2d83e30] [c00000000000b184] system_
[ 285.520007] Instruction dump:
[ 285.520053] 4bfff9f1 4bfffe50 3c4c00f0 3842e800 7c0802a6 60000000 39200001 3d42001c
[ 285.520158] 394a6db0 912a0000 7c0004ac 39400000 <992a0000> 4e800020 3c4c00f0 3842e7d0
[ 285.520261] ---[ end trace 90a666dc7ca6f0ec ]---
[ 286.525787]
[ 286.525883] Sending IPI to other CPUs
[ 286.851284] IPI complete
[ 401.296284048,5] OPAL: Switch to big-endian OS
[ 402.297026662,3] OPAL: CPU 0x1 not in OPAL !
[ 403.455520784,3] OPAL: CPU 0x1 not in OPAL !nce.
[ 403.455569636,5] OPAL: Switch to little-endian OS
[ 404.455711332,3] OPAL: CPU 0x1 not in OPAL !
[ 404.470276386,3] PHB#0000[0:0]: CRESET: Unexpected slot state 00000102, resetting...
[ 413.140065625,3] PHB#0003[0:3]: CRESET: Unexpected slot state 00000102, resetting...
[ 421.393193605,3] PHB#0030[8:0]: CRESET: Unexpected slot state 00000102, resetting...
[ 423.353977316,3] PHB#0033[8:3]: CRESET: Unexpected slot state 00000102, resetting...
[ 425.314547966,3] PHB#0034[8:4]: CRESET: Unexpected slot state 00000102, resetting...
[ 5.004718] Processor 1 is stuck.
[ 10.007584] Processor 2 is stuck.
[ 15.010425] Processor 3 is stuck.
[ 16.135550] integrity: Unable to open file: /etc/keys/
[ 16.135554] integrity: Unable to open file: /etc/keys/
[ 16.250952] vio vio: uevent: failed to send synthetic uevent
--== Welcome to Hostboot hostboot-
4.52180|
4.53193|
6.00924|Booting from SBE side 0 on master proc=00050000
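The `Processor N is stuck.` lines in the log above appear to mark the secondary CPUs that did not come up after the crash. When comparing runs at different SMT settings, pulling those CPU numbers out of a saved console log can be handy. This is a minimal sketch, not something from the original report: the function name `stuck_cpus` is hypothetical, and the message format is assumed to match the lines shown above.

```shell
#!/bin/sh
# Hypothetical helper: print the CPU numbers reported as stuck, one per line.
# Reads a saved console log on stdin; lines that do not match the
# "Processor N is stuck." format are ignored.
stuck_cpus() {
    sed -n 's/.*Processor \([0-9][0-9]*\) is stuck\..*/\1/p'
}
```

For example, `stuck_cpus < console.log | wc -l` would count the stuck CPUs in a captured log, where `console.log` is a placeholder file name.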
There may also be a firmware issue here, but the kernel patches below still need
to be included so that the kdump kernel captures the dump successfully
when SMT is set to 2 or off:
https:/
("powerpc/crash: Remove the test for cpu_online in the IPI callback")
https:/
("powernv/kdump: Fix cases where the kdump kernel can get HMI's")
https:/
("powerpc/kdump: Fix powernv build break when KEXEC_CORE=n")
Thanks
Hari
Changed in ubuntu-power-systems:
importance: Undecided → High
assignee: nobody → Canonical Kernel Team (canonical-kernel-team)
tags: added: triage-g
Changed in ubuntu-power-systems:
status: New → Triaged
Changed in linux (Ubuntu):
status: New → In Progress
assignee: Ubuntu on IBM Power Systems Bug Triage (ubuntu-power-triage) → Joseph Salisbury (jsalisbury)
importance: Undecided → High
Changed in ubuntu-power-systems:
status: Triaged → In Progress
Changed in linux (Ubuntu Bionic):
status: In Progress → Fix Committed
Changed in ubuntu-power-systems:
status: In Progress → Fix Committed
Changed in ubuntu-power-systems:
status: Fix Committed → Fix Released