Kernel getting crashed on DLPAR add/remove commands for halfsail on ubuntu14.04.5
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux (Ubuntu) |
New
|
Undecided
|
Taco Screen team |
Bug Description
== Comment: #0 - Naresh Bannoth - 2016-07-01 02:07:26 ==
---Problem Description---
executing Dlpar add/remove adapter operation on Halfsail card in ubuntu14.04.5.
on executing these command for 5-10 times, the kernel getting crashed.
Syslog messages while crashing
-------
[52469.743034] Unable to handle kernel paging request for instruction fetch
[52469.743045] Faulting instruction address: 0x552f73746f6c732c
[52469.743051] Oops: Kernel access of bad area, sig: 11 [#1]
[52469.743055] SMP NR_CPUS=2048 NUMA pSeries
[52469.743061] Modules linked in: rpadlpar_io rpaphp dm_round_robin pseries_rng rtc_generic dm_multipath qla2xxx ibmvscsi ibmveth scsi_transport_fc
[52469.743079] CPU: 10 PID: 1954 Comm: multipathd Not tainted 4.4.0-29-generic #48~14.04.1-Ubuntu
[52469.743086] task: c0000003f9f42a20 ti: c0000003f22d4000 task.ti: c0000003f22d4000
[52469.743091] NIP: 552f73746f6c732c LR: c00000000004803c CTR: 552f73746f6c732f
[52469.743097] REGS: c0000003f22d6d60 TRAP: 0400 Not tainted (4.4.0-29-generic)
[52469.743101] MSR: 8000000140009033 <SF,EE,
[52469.743116] CFAR: c000000000008468 SOFTE: 1
GPR00: c000000000048018 c0000003f22d6fe0 c000000001598f00 c000000006e6c000
GPR04: 0000000000000001 0000000000000000 000000000000de0b c000000000f7fdc8
GPR08: 00000003fe920000 552f73746f6c732f 0000000000000000 c000000006121e40
GPR12: 552f73746f6c732f c000000007af5f00 00003fff8f00eea8 00003fff8f00eea8
GPR16: 00003fff8f00eea8 00003fff8f00eea8 00003fff8f00eea8 0000000000000001
GPR20: 0000000000000000 0000000000000083 00003fff8eeb04cc 00003fff8f023810
GPR24: 000000000000004e c0000000062dfc28 c0000003f2c24000 0000000000000100
GPR28: c0000000015050e0 c000000005018e80 c0000003f3282c00 c000000006e6c000
[52469.743181] NIP [552f73746f6c732c] 0x552f73746f6c732c
[52469.743188] LR [c00000000004803c] pcibios_
[52469.743192] Call Trace:
[52469.743196] [c0000003f22d6fe0] [c000000000048018] pcibios_
[52469.743205] [c0000003f22d7010] [c0000000005c15f4] pci_release_
[52469.743212] [c0000003f22d7040] [c0000000006b2390] device_
[52469.743219] [c0000003f22d70c0] [c00000000056bd14] kobject_
[52469.743226] [c0000003f22d7140] [c0000000006b2b24] put_device+
[52469.743232] [c0000003f22d7170] [c00000000073c034] scsi_host_
[52469.743239] [c0000003f22d71b0] [c0000000006b2390] device_
[52469.743246] [c0000003f22d7230] [c00000000056bd14] kobject_
[52469.743253] [c0000003f22d72b0] [c0000000006b2b24] put_device+
[52469.743265] [c0000003f22d72e0] [d0000000023d097c] fc_rport_
[52469.743275] [c0000003f22d7310] [c0000000006b2390] device_
[52469.743283] [c0000003f22d7390] [c00000000056bd14] kobject_
[52469.743292] [c0000003f22d7410] [c0000000006b2b24] put_device+
[52469.743301] [c0000003f22d7440] [c000000000748a50] scsi_target_
[52469.743309] [c0000003f22d7470] [c0000000006b2390] device_
[52469.743317] [c0000003f22d74f0] [c00000000056bd14] kobject_
[52469.743324] [c0000003f22d7570] [c0000000006b2b24] put_device+
[52469.743332] [c0000003f22d75a0] [c00000000074d3e8] scsi_device_
[52469.743341] [c0000003f22d7600] [c0000000000da4f4] execute_
[52469.743349] [c0000003f22d7630] [c00000000074d254] scsi_device_
[52469.743357] [c0000003f22d7660] [c0000000006b2390] device_
[52469.743364] [c0000003f22d76e0] [c00000000056bd14] kobject_
[52469.743371] [c0000003f22d7760] [c0000000006b2b24] put_device+
[52469.743379] [c0000003f22d7790] [c0000000007399e0] scsi_device_
[52469.743386] [c0000003f22d77c0] [c00000000075b528] scsi_disk_
[52469.743394] [c0000003f22d7800] [c0000000003282c8] __blkdev_
[52469.743402] [c0000003f22d78f0] [c0000000008cbc1c] dm_put_
[52469.743410] [c0000003f22d7930] [c0000000008d09b0] dm_put_
[52469.743418] [c0000003f22d79b0] [d000000004a41830] free_priority_
[52469.743427] [c0000003f22d7a10] [d000000004a41914] free_multipath+
[52469.743436] [c0000003f22d7a60] [c0000000008d1b30] dm_table_
[52469.743443] [c0000003f22d7af0] [c0000000008d711c] dev_suspend+
[52469.743449] [c0000003f22d7b30] [c0000000008d7fec] ctl_ioctl+
[52469.743456] [c0000003f22d7d10] [c0000000008d81d8] dm_ctl_
[52469.743464] [c0000003f22d7d40] [c0000000002ec710] do_vfs_
[52469.743472] [c0000003f22d7de0] [c0000000002ecaf4] SyS_ioctl+0xd4/0xf0
[52469.743480] [c0000003f22d7e30] [c000000000009204] system_
[52469.743486] Instruction dump:
[52469.743490] XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX
[52469.743501] XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX
[52469.743516] ---[ end trace 93e18aa68a2c9922 ]---
[52469.747068]
[52469.747074] Sending IPI to other CPUs
[52469.748095] IPI complete
I'm in purgatory
---uname output---
Linux ubuntu 4.4.0-29-generic #48~14.04.1-Ubuntu SMP Wed Jun 29 19:55:03 UTC 2016 ppc64le ppc64le ppc64le GNU/Linux
Machine Type = 8284-22A
---Steps to Reproduce---
1. Install all required rsct packages for dlpar operation
2. make sure that the adapter is added as Desired in HMC
3. Activate the Dlpar services using "starsrc -g rsct" and "starsrc -g rsct_rm"
4. Make sure that thedlpar service are active using lssrc -a
root@ubuntu:~# lssrc -a
Subsystem Group PID Status
ctrmc rsct 1558 active
IBM.DRM rsct_rm 1720 active
IBM.MgmtDomainRM rsct_rm 1785 active
IBM.ServiceRM rsct_rm 1796 active
IBM.HostRM rsct_rm 1855 active
ctcas rsct 2918 active
IBM.ERRM rsct_rm 2926 active
IBM.AuditRM rsct_rm 2927 active
IBM.SensorRM rsct_rm 2928 active
root@ubuntu:~#
Now start executing the Dlpar commands as follows,
1. List the adapter present in the lpar and note down the lpar_id and drc_index of the adapter on which you want to execute Dlpar operations,
lshwres -r io -m server-name --rsubtype slot --filter "lpar_names=
eg:
lshwres -r io -m tuletapio2-fsp --rsubtype slot --filter "lpar_names=
output:
--------
unit_phys_
Remove command:
-------
chhwres -r io -m tuletapio2-fsp -o r --id lpar_id -l drc_index
eg:
chhwres -r io -m tuletapio2-fsp -o r --id 8 -l 2103001D
Add command
-------------------
chhwres -r io -m tuletapio2-fsp -o a --id lpar_id -l drc_index
eg:
chhwres -r io -m tuletapio2-fsp -o a --id 8 -l 2103001D
Userspace tool common name: DLPAR(rsct)
== Comment: #12 - Mauricio Faria De Oliveira - 2016-09-08 12:19:56 ==
Hi Canonical,
Can you please apply this patch in 16.04.x and latest 14.04.x HWE?
It is accepted in linux-4.8-rc5 [1].
Thanks,
[1] 2016-08-22 "powerpc/pseries: use pci_host_
https:/
tags: | added: architecture-ppc64le bugnameltc-143294 severity-critical targetmilestone-inin14045 |
Changed in ubuntu: | |
assignee: | nobody → Taco Screen team (taco-screen-team) |
affects: | ubuntu → linux (Ubuntu) |
I think this is a duplicate, or at least bug #1618151 requires the same commit.