After reassign RoCE CX5 Mojawe Card Pchid to another LPAR dmesg show under Ubuntu the following Error message Ubuntu 20.04.01 with updates oot@t35lp02:~# uname -a Linux t35lp02.lnxne.boe 5.4.0-80-generic #90-Ubuntu SMP Fri Jul 9 17:41:33 UTC 2021 s390x s390x s390x GNU/Linux root@t35lp02:~# DMESG Output 761.778422] mlx5_core 0018:00:00.1: poll_health:715:(pid 0): Fatal error 1 detected [ 761.778432] mlx5_core 0018:00:00.1: print_health_info:381:(pid 0): assert_var[0] 0xffffffff [ 761.778435] mlx5_core 0018:00:00.1: print_health_info:381:(pid 0): assert_var[1] 0xffffffff [ 761.778437] mlx5_core 0018:00:00.1: print_health_info:381:(pid 0): assert_var[2] 0xffffffff [ 761.778439] mlx5_core 0018:00:00.1: print_health_info:381:(pid 0): assert_var[3] 0xffffffff [ 761.778442] mlx5_core 0018:00:00.1: print_health_info:381:(pid 0): assert_var[4] 0xffffffff [ 761.778444] mlx5_core 0018:00:00.1: print_health_info:384:(pid 0): assert_exit_ptr 0xffffffff [ 761.778447] mlx5_core 0018:00:00.1: print_health_info:386:(pid 0): assert_callra 0xffffffff [ 761.778451] mlx5_core 0018:00:00.1: print_health_info:389:(pid 0): fw_ver 65535.65535.65535 [ 761.778454] mlx5_core 0018:00:00.1: print_health_info:390:(pid 0): hw_id 0xffffffff [ 761.778456] mlx5_core 0018:00:00.1: print_health_info:391:(pid 0): irisc_index 255 [ 761.778460] mlx5_core 0018:00:00.1: print_health_info:392:(pid 0): synd 0xff: unrecognized error [ 761.778462] mlx5_core 0018:00:00.1: print_health_info:394:(pid 0): ext_synd 0xffff [ 761.778465] mlx5_core 0018:00:00.1: print_health_info:396:(pid 0): raw fw_ver 0xffffffff [ 761.778467] mlx5_core 0018:00:00.1: mlx5_trigger_health_work:696:(pid 0): new health works are not permitted at this stage [ 763.179016] mlx5_core 0018:00:00.1: E-Switch: cleanup [ 768.348431] mlx5_core 0018:00:00.1: mlx5_reclaim_startup_pages:562:(pid 123): FW did not return all pages. giving up... [ 768.348433] ------------[ cut here ]------------ [ 768.348434] FW pages counter is 43318 after reclaiming all pages [ 768.348562] WARNING: CPU: 0 PID: 123 at drivers/net/ethernet/mellanox/mlx5/core/pagealloc.c:567 mlx5_reclaim_startup_pages+0x12c/0x1c0 [mlx5_core] [ 768.348563] Modules linked in: s390_trng chsc_sch eadm_sch vfio_ccw vfio_mdev mdev vfio_iommu_type1 vfio sch_fq_codel drm drm_panel_orientation_quirks i2c_core ip_tables x_tables btrfs zstd_compress zlib_deflate raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 linear dm_service_time mlx5_ib ib_uverbs ib_core pkey qeth_l2 zcrypt crc32_vx_s390 ghash_s390 prng aes_s390 des_s390 libdes sha3_512_s390 sha3_256_s390 sha512_s390 mlx5_core sha256_s390 sha1_s390 sha_common tls mlxfw ptp pps_core zfcp scsi_transport_fc dasd_eckd_mod dasd_mod qeth qdio ccwgroup scsi_dh_emc scsi_dh_rdac scsi_dh_alua dm_multipath [ 768.348586] CPU: 0 PID: 123 Comm: kmcheck Tainted: G W 5.4.0-80-generic #90-Ubuntu [ 768.348586] Hardware name: IBM 8561 T01 703 (LPAR) [ 768.348587] Krnl PSW : 0704c00180000000 000003ff808d33ac (mlx5_reclaim_startup_pages+0x12c/0x1c0 [mlx5_core]) [ 768.348607] R:0 T:1 IO:1 EX:1 Key:0 M:1 W:0 P:0 AS:3 CC:0 PM:0 RI:0 EA:3 [ 768.348608] Krnl GPRS: 0000000000000004 0000000000000006 0000000000000034 0000000000000007 [ 768.348608] 0000000000000007 00000000fcb4fa00 000000000000007b 000003e00458fafc [ 768.348609] 00000000b7d406f0 000000004d849c00 00000000b7d00120 000000010000b6c0 [ 768.348610] 00000000f4cb1100 000003e00458fe70 000003ff808d33a8 000003e00458fa50 [ 768.348615] Krnl Code: 000003ff808d339c: c02000041043 larl %r2,000003ff80955422 000003ff808d33a2: c0e5ffff70c7 brasl %r14,000003ff808c1530 #000003ff808d33a8: a7f40001 brc 15,000003ff808d33aa >000003ff808d33ac: e330a5f04012 lt %r3,263664(%r10) 000003ff808d33b2: a784ffd7 brc 8,000003ff808d3360 000003ff808d33b6: b9140033 lgfr %r3,%r3 000003ff808d33ba: c0200004104e larl %r2,000003ff80955456 000003ff808d33c0: c0e5ffff70b8 brasl %r14,000003ff808c1530 [ 768.348622] Call Trace: [ 768.348641] ([<000003ff808d33a8>] mlx5_reclaim_startup_pages+0x128/0x1c0 [mlx5_core]) [ 768.348661] [<000003ff808c8e14>] mlx5_function_teardown+0x44/0xa0 [mlx5_core] [ 768.348680] [<000003ff808c95b0>] mlx5_unload_one+0x80/0x160 [mlx5_core] [ 768.348699] [<000003ff808c9720>] remove_one+0x50/0xd0 [mlx5_core] [ 768.348702] [<000000004d2704c0>] pci_device_remove+0x40/0xa0 [ 768.348706] [<000000004d2f724e>] device_release_driver_internal+0xee/0x1c0 [ 768.348707] [<000000004d267054>] pci_stop_bus_device+0x94/0xc0 [ 768.348708] [<000000004d267210>] pci_stop_and_remove_bus_device_locked+0x30/0x50 [ 768.348710] [<000000004cd36cbe>] __zpci_event_availability+0x26e/0x340 [ 768.348713] [<000000004d382794>] chsc_process_crw+0x2e4/0x300 [ 768.348714] [<000000004d389fd6>] crw_collect_info+0x276/0x340 [ 768.348716] [<000000004cd681e6>] kthread+0x126/0x160 [ 768.348719] [<000000004d5a568c>] ret_from_fork+0x28/0x30 [ 768.348720] [<000000004d5a5694>] kernel_thread_starter+0x0/0x10 [ 768.348720] Last Breaking-Event-Address: [ 768.348739] [<000003ff808d33a8>] mlx5_reclaim_startup_pages+0x128/0x1c0 [mlx5_core] [ 768.348740] ---[ end trace 1056779ff3084977 ]--- [ 768.354255] pci 0018:00:00.1: Removing from iommu group 2 [ 768.359097] pci_bus 0018:00: busn_res: [bus 00] is released [ 768.359122] crw_info : CRW reports slct=0, oflw=0, chn=0, rsc=B, anc=0, erc=0, rsid=0 root@t35lp02:~# == Comment: #2 -