[linux-azure] Ubuntu 16.04 + INFINIBAND-OPEN-MPI-2VM
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
linux-azure (Ubuntu) |
Invalid
|
Undecided
|
Unassigned |
Bug Description
We ran an RDMA test case against gallery image Ubuntu 16.04 (with proposed kernel), and found the below issue. The kernel prior to proposed does not exhibit this bug, so it is a regression:
The issue is when ibv_devinfo is run, we get the below info:
ibv_devinfo
libibverbs: Warning: no userspace device-specific driver found for /sys/class/
No IB devices found
ibv_devices
libibverbs: Warning: no userspace device-specific driver found for /sys/class/
device node GUID
------ ----------------
Ibstat works as expected
CA 'mlx5_0'
CA type: MT4120
Number of ports: 1
Firmware version: 16.23.1020
Hardware version: 0
Node GUID: 0x00155dfffe33ff49
System image GUID: 0x506b4b0300f521ec
Port 1:
SM lid: 16
This Family exhibits the bug, with a subsystem of MT28800:
lspci -v|egrep 'Mel|mlx'
0002:00:02.0 Infiniband controller: Mellanox Technologies MT27800 Family [ConnectX-5 Virtual Function]
Subsystem: Mellanox Technologies MT28800 Family [ConnectX-5 Virtual Function]
Kernel driver in use: mlx5_core
Kernel modules: mlx5_core
This issue does not occur with Ubuntu 18.04, which has a different Subsystem(MT27800):
lspci -v|egrep 'Mel|mlx'
0002:00:02.0 Infiniband controller: Mellanox Technologies MT27800 Family [ConnectX-5 Virtual Function]
Subsystem: Mellanox Technologies MT27800 Family [ConnectX-5 Virtual Function]
Kernel driver in use: mlx5_core
Kernel modules: mlx5_core
If we could, we would like to move to rmda-core to version 22 or higher.
Status changed to 'Confirmed' because the bug affects multiple users.