PCI RoCe IB perftest Aborted (core dumped)
Affects | Status | Importance | Assigned to | Milestone | |
---|---|---|---|---|---|
Ubuntu on IBM z Systems |
Fix Released
|
High
|
Unassigned | ||
perftest (Ubuntu) |
Fix Released
|
High
|
Unassigned |
Bug Description
SRU:
====
[Impact]
* the perftest tools (ib_*) incl. in the perftest package cannot be used at all, they all always core dump on all platforms
* a backport is required for getting again a working perftest package / tool set
* the fix was officially provided by Mellanox and fixes the version comparison that was broken before (partly using int, partly string compares)
[Test Case]
* the bug can easily be reproduced on two systems both with RoCE cards installed
and by starting a perftest run like this:
on one machine as 'server': sudo ./ib_read_bw -d mlx4_0 -a
and on a second machine as 'client', pointing to the servers IP address: sudo ./ib_read_bw <server IP> -d mlx4_0
detailed instructions how to reproduce the bug
* install the perftest package including all dependencies
* configure the RoCE devices as network devices using a private network range in /etc/network/
# The 1st RoCE interface configuration
auto enP1p0s0
iface enP1p0s0 inet static
address 192.168.1.141
netmask 255.255.255.0
network 192.168.1.0
broadcast 192.168.1.255
* test if the network is okay, with ping (or rping, udaddy rdma_client/
* and run the Test Case above
[Regression Potential]
* the regression is low due to the fact that the current tools that are part of the perftest package just segfault and are unusable
* the target for this patched perftest package is Zesty (and higher), because only Zesty has the depending packages in an up-to-date version
* people may ask to SRU that to Xenial as well, but that would require the update of several other packages ...
-------
== Comment: #0 - Helmut Grauer - 2016-03-04 06:46:50 ==
Hi
Configure IB for perftest
Ethernet Interface
np0s0 Link encap:Ethernet HWaddr 82:01:14:32:f0:90
inet addr:10.100.80.2 Bcast:10.
inet6 addr: fe80::8001:
inet6 addr: fd00:10:
inet6 addr: fd00:10:
UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
RX bytes:8228251 (8.2 MB) TX bytes:21494 (21.4 KB)
Installing related packages for dapltest
librdmacm-dev install
librdmacm1 install
librdmacm1-dbg install
dapl2-utils install
libibumad3 install
libibverbs-dev install
libibverbs1 install
libmlx4-1 install
libmlx4-1-dbg install
libmlx4-dev install
libmlx5-1 install
libmlx5-1-dbg install
libmlx5-dev install
perftest install
+++++++
+++++++
+++ PCI-Overview: +++
+++++++
+++++++
=======
DEVICE_List:
-------------
0000:00:00.0
0001:00:00.0
=======
FunctionID_List:
-----------------
0x0000003e
0x0000003f
=======
PCHID_List:
------------
0x0108
0x013c
=======
Interface_List:
----------------
/sys/bus/
enp0s0
enp0s0d1
/sys/bus/
enP1p0s0
enP1p0s0d1
=======
Infiniband_List:
----------------
/sys/bus/
mlx4_0
/sys/bus/
mlx4_1
-------
server
root@s83lp02:~# dpkg -S /etc/dat.conf
libdapl2: /etc/dat.conf
root@s83lp02:~# ib_read_bw -d mlx4_0 -a
*******
* Waiting for client to connect... *
*******
*** stack smashing detected ***: ib_read_bw terminated
Aborted (core dumped)
-------
root@s83lp18:~# ./xpci.sh
+++++++
+++++++
+++ PCI-Overview: +++
+++++++
+++++++
=======
DEVICE_List:
-------------
0000:00:00.0
0001:00:00.0
=======
FunctionID_List:
-----------------
0x00000040
0x00000041
=======
PCHID_List:
------------
0x0108
0x013c
=======
Interface_List:
----------------
/sys/bus/
ens40
ens40d1
/sys/bus/
enP1s41
enP1s41d1
=======
Infiniband_List:
----------------
/sys/bus/
mlx4_0
/sys/bus/
mlx4_1
Client
root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
Couldn't connect to 10.100.80.2:18515
Unable to open file descriptor for socket connection Unable to init the socket connection
root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
*** stack smashing detected ***: ib_read_bw terminated
Aborted (core dumped)
(reverse-
I will add SOSReport and dgbinfo.sh
affects: | ubuntu → perftest (Ubuntu) |
summary: |
- PCI RoCe IB pertest Aborted (core dumped) + PCI RoCe IB perftest Aborted (core dumped) |
Changed in perftest (Ubuntu): | |
assignee: | Skipper Bug Screeners (skipper-screen-team) → Canonical Server Team (canonical-server) |
Changed in perftest (Ubuntu): | |
milestone: | none → ubuntu-16.04 |
Changed in ubuntu-z-systems: | |
status: | New → Triaged |
Changed in perftest (Ubuntu Xenial): | |
status: | New → Triaged |
importance: | Undecided → High |
Changed in perftest (Ubuntu Yakkety): | |
milestone: | xenial-updates → ubuntu-16.10 |
Changed in perftest (Ubuntu Xenial): | |
milestone: | none → ubuntu-16.04.1 |
Changed in ubuntu-z-systems: | |
importance: | Undecided → High |
tags: | added: roce |
Changed in ubuntu-z-systems: | |
status: | Triaged → Confirmed |
Changed in perftest (Ubuntu): | |
status: | Triaged → Confirmed |
Changed in perftest (Ubuntu Xenial): | |
status: | Triaged → Confirmed |
Changed in perftest (Ubuntu Yakkety): | |
status: | Triaged → Confirmed |
Changed in perftest (Ubuntu): | |
milestone: | ubuntu-16.10 → ubuntu-17.06 |
description: | updated |
Changed in perftest (Ubuntu Zesty): | |
status: | New → In Progress |
importance: | Undecided → High |
no longer affects: | perftest (Ubuntu Yakkety) |
no longer affects: | perftest (Ubuntu Xenial) |
Changed in perftest (Ubuntu Zesty): | |
assignee: | nobody → Dimitri John Ledkov (xnox) |
milestone: | none → zesty-updates |
status: | In Progress → Triaged |
Changed in ubuntu-z-systems: | |
status: | Confirmed → Triaged |
no longer affects: | perftest (Ubuntu Zesty) |
Changed in ubuntu-z-systems: | |
status: | Triaged → Fix Released |
tags: | added: universe |
Default Comment by Bridge