https://launchpad.net/ubuntu/+source/gemmlowp/0.0~git20211220.e844ffd-1/+build/24109081

RUN: /usr/share/launchpad-buildd/bin/builder-prep
Kernel version: Linux lcy02-amd64-088 5.4.0-120-generic #136-Ubuntu SMP Fri Jun 10 13:40:48 UTC 2022 x86_64
Buildd toolchain package versions: launchpad-buildd_215~563~ubuntu20.04.1 python3-lpbuildd_215~563~ubuntu20.04.1 sbuild_0.79.0-1ubuntu1 git-build-recipe_0.3.6 git_1:2.25.1-1ubuntu3.4 dpkg-dev_1.19.7ubuntu3.2 python3-debian_0.1.36ubuntu1.
Syncing the system clock with the buildd NTP service...
24 Jun 11:16:16 ntpdate[1867]: adjust time server 10.131.248.1 offset -0.025833 sec
RUN: /usr/share/launchpad-buildd/bin/in-target unpack-chroot --backend=chroot --series=kinetic --arch=amd64 PACKAGEBUILD-24109081 --image-type chroot /home/buildd/filecache-default/4d18961cd05542d5def75b14497015b674c37821
Creating target for build PACKAGEBUILD-24109081
RUN: /usr/share/launchpad-buildd/bin/in-target mount-chroot --backend=chroot --series=kinetic --arch=amd64 PACKAGEBUILD-24109081
Starting target for build PACKAGEBUILD-24109081
RUN: /usr/share/launchpad-buildd/bin/in-target override-sources-list --backend=chroot --series=kinetic --arch=amd64 PACKAGEBUILD-24109081 'deb http://ftpmaster.internal/ubuntu kinetic main universe' 'deb http://ftpmaster.internal/ubuntu kinetic-security main universe' 'deb http://ftpmaster.internal/ubuntu kinetic-updates main universe' 'deb http://ftpmaster.internal/ubuntu kinetic-proposed main universe'
Overriding sources.list in build-PACKAGEBUILD-24109081
RUN: /usr/share/launchpad-buildd/bin/in-target update-debian-chroot --backend=chroot --series=kinetic --arch=amd64 PACKAGEBUILD-24109081
Updating target for build PACKAGEBUILD-24109081
Get:1 http://ftpmaster.internal/ubuntu kinetic InRelease [267 kB]
Get:2 http://ftpmaster.internal/ubuntu kinetic-security InRelease [90.7 kB]
Get:3 http://ftpmaster.internal/ubuntu kinetic-updates InRelease [90.7 kB]
Get:4 http://ftpmaster.internal/ubuntu kinetic-proposed InRelease [118 kB]
Get:5 http://ftpmaster.internal/ubuntu kinetic/main amd64 Packages [1398 kB]
Get:6 http://ftpmaster.internal/ubuntu kinetic/main Translation-en [510 kB]
Get:7 http://ftpmaster.internal/ubuntu kinetic/universe amd64 Packages [14.3 MB]
Get:8 http://ftpmaster.internal/ubuntu kinetic/universe Translation-en [5731 kB]
Get:9 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 Packages [81.9 kB]
Get:10 http://ftpmaster.internal/ubuntu kinetic-proposed/main Translation-en [23.6 kB]
Get:11 http://ftpmaster.internal/ubuntu kinetic-proposed/universe amd64 Packages [269 kB]
Get:12 http://ftpmaster.internal/ubuntu kinetic-proposed/universe Translation-en [120 kB]
Fetched 23.0 MB in 2s (9970 kB/s)
Reading package lists...
Reading package lists...
Building dependency tree...
Reading state information...
Calculating upgrade...
The following NEW packages will be installed:
  libgprofng0 util-linux-extra
The following packages will be upgraded:
  adduser apt base-files binutils binutils-common binutils-x86-64-linux-gnu
  bsdutils cpp-11 dash dpkg dpkg-dev e2fsprogs fakeroot g++-11 gcc-11
  gcc-11-base gcc-12-base init init-system-helpers libapparmor1 libapt-pkg6.0
  libasan6 libatomic1 libbinutils libblkid1 libcap-ng0 libcc1-0 libcom-err2
  libctf-nobfd0 libctf0 libdb5.3 libdpkg-perl libext2fs2 libfakeroot
  libgcc-11-dev libgcc-s1 libgcrypt20 libgnutls30 libgomp1 libgpg-error0
  libip4tc2 libitm1 libkeyutils1 liblsan0 liblzma5 libmount1 libncurses6
  libncursesw6 libpng16-16 libquadmath0 libreadline8 libselinux1
  libsemanage-common libsemanage2 libsepol2 libsmartcols1 libsqlite3-0 libss2
  libssl3 libstdc++-11-dev libstdc++6 libsystemd0 libtinfo6 libtsan0 libubsan1
  libudev1 libuuid1 libzstd1 linux-libc-dev login logsave lto-disabled-list
  mawk mount ncurses-base ncurses-bin openssl passwd pinentry-curses
  policyrcd-script-zg2 readline-common systemd systemd-sysv util-linux
  xz-utils
85 upgraded, 2 newly installed, 0 to remove and 0 not upgraded.
Need to get 81.1 MB of archives.
After this operation, 5364 kB of additional disk space will be used. Get:1 http://ftpmaster.internal/ubuntu kinetic/main amd64 base-files amd64 12.2ubuntu1 [62.6 kB] Get:2 http://ftpmaster.internal/ubuntu kinetic/main amd64 bsdutils amd64 1:2.38-4ubuntu1 [80.7 kB] Get:3 http://ftpmaster.internal/ubuntu kinetic/main amd64 libzstd1 amd64 1.5.2+dfsg-1 [270 kB] Get:4 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libatomic1 amd64 12.1.0-4ubuntu1 [10.4 kB] Get:5 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libubsan1 amd64 12.1.0-4ubuntu1 [969 kB] Get:6 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libquadmath0 amd64 12.1.0-4ubuntu1 [152 kB] Get:7 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 liblsan0 amd64 12.1.0-4ubuntu1 [1060 kB] Get:8 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libitm1 amd64 12.1.0-4ubuntu1 [29.4 kB] Get:9 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libgomp1 amd64 12.1.0-4ubuntu1 [125 kB] Get:10 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 gcc-12-base amd64 12.1.0-4ubuntu1 [19.1 kB] Get:11 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libgcc-s1 amd64 12.1.0-4ubuntu1 [54.3 kB] Get:12 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libcc1-0 amd64 12.1.0-4ubuntu1 [46.4 kB] Get:13 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libstdc++6 amd64 12.1.0-4ubuntu1 [678 kB] Get:14 http://ftpmaster.internal/ubuntu kinetic/main amd64 libgpg-error0 amd64 1.45-2 [69.0 kB] Get:15 http://ftpmaster.internal/ubuntu kinetic/main amd64 libgcrypt20 amd64 1.10.1-2ubuntu1 [524 kB] Get:16 http://ftpmaster.internal/ubuntu kinetic/main amd64 liblzma5 amd64 5.2.5-2.1 [99.2 kB] Get:17 http://ftpmaster.internal/ubuntu kinetic/main amd64 systemd-sysv amd64 249.11-0ubuntu4 [10.2 kB] Get:18 http://ftpmaster.internal/ubuntu kinetic/main amd64 libapparmor1 amd64 3.0.4-2ubuntu3 [38.0 kB] Get:19 http://ftpmaster.internal/ubuntu 
kinetic/main amd64 libgnutls30 amd64 3.7.6-2ubuntu1 [968 kB] Get:20 http://ftpmaster.internal/ubuntu kinetic/main amd64 libip4tc2 amd64 1.8.7-1ubuntu6 [19.7 kB] Get:21 http://ftpmaster.internal/ubuntu kinetic/main amd64 libblkid1 amd64 2.38-4ubuntu1 [103 kB] Get:22 http://ftpmaster.internal/ubuntu kinetic/main amd64 libselinux1 amd64 3.4-1 [77.6 kB] Get:23 http://ftpmaster.internal/ubuntu kinetic/main amd64 libmount1 amd64 2.38-4ubuntu1 [121 kB] Get:24 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsmartcols1 amd64 2.38-4ubuntu1 [50.8 kB] Get:25 http://ftpmaster.internal/ubuntu kinetic/main amd64 login amd64 1:4.11.1+dfsg1-2ubuntu1 [183 kB] Get:26 http://ftpmaster.internal/ubuntu kinetic/main amd64 util-linux-extra amd64 2.38-4ubuntu1 [83.8 kB] Get:27 http://ftpmaster.internal/ubuntu kinetic/main amd64 util-linux amd64 2.38-4ubuntu1 [1069 kB] Get:28 http://ftpmaster.internal/ubuntu kinetic/main amd64 mount amd64 2.38-4ubuntu1 [114 kB] Get:29 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsemanage-common all 3.4-1 [9828 B] Get:30 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsepol2 amd64 3.4-2 [294 kB] Get:31 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsemanage2 amd64 3.4-1 [91.8 kB] Get:32 http://ftpmaster.internal/ubuntu kinetic/main amd64 passwd amd64 1:4.11.1+dfsg1-2ubuntu1 [765 kB] Get:33 http://ftpmaster.internal/ubuntu kinetic/main amd64 adduser all 3.121ubuntu1 [150 kB] Get:34 http://ftpmaster.internal/ubuntu kinetic/main amd64 systemd amd64 249.11-0ubuntu4 [4517 kB] Get:35 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsystemd0 amd64 249.11-0ubuntu4 [314 kB] Get:36 http://ftpmaster.internal/ubuntu kinetic/main amd64 libudev1 amd64 249.11-0ubuntu4 [76.1 kB] Get:37 http://ftpmaster.internal/ubuntu kinetic/main amd64 libapt-pkg6.0 amd64 2.5.0 [896 kB] Get:38 http://ftpmaster.internal/ubuntu kinetic/main amd64 dpkg amd64 1.21.8ubuntu1 [1333 kB] Get:39 http://ftpmaster.internal/ubuntu kinetic/main amd64 dash amd64 
0.5.11+git20210903+057cd650a4ed-8ubuntu1 [88.2 kB] Get:40 http://ftpmaster.internal/ubuntu kinetic/main amd64 ncurses-bin amd64 6.3+20220423-2 [183 kB] Get:41 http://ftpmaster.internal/ubuntu kinetic/main amd64 init-system-helpers all 1.63 [38.7 kB] Get:42 http://ftpmaster.internal/ubuntu kinetic/main amd64 ncurses-base all 6.3+20220423-2 [21.2 kB] Get:43 http://ftpmaster.internal/ubuntu kinetic/main amd64 apt amd64 2.5.0 [1370 kB] Get:44 http://ftpmaster.internal/ubuntu kinetic/main amd64 logsave amd64 1.46.5-2ubuntu2 [9914 B] Get:45 http://ftpmaster.internal/ubuntu kinetic/main amd64 libext2fs2 amd64 1.46.5-2ubuntu2 [205 kB] Get:46 http://ftpmaster.internal/ubuntu kinetic/main amd64 e2fsprogs amd64 1.46.5-2ubuntu2 [585 kB] Get:47 http://ftpmaster.internal/ubuntu kinetic/main amd64 init amd64 1.63 [5674 B] Get:48 http://ftpmaster.internal/ubuntu kinetic/main amd64 libcap-ng0 amd64 0.8.3-1 [15.7 kB] Get:49 http://ftpmaster.internal/ubuntu kinetic/main amd64 libdb5.3 amd64 5.3.28+dfsg1-0.9 [717 kB] Get:50 http://ftpmaster.internal/ubuntu kinetic/main amd64 libncurses6 amd64 6.3+20220423-2 [110 kB] Get:51 http://ftpmaster.internal/ubuntu kinetic/main amd64 libncursesw6 amd64 6.3+20220423-2 [146 kB] Get:52 http://ftpmaster.internal/ubuntu kinetic/main amd64 libtinfo6 amd64 6.3+20220423-2 [99.6 kB] Get:53 http://ftpmaster.internal/ubuntu kinetic/main amd64 libuuid1 amd64 2.38-4ubuntu1 [23.2 kB] Get:54 http://ftpmaster.internal/ubuntu kinetic/main amd64 libcom-err2 amd64 1.46.5-2ubuntu2 [9074 B] Get:55 http://ftpmaster.internal/ubuntu kinetic/main amd64 libss2 amd64 1.46.5-2ubuntu2 [12.3 kB] Get:56 http://ftpmaster.internal/ubuntu kinetic/main amd64 mawk amd64 1.3.4.20200120-3.1 [109 kB] Get:57 http://ftpmaster.internal/ubuntu kinetic/main amd64 libkeyutils1 amd64 1.6.1-3ubuntu1 [10.1 kB] Get:58 http://ftpmaster.internal/ubuntu kinetic/main amd64 readline-common all 8.1.2-1.2 [53.6 kB] Get:59 http://ftpmaster.internal/ubuntu kinetic/main amd64 libreadline8 amd64 
8.1.2-1.2 [153 kB] Get:60 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsqlite3-0 amd64 3.38.5-1 [646 kB] Get:61 http://ftpmaster.internal/ubuntu kinetic/main amd64 libssl3 amd64 3.0.3-5ubuntu3 [1897 kB] Get:62 http://ftpmaster.internal/ubuntu kinetic/main amd64 openssl amd64 3.0.3-5ubuntu3 [1175 kB] Get:63 http://ftpmaster.internal/ubuntu kinetic/main amd64 libpng16-16 amd64 1.6.37-5 [187 kB] Get:64 http://ftpmaster.internal/ubuntu kinetic/main amd64 xz-utils amd64 5.2.5-2.1 [83.8 kB] Get:65 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libctf0 amd64 2.38.50.20220615-4ubuntu1 [103 kB] Get:66 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libctf-nobfd0 amd64 2.38.50.20220615-4ubuntu1 [107 kB] Get:67 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 binutils-x86-64-linux-gnu amd64 2.38.50.20220615-4ubuntu1 [2476 kB] Get:68 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libbinutils amd64 2.38.50.20220615-4ubuntu1 [655 kB] Get:69 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 binutils amd64 2.38.50.20220615-4ubuntu1 [3292 B] Get:70 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 binutils-common amd64 2.38.50.20220615-4ubuntu1 [224 kB] Get:71 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 libgprofng0 amd64 2.38.50.20220615-4ubuntu1 [963 kB] Get:72 http://ftpmaster.internal/ubuntu kinetic/main amd64 g++-11 amd64 11.3.0-3ubuntu1 [11.3 MB] Get:73 http://ftpmaster.internal/ubuntu kinetic/main amd64 libstdc++-11-dev amd64 11.3.0-3ubuntu1 [2074 kB] Get:74 http://ftpmaster.internal/ubuntu kinetic/main amd64 libasan6 amd64 11.3.0-3ubuntu1 [2265 kB] Get:75 http://ftpmaster.internal/ubuntu kinetic/main amd64 gcc-11 amd64 11.3.0-3ubuntu1 [20.0 MB] Get:76 http://ftpmaster.internal/ubuntu kinetic/main amd64 libgcc-11-dev amd64 11.3.0-3ubuntu1 [2484 kB] Get:77 http://ftpmaster.internal/ubuntu kinetic/main amd64 libtsan0 amd64 11.3.0-3ubuntu1 [2241 kB] Get:78 
http://ftpmaster.internal/ubuntu kinetic/main amd64 cpp-11 amd64 11.3.0-3ubuntu1 [9910 kB] Get:79 http://ftpmaster.internal/ubuntu kinetic/main amd64 gcc-11-base amd64 11.3.0-3ubuntu1 [20.8 kB] Get:80 http://ftpmaster.internal/ubuntu kinetic/main amd64 dpkg-dev all 1.21.8ubuntu1 [1069 kB] Get:81 http://ftpmaster.internal/ubuntu kinetic/main amd64 libdpkg-perl all 1.21.8ubuntu1 [237 kB] Get:82 http://ftpmaster.internal/ubuntu kinetic/main amd64 lto-disabled-list all 27 [12.4 kB] Get:83 http://ftpmaster.internal/ubuntu kinetic/main amd64 libfakeroot amd64 1.29-1ubuntu1 [31.1 kB] Get:84 http://ftpmaster.internal/ubuntu kinetic/main amd64 fakeroot amd64 1.29-1ubuntu1 [60.0 kB] Get:85 http://ftpmaster.internal/ubuntu kinetic-proposed/main amd64 linux-libc-dev amd64 5.15.0-35.36+22.10.1 [1287 kB] Get:86 http://ftpmaster.internal/ubuntu kinetic/main amd64 pinentry-curses amd64 1.2.0-1ubuntu1 [34.5 kB] Get:87 http://ftpmaster.internal/ubuntu kinetic/main amd64 policyrcd-script-zg2 all 0.1-3.1 [5730 B] debconf: delaying package configuration, since apt-utils is not installed Fetched 81.1 MB in 1s (103 MB/s) (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../base-files_12.2ubuntu1_amd64.deb ... Unpacking base-files (12.2ubuntu1) over (12ubuntu5) ... Setting up base-files (12.2ubuntu1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../bsdutils_1%3a2.38-4ubuntu1_amd64.deb ... Unpacking bsdutils (1:2.38-4ubuntu1) over (1:2.37.2-4ubuntu3) ... Setting up bsdutils (1:2.38-4ubuntu1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../libzstd1_1.5.2+dfsg-1_amd64.deb ... Unpacking libzstd1:amd64 (1.5.2+dfsg-1) over (1.4.8+dfsg-3build1) ... Setting up libzstd1:amd64 (1.5.2+dfsg-1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../0-libatomic1_12.1.0-4ubuntu1_amd64.deb ... 
Unpacking libatomic1:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../1-libubsan1_12.1.0-4ubuntu1_amd64.deb ... Unpacking libubsan1:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../2-libquadmath0_12.1.0-4ubuntu1_amd64.deb ... Unpacking libquadmath0:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../3-liblsan0_12.1.0-4ubuntu1_amd64.deb ... Unpacking liblsan0:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../4-libitm1_12.1.0-4ubuntu1_amd64.deb ... Unpacking libitm1:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../5-libgomp1_12.1.0-4ubuntu1_amd64.deb ... Unpacking libgomp1:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../6-gcc-12-base_12.1.0-4ubuntu1_amd64.deb ... Unpacking gcc-12-base:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Setting up gcc-12-base:amd64 (12.1.0-4ubuntu1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../libgcc-s1_12.1.0-4ubuntu1_amd64.deb ... Unpacking libgcc-s1:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Setting up libgcc-s1:amd64 (12.1.0-4ubuntu1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../libcc1-0_12.1.0-4ubuntu1_amd64.deb ... Unpacking libcc1-0:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Preparing to unpack .../libstdc++6_12.1.0-4ubuntu1_amd64.deb ... Unpacking libstdc++6:amd64 (12.1.0-4ubuntu1) over (12-20220428-1ubuntu1) ... Setting up libstdc++6:amd64 (12.1.0-4ubuntu1) ... (Reading database ... 13449 files and directories currently installed.) Preparing to unpack .../libgpg-error0_1.45-2_amd64.deb ... Unpacking libgpg-error0:amd64 (1.45-2) over (1.43-3) ... Setting up libgpg-error0:amd64 (1.45-2) ... (Reading database ... 13449 files and directories currently installed.) 
Preparing to unpack .../libgcrypt20_1.10.1-2ubuntu1_amd64.deb ... Unpacking libgcrypt20:amd64 (1.10.1-2ubuntu1) over (1.9.4-3ubuntu3) ... Setting up libgcrypt20:amd64 (1.10.1-2ubuntu1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../liblzma5_5.2.5-2.1_amd64.deb ... Unpacking liblzma5:amd64 (5.2.5-2.1) over (5.2.5-2ubuntu1) ... Setting up liblzma5:amd64 (5.2.5-2.1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../systemd-sysv_249.11-0ubuntu4_amd64.deb ... Unpacking systemd-sysv (249.11-0ubuntu4) over (249.11-0ubuntu3.1) ... Preparing to unpack .../libapparmor1_3.0.4-2ubuntu3_amd64.deb ... Unpacking libapparmor1:amd64 (3.0.4-2ubuntu3) over (3.0.4-2ubuntu2) ... Preparing to unpack .../libgnutls30_3.7.6-2ubuntu1_amd64.deb ... Unpacking libgnutls30:amd64 (3.7.6-2ubuntu1) over (3.7.4-2ubuntu1) ... Setting up libgnutls30:amd64 (3.7.6-2ubuntu1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../libip4tc2_1.8.7-1ubuntu6_amd64.deb ... Unpacking libip4tc2:amd64 (1.8.7-1ubuntu6) over (1.8.7-1ubuntu5) ... Preparing to unpack .../libblkid1_2.38-4ubuntu1_amd64.deb ... Unpacking libblkid1:amd64 (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... Setting up libblkid1:amd64 (2.38-4ubuntu1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../libselinux1_3.4-1_amd64.deb ... Unpacking libselinux1:amd64 (3.4-1) over (3.3-1build2) ... Setting up libselinux1:amd64 (3.4-1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../libmount1_2.38-4ubuntu1_amd64.deb ... Unpacking libmount1:amd64 (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... Setting up libmount1:amd64 (2.38-4ubuntu1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../libsmartcols1_2.38-4ubuntu1_amd64.deb ... 
Unpacking libsmartcols1:amd64 (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... Setting up libsmartcols1:amd64 (2.38-4ubuntu1) ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../login_1%3a4.11.1+dfsg1-2ubuntu1_amd64.deb ... Unpacking login (1:4.11.1+dfsg1-2ubuntu1) over (1:4.8.1-2ubuntu2) ... Setting up login (1:4.11.1+dfsg1-2ubuntu1) ... Installing new version of config file /etc/login.defs ... (Reading database ... 13451 files and directories currently installed.) Preparing to unpack .../util-linux_2.38-4ubuntu1_amd64.deb ... Unpacking util-linux (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... dpkg: warning: unable to delete old directory '/usr/lib/udev': Directory not empty Selecting previously unselected package util-linux-extra. Preparing to unpack .../util-linux-extra_2.38-4ubuntu1_amd64.deb ... Unpacking util-linux-extra (2.38-4ubuntu1) ... Setting up util-linux-extra (2.38-4ubuntu1) ... (Reading database ... 13463 files and directories currently installed.) Preparing to unpack .../mount_2.38-4ubuntu1_amd64.deb ... Unpacking mount (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... Preparing to unpack .../libsemanage-common_3.4-1_all.deb ... Unpacking libsemanage-common (3.4-1) over (3.3-1build2) ... Setting up libsemanage-common (3.4-1) ... Installing new version of config file /etc/selinux/semanage.conf ... (Reading database ... 13463 files and directories currently installed.) Preparing to unpack .../libsepol2_3.4-2_amd64.deb ... Unpacking libsepol2:amd64 (3.4-2) over (3.3-1build1) ... Setting up libsepol2:amd64 (3.4-2) ... (Reading database ... 13463 files and directories currently installed.) Preparing to unpack .../libsemanage2_3.4-1_amd64.deb ... Unpacking libsemanage2:amd64 (3.4-1) over (3.3-1build2) ... Setting up libsemanage2:amd64 (3.4-1) ... (Reading database ... 13463 files and directories currently installed.) Preparing to unpack .../passwd_1%3a4.11.1+dfsg1-2ubuntu1_amd64.deb ... 
Unpacking passwd (1:4.11.1+dfsg1-2ubuntu1) over (1:4.8.1-2ubuntu2) ... Setting up passwd (1:4.11.1+dfsg1-2ubuntu1) ... Installing new version of config file /etc/default/useradd ... (Reading database ... 13463 files and directories currently installed.) Preparing to unpack .../adduser_3.121ubuntu1_all.deb ... Unpacking adduser (3.121ubuntu1) over (3.118ubuntu5) ... Setting up adduser (3.121ubuntu1) ... Installing new version of config file /etc/deluser.conf ... (Reading database ... 13460 files and directories currently installed.) Preparing to unpack .../systemd_249.11-0ubuntu4_amd64.deb ... Unpacking systemd (249.11-0ubuntu4) over (249.11-0ubuntu3.1) ... Preparing to unpack .../libsystemd0_249.11-0ubuntu4_amd64.deb ... Unpacking libsystemd0:amd64 (249.11-0ubuntu4) over (249.11-0ubuntu3.1) ... Setting up libsystemd0:amd64 (249.11-0ubuntu4) ... (Reading database ... 13460 files and directories currently installed.) Preparing to unpack .../libudev1_249.11-0ubuntu4_amd64.deb ... Unpacking libudev1:amd64 (249.11-0ubuntu4) over (249.11-0ubuntu3.1) ... Setting up libudev1:amd64 (249.11-0ubuntu4) ... (Reading database ... 13460 files and directories currently installed.) Preparing to unpack .../libapt-pkg6.0_2.5.0_amd64.deb ... Unpacking libapt-pkg6.0:amd64 (2.5.0) over (2.4.5) ... Setting up libapt-pkg6.0:amd64 (2.5.0) ... (Reading database ... 13460 files and directories currently installed.) Preparing to unpack .../dpkg_1.21.8ubuntu1_amd64.deb ... Unpacking dpkg (1.21.8ubuntu1) over (1.21.1ubuntu2) ... Setting up dpkg (1.21.8ubuntu1) ... (Reading database ... 13464 files and directories currently installed.) Preparing to unpack .../dash_0.5.11+git20210903+057cd650a4ed-8ubuntu1_amd64.deb ... Unpacking dash (0.5.11+git20210903+057cd650a4ed-8ubuntu1) over (0.5.11+git20210903+057cd650a4ed-3build1) ... Setting up dash (0.5.11+git20210903+057cd650a4ed-8ubuntu1) ... (Reading database ... 13465 files and directories currently installed.) 
Preparing to unpack .../ncurses-bin_6.3+20220423-2_amd64.deb ... Unpacking ncurses-bin (6.3+20220423-2) over (6.3-2) ... Setting up ncurses-bin (6.3+20220423-2) ... (Reading database ... 13465 files and directories currently installed.) Preparing to unpack .../init-system-helpers_1.63_all.deb ... Unpacking init-system-helpers (1.63) over (1.62) ... Setting up init-system-helpers (1.63) ... (Reading database ... 13465 files and directories currently installed.) Preparing to unpack .../ncurses-base_6.3+20220423-2_all.deb ... Unpacking ncurses-base (6.3+20220423-2) over (6.3-2) ... Setting up ncurses-base (6.3+20220423-2) ... (Reading database ... 13466 files and directories currently installed.) Preparing to unpack .../archives/apt_2.5.0_amd64.deb ... Unpacking apt (2.5.0) over (2.4.5) ... Setting up apt (2.5.0) ... (Reading database ... 13467 files and directories currently installed.) Preparing to unpack .../logsave_1.46.5-2ubuntu2_amd64.deb ... Unpacking logsave (1.46.5-2ubuntu2) over (1.46.5-2ubuntu1) ... Preparing to unpack .../libext2fs2_1.46.5-2ubuntu2_amd64.deb ... Unpacking libext2fs2:amd64 (1.46.5-2ubuntu2) over (1.46.5-2ubuntu1) ... Setting up libext2fs2:amd64 (1.46.5-2ubuntu2) ... (Reading database ... 13467 files and directories currently installed.) Preparing to unpack .../e2fsprogs_1.46.5-2ubuntu2_amd64.deb ... Unpacking e2fsprogs (1.46.5-2ubuntu2) over (1.46.5-2ubuntu1) ... Setting up libapparmor1:amd64 (3.0.4-2ubuntu3) ... Setting up libip4tc2:amd64 (1.8.7-1ubuntu6) ... Setting up util-linux (2.38-4ubuntu1) ... Setting up mount (2.38-4ubuntu1) ... Setting up systemd (249.11-0ubuntu4) ... Initializing machine ID from random generator. Setting up systemd-sysv (249.11-0ubuntu4) ... (Reading database ... 13467 files and directories currently installed.) Preparing to unpack .../archives/init_1.63_amd64.deb ... Unpacking init (1.63) over (1.62) ... Preparing to unpack .../libcap-ng0_0.8.3-1_amd64.deb ... 
Unpacking libcap-ng0:amd64 (0.8.3-1) over (0.7.9-2.2build3) ... Setting up libcap-ng0:amd64 (0.8.3-1) ... (Reading database ... 13469 files and directories currently installed.) Preparing to unpack .../libdb5.3_5.3.28+dfsg1-0.9_amd64.deb ... Unpacking libdb5.3:amd64 (5.3.28+dfsg1-0.9) over (5.3.28+dfsg1-0.8ubuntu3) ... Setting up libdb5.3:amd64 (5.3.28+dfsg1-0.9) ... (Reading database ... 13469 files and directories currently installed.) Preparing to unpack .../libncurses6_6.3+20220423-2_amd64.deb ... Unpacking libncurses6:amd64 (6.3+20220423-2) over (6.3-2) ... Preparing to unpack .../libncursesw6_6.3+20220423-2_amd64.deb ... Unpacking libncursesw6:amd64 (6.3+20220423-2) over (6.3-2) ... Preparing to unpack .../libtinfo6_6.3+20220423-2_amd64.deb ... Unpacking libtinfo6:amd64 (6.3+20220423-2) over (6.3-2) ... Setting up libtinfo6:amd64 (6.3+20220423-2) ... (Reading database ... 13468 files and directories currently installed.) Preparing to unpack .../libuuid1_2.38-4ubuntu1_amd64.deb ... Unpacking libuuid1:amd64 (2.38-4ubuntu1) over (2.37.2-4ubuntu3) ... Setting up libuuid1:amd64 (2.38-4ubuntu1) ... (Reading database ... 13469 files and directories currently installed.) Preparing to unpack .../00-libcom-err2_1.46.5-2ubuntu2_amd64.deb ... Unpacking libcom-err2:amd64 (1.46.5-2ubuntu2) over (1.46.5-2ubuntu1) ... Preparing to unpack .../01-libss2_1.46.5-2ubuntu2_amd64.deb ... Unpacking libss2:amd64 (1.46.5-2ubuntu2) over (1.46.5-2ubuntu1) ... Preparing to unpack .../02-mawk_1.3.4.20200120-3.1_amd64.deb ... Unpacking mawk (1.3.4.20200120-3.1) over (1.3.4.20200120-3) ... Preparing to unpack .../03-libkeyutils1_1.6.1-3ubuntu1_amd64.deb ... Unpacking libkeyutils1:amd64 (1.6.1-3ubuntu1) over (1.6.1-2ubuntu3) ... Preparing to unpack .../04-readline-common_8.1.2-1.2_all.deb ... Unpacking readline-common (8.1.2-1.2) over (8.1.2-1) ... Preparing to unpack .../05-libreadline8_8.1.2-1.2_amd64.deb ... Unpacking libreadline8:amd64 (8.1.2-1.2) over (8.1.2-1) ... 
Preparing to unpack .../06-libsqlite3-0_3.38.5-1_amd64.deb ... Unpacking libsqlite3-0:amd64 (3.38.5-1) over (3.37.2-2) ... Preparing to unpack .../07-libssl3_3.0.3-5ubuntu3_amd64.deb ... Unpacking libssl3:amd64 (3.0.3-5ubuntu3) over (3.0.2-0ubuntu1) ... Preparing to unpack .../08-openssl_3.0.3-5ubuntu3_amd64.deb ... Unpacking openssl (3.0.3-5ubuntu3) over (3.0.2-0ubuntu1) ... Preparing to unpack .../09-libpng16-16_1.6.37-5_amd64.deb ... Unpacking libpng16-16:amd64 (1.6.37-5) over (1.6.37-3build5) ... Preparing to unpack .../10-xz-utils_5.2.5-2.1_amd64.deb ... Unpacking xz-utils (5.2.5-2.1) over (5.2.5-2ubuntu1) ... Preparing to unpack .../11-libctf0_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking libctf0:amd64 (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Preparing to unpack .../12-libctf-nobfd0_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking libctf-nobfd0:amd64 (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Preparing to unpack .../13-binutils-x86-64-linux-gnu_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking binutils-x86-64-linux-gnu (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Preparing to unpack .../14-libbinutils_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking libbinutils:amd64 (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Preparing to unpack .../15-binutils_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking binutils (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Preparing to unpack .../16-binutils-common_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking binutils-common:amd64 (2.38.50.20220615-4ubuntu1) over (2.38-3ubuntu1) ... Selecting previously unselected package libgprofng0:amd64. Preparing to unpack .../17-libgprofng0_2.38.50.20220615-4ubuntu1_amd64.deb ... Unpacking libgprofng0:amd64 (2.38.50.20220615-4ubuntu1) ... Preparing to unpack .../18-g++-11_11.3.0-3ubuntu1_amd64.deb ... Unpacking g++-11 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../19-libstdc++-11-dev_11.3.0-3ubuntu1_amd64.deb ... 
Unpacking libstdc++-11-dev:amd64 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../20-libasan6_11.3.0-3ubuntu1_amd64.deb ... Unpacking libasan6:amd64 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../21-gcc-11_11.3.0-3ubuntu1_amd64.deb ... Unpacking gcc-11 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../22-libgcc-11-dev_11.3.0-3ubuntu1_amd64.deb ... Unpacking libgcc-11-dev:amd64 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../23-libtsan0_11.3.0-3ubuntu1_amd64.deb ... Unpacking libtsan0:amd64 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../24-cpp-11_11.3.0-3ubuntu1_amd64.deb ... Unpacking cpp-11 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../25-gcc-11-base_11.3.0-3ubuntu1_amd64.deb ... Unpacking gcc-11-base:amd64 (11.3.0-3ubuntu1) over (11.3.0-1ubuntu1) ... Preparing to unpack .../26-dpkg-dev_1.21.8ubuntu1_all.deb ... Unpacking dpkg-dev (1.21.8ubuntu1) over (1.21.1ubuntu2) ... Preparing to unpack .../27-libdpkg-perl_1.21.8ubuntu1_all.deb ... Unpacking libdpkg-perl (1.21.8ubuntu1) over (1.21.1ubuntu2) ... Preparing to unpack .../28-lto-disabled-list_27_all.deb ... Unpacking lto-disabled-list (27) over (25) ... Preparing to unpack .../29-libfakeroot_1.29-1ubuntu1_amd64.deb ... Unpacking libfakeroot:amd64 (1.29-1ubuntu1) over (1.28-1ubuntu1) ... Preparing to unpack .../30-fakeroot_1.29-1ubuntu1_amd64.deb ... Unpacking fakeroot (1.29-1ubuntu1) over (1.28-1ubuntu1) ... Preparing to unpack .../31-linux-libc-dev_5.15.0-35.36+22.10.1_amd64.deb ... Unpacking linux-libc-dev:amd64 (5.15.0-35.36+22.10.1) over (5.15.0-27.28) ... Preparing to unpack .../32-pinentry-curses_1.2.0-1ubuntu1_amd64.deb ... Unpacking pinentry-curses (1.2.0-1ubuntu1) over (1.1.1-1build2) ... Preparing to unpack .../33-policyrcd-script-zg2_0.1-3.1_all.deb ... Unpacking policyrcd-script-zg2 (0.1-3.1) over (0.1-3) ... Setting up gcc-11-base:amd64 (11.3.0-3ubuntu1) ... 
Setting up libkeyutils1:amd64 (1.6.1-3ubuntu1) ... Setting up lto-disabled-list (27) ... Setting up init (1.63) ... Setting up libsqlite3-0:amd64 (3.38.5-1) ... Setting up binutils-common:amd64 (2.38.50.20220615-4ubuntu1) ... Setting up libssl3:amd64 (3.0.3-5ubuntu3) ... Setting up linux-libc-dev:amd64 (5.15.0-35.36+22.10.1) ... Setting up libctf-nobfd0:amd64 (2.38.50.20220615-4ubuntu1) ... Setting up libcom-err2:amd64 (1.46.5-2ubuntu2) ... Setting up libgomp1:amd64 (12.1.0-4ubuntu1) ... Setting up libfakeroot:amd64 (1.29-1ubuntu1) ... Setting up libasan6:amd64 (11.3.0-3ubuntu1) ... Setting up fakeroot (1.29-1ubuntu1) ... Setting up policyrcd-script-zg2 (0.1-3.1) ... Setting up libncurses6:amd64 (6.3+20220423-2) ... Setting up xz-utils (5.2.5-2.1) ... Setting up libquadmath0:amd64 (12.1.0-4ubuntu1) ... Setting up libpng16-16:amd64 (1.6.37-5) ... Setting up libatomic1:amd64 (12.1.0-4ubuntu1) ... Setting up libss2:amd64 (1.46.5-2ubuntu2) ... Setting up libncursesw6:amd64 (6.3+20220423-2) ... Setting up libdpkg-perl (1.21.8ubuntu1) ... Setting up logsave (1.46.5-2ubuntu2) ... Setting up libubsan1:amd64 (12.1.0-4ubuntu1) ... Setting up mawk (1.3.4.20200120-3.1) ... Setting up libbinutils:amd64 (2.38.50.20220615-4ubuntu1) ... Setting up openssl (3.0.3-5ubuntu3) ... Installing new version of config file /etc/ssl/openssl.cnf ... Setting up readline-common (8.1.2-1.2) ... Setting up libcc1-0:amd64 (12.1.0-4ubuntu1) ... Setting up liblsan0:amd64 (12.1.0-4ubuntu1) ... Setting up libitm1:amd64 (12.1.0-4ubuntu1) ... Setting up libtsan0:amd64 (11.3.0-3ubuntu1) ... Setting up libctf0:amd64 (2.38.50.20220615-4ubuntu1) ... Setting up pinentry-curses (1.2.0-1ubuntu1) ... Setting up cpp-11 (11.3.0-3ubuntu1) ... Setting up libreadline8:amd64 (8.1.2-1.2) ... Setting up e2fsprogs (1.46.5-2ubuntu2) ... Setting up libgprofng0:amd64 (2.38.50.20220615-4ubuntu1) ... Setting up libgcc-11-dev:amd64 (11.3.0-3ubuntu1) ... Setting up binutils-x86-64-linux-gnu (2.38.50.20220615-4ubuntu1) ... 
Setting up binutils (2.38.50.20220615-4ubuntu1) ...
Setting up dpkg-dev (1.21.8ubuntu1) ...
Setting up libstdc++-11-dev:amd64 (11.3.0-3ubuntu1) ...
Setting up gcc-11 (11.3.0-3ubuntu1) ...
Setting up g++-11 (11.3.0-3ubuntu1) ...
Processing triggers for debianutils (5.7-0.2) ...
Processing triggers for libc-bin (2.35-0ubuntu3) ...
RUN: /usr/share/launchpad-buildd/bin/sbuild-package PACKAGEBUILD-24109081 amd64 kinetic-proposed -c chroot:build-PACKAGEBUILD-24109081 --arch=amd64 --dist=kinetic-proposed --nolog -A 'gemmlowp_0.0~git20211220.e844ffd-1.dsc'
Initiating build PACKAGEBUILD-24109081 with 4 jobs across 4 processor cores.
Kernel reported to sbuild: 5.4.0-120-generic #136-Ubuntu SMP Fri Jun 10 13:40:48 UTC 2022 x86_64
sbuild (Debian sbuild) 0.79.0 (05 February 2020) on lcy02-amd64-088.buildd

+==============================================================================+
| gemmlowp 0.0~git20211220.e844ffd-1 (amd64) Fri, 24 Jun 2022 11:16:39 +0000 |
+==============================================================================+

Package: gemmlowp
Version: 0.0~git20211220.e844ffd-1
Source Version: 0.0~git20211220.e844ffd-1
Distribution: kinetic-proposed
Machine Architecture: amd64
Host Architecture: amd64
Build Architecture: amd64
Build Type: binary

I: NOTICE: Log filtering will replace 'home/buildd/build-PACKAGEBUILD-24109081/chroot-autobuild' with '<<CHROOT>>'
I: NOTICE: Log filtering will replace 'build/gemmlowp-yG4j1n/resolver-0xCGX9' with '<<RESOLVERDIR>>'

+------------------------------------------------------------------------------+
| Fetch source files |
+------------------------------------------------------------------------------+

Local sources
-------------

gemmlowp_0.0~git20211220.e844ffd-1.dsc exists in .; copying to chroot
I: NOTICE: Log filtering will replace 'build/gemmlowp-yG4j1n/gemmlowp-0.0~git20211220.e844ffd' with '<<PKGBUILDDIR>>'
I: NOTICE: Log filtering will replace 'build/gemmlowp-yG4j1n' with '<<BUILDDIR>>'

+------------------------------------------------------------------------------+ | Install package build dependencies | +------------------------------------------------------------------------------+ Setup apt archive ----------------- Merged Build-Depends: debhelper-compat (= 13), cmake, build-essential, fakeroot Filtered Build-Depends: debhelper-compat (= 13), cmake, build-essential, fakeroot dpkg-deb: building package 'sbuild-build-depends-main-dummy' in '/<>/apt_archive/sbuild-build-depends-main-dummy.deb'. Ign:1 copy:/<>/apt_archive ./ InRelease Get:2 copy:/<>/apt_archive ./ Release [957 B] Ign:3 copy:/<>/apt_archive ./ Release.gpg Get:4 copy:/<>/apt_archive ./ Sources [373 B] Get:5 copy:/<>/apt_archive ./ Packages [456 B] Fetched 1786 B in 0s (148 kB/s) Reading package lists... Reading package lists... Install main build dependencies (apt-based resolver) ---------------------------------------------------- Installing build dependencies Reading package lists... Building dependency tree... Reading state information... 
The following additional packages will be installed: autoconf automake autopoint autotools-dev bsdextrautils cmake cmake-data debhelper debugedit dh-autoreconf dh-elpa-helper dh-strip-nondeterminism dwz emacsen-common file gettext gettext-base groff-base intltool-debian libarchive-zip-perl libarchive13 libbrotli1 libcurl4 libdebhelper-perl libdw1 libelf1 libexpat1 libfile-stripnondeterminism-perl libicu71 libjsoncpp25 libldap-2.5-0 libmagic-mgc libmagic1 libnghttp2-14 libpipeline1 libpsl5 librhash0 librtmp1 libsasl2-2 libsasl2-modules-db libsigsegv2 libssh-4 libsub-override-perl libtool libuchardet0 libuv1 libxml2 m4 man-db po-debconf Suggested packages: autoconf-archive gnu-standards autoconf-doc cmake-doc ninja-build cmake-format dh-make gettext-doc libasprintf-dev libgettextpo-dev groff lrzip libtool-doc gfortran | fortran95-compiler gcj-jdk m4-doc apparmor less www-browser libmail-box-perl Recommended packages: curl | wget | lynx libarchive-cpio-perl libldap-common publicsuffix libsasl2-modules libltdl-dev libmail-sendmail-perl The following NEW packages will be installed: autoconf automake autopoint autotools-dev bsdextrautils cmake cmake-data debhelper debugedit dh-autoreconf dh-elpa-helper dh-strip-nondeterminism dwz emacsen-common file gettext gettext-base groff-base intltool-debian libarchive-zip-perl libarchive13 libbrotli1 libcurl4 libdebhelper-perl libdw1 libelf1 libexpat1 libfile-stripnondeterminism-perl libicu71 libjsoncpp25 libldap-2.5-0 libmagic-mgc libmagic1 libnghttp2-14 libpipeline1 libpsl5 librhash0 librtmp1 libsasl2-2 libsasl2-modules-db libsigsegv2 libssh-4 libsub-override-perl libtool libuchardet0 libuv1 libxml2 m4 man-db po-debconf sbuild-build-depends-main-dummy 0 upgraded, 51 newly installed, 0 to remove and 0 not upgraded. Need to get 27.6 MB of archives. After this operation, 104 MB of additional disk space will be used. 
Get:1 copy:/<>/apt_archive ./ sbuild-build-depends-main-dummy 0.invalid.0 [672 B] Get:2 http://ftpmaster.internal/ubuntu kinetic/main amd64 libelf1 amd64 0.187-1 [51.3 kB] Get:3 http://ftpmaster.internal/ubuntu kinetic/main amd64 libexpat1 amd64 2.4.8-1 [90.6 kB] Get:4 http://ftpmaster.internal/ubuntu kinetic/main amd64 libicu71 amd64 71.1-3 [10.6 MB] Get:5 http://ftpmaster.internal/ubuntu kinetic/main amd64 libxml2 amd64 2.9.14+dfsg-1 [756 kB] Get:6 http://ftpmaster.internal/ubuntu kinetic/main amd64 bsdextrautils amd64 2.38-4ubuntu1 [71.1 kB] Get:7 http://ftpmaster.internal/ubuntu kinetic/main amd64 libmagic-mgc amd64 1:5.41-4 [257 kB] Get:8 http://ftpmaster.internal/ubuntu kinetic/main amd64 libmagic1 amd64 1:5.41-4 [87.3 kB] Get:9 http://ftpmaster.internal/ubuntu kinetic/main amd64 file amd64 1:5.41-4 [21.5 kB] Get:10 http://ftpmaster.internal/ubuntu kinetic/main amd64 gettext-base amd64 0.21-6 [37.9 kB] Get:11 http://ftpmaster.internal/ubuntu kinetic/main amd64 libuchardet0 amd64 0.0.7-1build2 [76.9 kB] Get:12 http://ftpmaster.internal/ubuntu kinetic/main amd64 groff-base amd64 1.22.4-8build1 [955 kB] Get:13 http://ftpmaster.internal/ubuntu kinetic/main amd64 libnghttp2-14 amd64 1.47.0-1 [69.0 kB] Get:14 http://ftpmaster.internal/ubuntu kinetic/main amd64 libpipeline1 amd64 1.5.6-1 [23.1 kB] Get:15 http://ftpmaster.internal/ubuntu kinetic/main amd64 libpsl5 amd64 0.21.0-1.2build2 [58.4 kB] Get:16 http://ftpmaster.internal/ubuntu kinetic/main amd64 libuv1 amd64 1.44.1-2 [93.1 kB] Get:17 http://ftpmaster.internal/ubuntu kinetic/main amd64 man-db amd64 2.10.2-1 [1187 kB] Get:18 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsigsegv2 amd64 2.13-1ubuntu3 [14.6 kB] Get:19 http://ftpmaster.internal/ubuntu kinetic/main amd64 m4 amd64 1.4.18-5ubuntu2 [199 kB] Get:20 http://ftpmaster.internal/ubuntu kinetic/main amd64 autoconf all 2.71-2 [338 kB] Get:21 http://ftpmaster.internal/ubuntu kinetic/main amd64 autotools-dev all 20220109.1 [44.9 kB] Get:22 
http://ftpmaster.internal/ubuntu kinetic/main amd64 automake all 1:1.16.5-1.3 [558 kB] Get:23 http://ftpmaster.internal/ubuntu kinetic/main amd64 autopoint all 0.21-6 [421 kB] Get:24 http://ftpmaster.internal/ubuntu kinetic/main amd64 libarchive13 amd64 3.6.0-1ubuntu1 [368 kB] Get:25 http://ftpmaster.internal/ubuntu kinetic/main amd64 libbrotli1 amd64 1.0.9-2build6 [315 kB] Get:26 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsasl2-modules-db amd64 2.1.28+dfsg-6ubuntu1 [20.5 kB] Get:27 http://ftpmaster.internal/ubuntu kinetic/main amd64 libsasl2-2 amd64 2.1.28+dfsg-6ubuntu1 [55.9 kB] Get:28 http://ftpmaster.internal/ubuntu kinetic/main amd64 libldap-2.5-0 amd64 2.5.12+dfsg-2ubuntu1 [180 kB] Get:29 http://ftpmaster.internal/ubuntu kinetic/main amd64 librtmp1 amd64 2.4+20151223.gitfa8646d.1-2build4 [58.2 kB] Get:30 http://ftpmaster.internal/ubuntu kinetic/main amd64 libssh-4 amd64 0.9.6-2build1 [184 kB] Get:31 http://ftpmaster.internal/ubuntu kinetic/main amd64 libcurl4 amd64 7.83.1-2 [285 kB] Get:32 http://ftpmaster.internal/ubuntu kinetic/main amd64 libjsoncpp25 amd64 1.9.5-4 [80.3 kB] Get:33 http://ftpmaster.internal/ubuntu kinetic/main amd64 librhash0 amd64 1.4.2-1ubuntu1 [125 kB] Get:34 http://ftpmaster.internal/ubuntu kinetic/main amd64 dh-elpa-helper all 2.0.9ubuntu1 [7610 B] Get:35 http://ftpmaster.internal/ubuntu kinetic/main amd64 emacsen-common all 3.0.4 [14.9 kB] Get:36 http://ftpmaster.internal/ubuntu kinetic/main amd64 cmake-data all 3.23.2-1ubuntu1 [1944 kB] Get:37 http://ftpmaster.internal/ubuntu kinetic/main amd64 cmake amd64 3.23.2-1ubuntu1 [5116 kB] Get:38 http://ftpmaster.internal/ubuntu kinetic/main amd64 libdebhelper-perl all 13.7.1ubuntu1 [66.9 kB] Get:39 http://ftpmaster.internal/ubuntu kinetic/main amd64 libtool all 2.4.7-4 [166 kB] Get:40 http://ftpmaster.internal/ubuntu kinetic/main amd64 dh-autoreconf all 20 [16.1 kB] Get:41 http://ftpmaster.internal/ubuntu kinetic/main amd64 libarchive-zip-perl all 1.68-1 [90.2 kB] Get:42 
http://ftpmaster.internal/ubuntu kinetic/main amd64 libsub-override-perl all 0.09-3 [9756 B] Get:43 http://ftpmaster.internal/ubuntu kinetic/main amd64 libfile-stripnondeterminism-perl all 1.13.0-1 [18.1 kB] Get:44 http://ftpmaster.internal/ubuntu kinetic/main amd64 dh-strip-nondeterminism all 1.13.0-1 [5344 B] Get:45 http://ftpmaster.internal/ubuntu kinetic/main amd64 libdw1 amd64 0.187-1 [247 kB] Get:46 http://ftpmaster.internal/ubuntu kinetic/main amd64 debugedit amd64 1:5.0-4build1 [47.2 kB] Get:47 http://ftpmaster.internal/ubuntu kinetic/main amd64 dwz amd64 0.14-1build2 [105 kB] Get:48 http://ftpmaster.internal/ubuntu kinetic/main amd64 gettext amd64 0.21-6 [862 kB] Get:49 http://ftpmaster.internal/ubuntu kinetic/main amd64 intltool-debian all 0.35.0+20060710.5 [24.9 kB] Get:50 http://ftpmaster.internal/ubuntu kinetic/main amd64 po-debconf all 1.0.21+nmu1 [233 kB] Get:51 http://ftpmaster.internal/ubuntu kinetic/main amd64 debhelper all 13.7.1ubuntu1 [940 kB] debconf: delaying package configuration, since apt-utils is not installed Fetched 27.6 MB in 0s (122 MB/s) Selecting previously unselected package libelf1:amd64. (Reading database ... 13498 files and directories currently installed.) Preparing to unpack .../00-libelf1_0.187-1_amd64.deb ... Unpacking libelf1:amd64 (0.187-1) ... Selecting previously unselected package libexpat1:amd64. Preparing to unpack .../01-libexpat1_2.4.8-1_amd64.deb ... Unpacking libexpat1:amd64 (2.4.8-1) ... Selecting previously unselected package libicu71:amd64. Preparing to unpack .../02-libicu71_71.1-3_amd64.deb ... Unpacking libicu71:amd64 (71.1-3) ... Selecting previously unselected package libxml2:amd64. Preparing to unpack .../03-libxml2_2.9.14+dfsg-1_amd64.deb ... Unpacking libxml2:amd64 (2.9.14+dfsg-1) ... Selecting previously unselected package bsdextrautils. Preparing to unpack .../04-bsdextrautils_2.38-4ubuntu1_amd64.deb ... Unpacking bsdextrautils (2.38-4ubuntu1) ... Selecting previously unselected package libmagic-mgc. 
Preparing to unpack .../05-libmagic-mgc_1%3a5.41-4_amd64.deb ... Unpacking libmagic-mgc (1:5.41-4) ... Selecting previously unselected package libmagic1:amd64. Preparing to unpack .../06-libmagic1_1%3a5.41-4_amd64.deb ... Unpacking libmagic1:amd64 (1:5.41-4) ... Selecting previously unselected package file. Preparing to unpack .../07-file_1%3a5.41-4_amd64.deb ... Unpacking file (1:5.41-4) ... Selecting previously unselected package gettext-base. Preparing to unpack .../08-gettext-base_0.21-6_amd64.deb ... Unpacking gettext-base (0.21-6) ... Selecting previously unselected package libuchardet0:amd64. Preparing to unpack .../09-libuchardet0_0.0.7-1build2_amd64.deb ... Unpacking libuchardet0:amd64 (0.0.7-1build2) ... Selecting previously unselected package groff-base. Preparing to unpack .../10-groff-base_1.22.4-8build1_amd64.deb ... Unpacking groff-base (1.22.4-8build1) ... Selecting previously unselected package libnghttp2-14:amd64. Preparing to unpack .../11-libnghttp2-14_1.47.0-1_amd64.deb ... Unpacking libnghttp2-14:amd64 (1.47.0-1) ... Selecting previously unselected package libpipeline1:amd64. Preparing to unpack .../12-libpipeline1_1.5.6-1_amd64.deb ... Unpacking libpipeline1:amd64 (1.5.6-1) ... Selecting previously unselected package libpsl5:amd64. Preparing to unpack .../13-libpsl5_0.21.0-1.2build2_amd64.deb ... Unpacking libpsl5:amd64 (0.21.0-1.2build2) ... Selecting previously unselected package libuv1:amd64. Preparing to unpack .../14-libuv1_1.44.1-2_amd64.deb ... Unpacking libuv1:amd64 (1.44.1-2) ... Selecting previously unselected package man-db. Preparing to unpack .../15-man-db_2.10.2-1_amd64.deb ... Unpacking man-db (2.10.2-1) ... Selecting previously unselected package libsigsegv2:amd64. Preparing to unpack .../16-libsigsegv2_2.13-1ubuntu3_amd64.deb ... Unpacking libsigsegv2:amd64 (2.13-1ubuntu3) ... Selecting previously unselected package m4. Preparing to unpack .../17-m4_1.4.18-5ubuntu2_amd64.deb ... Unpacking m4 (1.4.18-5ubuntu2) ... 
Selecting previously unselected package autoconf. Preparing to unpack .../18-autoconf_2.71-2_all.deb ... Unpacking autoconf (2.71-2) ... Selecting previously unselected package autotools-dev. Preparing to unpack .../19-autotools-dev_20220109.1_all.deb ... Unpacking autotools-dev (20220109.1) ... Selecting previously unselected package automake. Preparing to unpack .../20-automake_1%3a1.16.5-1.3_all.deb ... Unpacking automake (1:1.16.5-1.3) ... Selecting previously unselected package autopoint. Preparing to unpack .../21-autopoint_0.21-6_all.deb ... Unpacking autopoint (0.21-6) ... Selecting previously unselected package libarchive13:amd64. Preparing to unpack .../22-libarchive13_3.6.0-1ubuntu1_amd64.deb ... Unpacking libarchive13:amd64 (3.6.0-1ubuntu1) ... Selecting previously unselected package libbrotli1:amd64. Preparing to unpack .../23-libbrotli1_1.0.9-2build6_amd64.deb ... Unpacking libbrotli1:amd64 (1.0.9-2build6) ... Selecting previously unselected package libsasl2-modules-db:amd64. Preparing to unpack .../24-libsasl2-modules-db_2.1.28+dfsg-6ubuntu1_amd64.deb ... Unpacking libsasl2-modules-db:amd64 (2.1.28+dfsg-6ubuntu1) ... Selecting previously unselected package libsasl2-2:amd64. Preparing to unpack .../25-libsasl2-2_2.1.28+dfsg-6ubuntu1_amd64.deb ... Unpacking libsasl2-2:amd64 (2.1.28+dfsg-6ubuntu1) ... Selecting previously unselected package libldap-2.5-0:amd64. Preparing to unpack .../26-libldap-2.5-0_2.5.12+dfsg-2ubuntu1_amd64.deb ... Unpacking libldap-2.5-0:amd64 (2.5.12+dfsg-2ubuntu1) ... Selecting previously unselected package librtmp1:amd64. Preparing to unpack .../27-librtmp1_2.4+20151223.gitfa8646d.1-2build4_amd64.deb ... Unpacking librtmp1:amd64 (2.4+20151223.gitfa8646d.1-2build4) ... Selecting previously unselected package libssh-4:amd64. Preparing to unpack .../28-libssh-4_0.9.6-2build1_amd64.deb ... Unpacking libssh-4:amd64 (0.9.6-2build1) ... Selecting previously unselected package libcurl4:amd64. 
Preparing to unpack .../29-libcurl4_7.83.1-2_amd64.deb ... Unpacking libcurl4:amd64 (7.83.1-2) ... Selecting previously unselected package libjsoncpp25:amd64. Preparing to unpack .../30-libjsoncpp25_1.9.5-4_amd64.deb ... Unpacking libjsoncpp25:amd64 (1.9.5-4) ... Selecting previously unselected package librhash0:amd64. Preparing to unpack .../31-librhash0_1.4.2-1ubuntu1_amd64.deb ... Unpacking librhash0:amd64 (1.4.2-1ubuntu1) ... Selecting previously unselected package dh-elpa-helper. Preparing to unpack .../32-dh-elpa-helper_2.0.9ubuntu1_all.deb ... Unpacking dh-elpa-helper (2.0.9ubuntu1) ... Selecting previously unselected package emacsen-common. Preparing to unpack .../33-emacsen-common_3.0.4_all.deb ... Unpacking emacsen-common (3.0.4) ... Selecting previously unselected package cmake-data. Preparing to unpack .../34-cmake-data_3.23.2-1ubuntu1_all.deb ... Unpacking cmake-data (3.23.2-1ubuntu1) ... Selecting previously unselected package cmake. Preparing to unpack .../35-cmake_3.23.2-1ubuntu1_amd64.deb ... Unpacking cmake (3.23.2-1ubuntu1) ... Selecting previously unselected package libdebhelper-perl. Preparing to unpack .../36-libdebhelper-perl_13.7.1ubuntu1_all.deb ... Unpacking libdebhelper-perl (13.7.1ubuntu1) ... Selecting previously unselected package libtool. Preparing to unpack .../37-libtool_2.4.7-4_all.deb ... Unpacking libtool (2.4.7-4) ... Selecting previously unselected package dh-autoreconf. Preparing to unpack .../38-dh-autoreconf_20_all.deb ... Unpacking dh-autoreconf (20) ... Selecting previously unselected package libarchive-zip-perl. Preparing to unpack .../39-libarchive-zip-perl_1.68-1_all.deb ... Unpacking libarchive-zip-perl (1.68-1) ... Selecting previously unselected package libsub-override-perl. Preparing to unpack .../40-libsub-override-perl_0.09-3_all.deb ... Unpacking libsub-override-perl (0.09-3) ... Selecting previously unselected package libfile-stripnondeterminism-perl. 
Preparing to unpack .../41-libfile-stripnondeterminism-perl_1.13.0-1_all.deb ... Unpacking libfile-stripnondeterminism-perl (1.13.0-1) ... Selecting previously unselected package dh-strip-nondeterminism. Preparing to unpack .../42-dh-strip-nondeterminism_1.13.0-1_all.deb ... Unpacking dh-strip-nondeterminism (1.13.0-1) ... Selecting previously unselected package libdw1:amd64. Preparing to unpack .../43-libdw1_0.187-1_amd64.deb ... Unpacking libdw1:amd64 (0.187-1) ... Selecting previously unselected package debugedit. Preparing to unpack .../44-debugedit_1%3a5.0-4build1_amd64.deb ... Unpacking debugedit (1:5.0-4build1) ... Selecting previously unselected package dwz. Preparing to unpack .../45-dwz_0.14-1build2_amd64.deb ... Unpacking dwz (0.14-1build2) ... Selecting previously unselected package gettext. Preparing to unpack .../46-gettext_0.21-6_amd64.deb ... Unpacking gettext (0.21-6) ... Selecting previously unselected package intltool-debian. Preparing to unpack .../47-intltool-debian_0.35.0+20060710.5_all.deb ... Unpacking intltool-debian (0.35.0+20060710.5) ... Selecting previously unselected package po-debconf. Preparing to unpack .../48-po-debconf_1.0.21+nmu1_all.deb ... Unpacking po-debconf (1.0.21+nmu1) ... Selecting previously unselected package debhelper. Preparing to unpack .../49-debhelper_13.7.1ubuntu1_all.deb ... Unpacking debhelper (13.7.1ubuntu1) ... Selecting previously unselected package sbuild-build-depends-main-dummy. Preparing to unpack .../50-sbuild-build-depends-main-dummy_0.invalid.0_amd64.deb ... Unpacking sbuild-build-depends-main-dummy (0.invalid.0) ... Setting up libexpat1:amd64 (2.4.8-1) ... Setting up libpipeline1:amd64 (1.5.6-1) ... Setting up libicu71:amd64 (71.1-3) ... Setting up libpsl5:amd64 (0.21.0-1.2build2) ... Setting up bsdextrautils (2.38-4ubuntu1) ... Setting up libmagic-mgc (1:5.41-4) ... Setting up libarchive-zip-perl (1.68-1) ... Setting up libdebhelper-perl (13.7.1ubuntu1) ... 
Setting up libbrotli1:amd64 (1.0.9-2build6) ... Setting up libnghttp2-14:amd64 (1.47.0-1) ... Setting up libmagic1:amd64 (1:5.41-4) ... Setting up gettext-base (0.21-6) ... Setting up file (1:5.41-4) ... Setting up libsasl2-modules-db:amd64 (2.1.28+dfsg-6ubuntu1) ... Setting up autotools-dev (20220109.1) ... Setting up libuv1:amd64 (1.44.1-2) ... Setting up emacsen-common (3.0.4) ... Setting up librtmp1:amd64 (2.4+20151223.gitfa8646d.1-2build4) ... Setting up dh-elpa-helper (2.0.9ubuntu1) ... Setting up libsigsegv2:amd64 (2.13-1ubuntu3) ... Setting up autopoint (0.21-6) ... Setting up libjsoncpp25:amd64 (1.9.5-4) ... Setting up libsasl2-2:amd64 (2.1.28+dfsg-6ubuntu1) ... Setting up libssh-4:amd64 (0.9.6-2build1) ... Setting up librhash0:amd64 (1.4.2-1ubuntu1) ... Setting up libuchardet0:amd64 (0.0.7-1build2) ... Setting up libsub-override-perl (0.09-3) ... Setting up cmake-data (3.23.2-1ubuntu1) ... Setting up libelf1:amd64 (0.187-1) ... Setting up libxml2:amd64 (2.9.14+dfsg-1) ... Setting up libfile-stripnondeterminism-perl (1.13.0-1) ... Setting up libdw1:amd64 (0.187-1) ... Setting up gettext (0.21-6) ... Setting up libtool (2.4.7-4) ... Setting up libarchive13:amd64 (3.6.0-1ubuntu1) ... Setting up m4 (1.4.18-5ubuntu2) ... Setting up libldap-2.5-0:amd64 (2.5.12+dfsg-2ubuntu1) ... Setting up intltool-debian (0.35.0+20060710.5) ... Setting up autoconf (2.71-2) ... Setting up dh-strip-nondeterminism (1.13.0-1) ... Setting up dwz (0.14-1build2) ... Setting up groff-base (1.22.4-8build1) ... Setting up debugedit (1:5.0-4build1) ... Setting up libcurl4:amd64 (7.83.1-2) ... Setting up automake (1:1.16.5-1.3) ... update-alternatives: using /usr/bin/automake-1.16 to provide /usr/bin/automake (automake) in auto mode Setting up po-debconf (1.0.21+nmu1) ... Setting up man-db (2.10.2-1) ... Not building database; man-db/auto-update is not 'true'. Created symlink /etc/systemd/system/timers.target.wants/man-db.timer → /lib/systemd/system/man-db.timer. 
Setting up dh-autoreconf (20) ... Setting up cmake (3.23.2-1ubuntu1) ... Setting up debhelper (13.7.1ubuntu1) ... Setting up sbuild-build-depends-main-dummy (0.invalid.0) ... Processing triggers for libc-bin (2.35-0ubuntu3) ... +------------------------------------------------------------------------------+ | Check architectures | +------------------------------------------------------------------------------+ Arch check ok (amd64 included in any) +------------------------------------------------------------------------------+ | Build environment | +------------------------------------------------------------------------------+ Kernel: Linux 5.4.0-120-generic #136-Ubuntu SMP Fri Jun 10 13:40:48 UTC 2022 amd64 (x86_64) Toolchain package versions: binutils_2.38.50.20220615-4ubuntu1 dpkg-dev_1.21.8ubuntu1 g++-11_11.3.0-3ubuntu1 gcc-11_11.3.0-3ubuntu1 libc6-dev_2.35-0ubuntu3 libstdc++-11-dev_11.3.0-3ubuntu1 libstdc++6_12.1.0-4ubuntu1 linux-libc-dev_5.15.0-35.36+22.10.1 Package versions: adduser_3.121ubuntu1 advancecomp_2.1-2.1ubuntu2 apt_2.5.0 autoconf_2.71-2 automake_1:1.16.5-1.3 autopoint_0.21-6 autotools-dev_20220109.1 base-files_12.2ubuntu1 base-passwd_3.5.52build1 bash_5.1-6ubuntu1 binutils_2.38.50.20220615-4ubuntu1 binutils-common_2.38.50.20220615-4ubuntu1 binutils-x86-64-linux-gnu_2.38.50.20220615-4ubuntu1 bsdextrautils_2.38-4ubuntu1 bsdutils_1:2.38-4ubuntu1 build-essential_12.9ubuntu3 bzip2_1.0.8-5build1 ca-certificates_20211016 cmake_3.23.2-1ubuntu1 cmake-data_3.23.2-1ubuntu1 coreutils_8.32-4.1ubuntu1 cpp_4:11.2.0-1ubuntu1 cpp-11_11.3.0-3ubuntu1 dash_0.5.11+git20210903+057cd650a4ed-8ubuntu1 debconf_1.5.79ubuntu1 debhelper_13.7.1ubuntu1 debianutils_5.7-0.2 debugedit_1:5.0-4build1 dh-autoreconf_20 dh-elpa-helper_2.0.9ubuntu1 dh-strip-nondeterminism_1.13.0-1 diffutils_1:3.8-0ubuntu2 dpkg_1.21.8ubuntu1 dpkg-dev_1.21.8ubuntu1 dwz_0.14-1build2 e2fsprogs_1.46.5-2ubuntu2 emacsen-common_3.0.4 fakeroot_1.29-1ubuntu1 file_1:5.41-4 findutils_4.8.0-1ubuntu3 
g++_4:11.2.0-1ubuntu1 g++-11_11.3.0-3ubuntu1 gcc_4:11.2.0-1ubuntu1 gcc-11_11.3.0-3ubuntu1 gcc-11-base_11.3.0-3ubuntu1 gcc-12-base_12.1.0-4ubuntu1 gettext_0.21-6 gettext-base_0.21-6 gpg_2.2.27-3ubuntu2 gpg-agent_2.2.27-3ubuntu2 gpgconf_2.2.27-3ubuntu2 gpgv_2.2.27-3ubuntu2 grep_3.7-1build1 groff-base_1.22.4-8build1 gzip_1.10-4ubuntu4 hostname_3.23ubuntu2 init_1.63 init-system-helpers_1.63 intltool-debian_0.35.0+20060710.5 libacl1_2.3.1-1 libapparmor1_3.0.4-2ubuntu3 libapt-pkg6.0_2.5.0 libarchive-zip-perl_1.68-1 libarchive13_3.6.0-1ubuntu1 libargon2-1_0~20171227-0.3 libasan6_11.3.0-3ubuntu1 libassuan0_2.5.5-3 libatomic1_12.1.0-4ubuntu1 libattr1_1:2.5.1-1build1 libaudit-common_1:3.0.7-1build1 libaudit1_1:3.0.7-1build1 libbinutils_2.38.50.20220615-4ubuntu1 libblkid1_2.38-4ubuntu1 libbrotli1_1.0.9-2build6 libbz2-1.0_1.0.8-5build1 libc-bin_2.35-0ubuntu3 libc-dev-bin_2.35-0ubuntu3 libc6_2.35-0ubuntu3 libc6-dev_2.35-0ubuntu3 libcap-ng0_0.8.3-1 libcap2_1:2.44-1build3 libcc1-0_12.1.0-4ubuntu1 libcom-err2_1.46.5-2ubuntu2 libcrypt-dev_1:4.4.27-1 libcrypt1_1:4.4.27-1 libcryptsetup12_2:2.4.3-1ubuntu1 libctf-nobfd0_2.38.50.20220615-4ubuntu1 libctf0_2.38.50.20220615-4ubuntu1 libcurl4_7.83.1-2 libdb5.3_5.3.28+dfsg1-0.9 libdebconfclient0_0.261ubuntu1 libdebhelper-perl_13.7.1ubuntu1 libdevmapper1.02.1_2:1.02.175-2.1ubuntu4 libdpkg-perl_1.21.8ubuntu1 libdw1_0.187-1 libelf1_0.187-1 libexpat1_2.4.8-1 libext2fs2_1.46.5-2ubuntu2 libfakeroot_1.29-1ubuntu1 libffi8_3.4.2-4 libfile-stripnondeterminism-perl_1.13.0-1 libgcc-11-dev_11.3.0-3ubuntu1 libgcc-s1_12.1.0-4ubuntu1 libgcrypt20_1.10.1-2ubuntu1 libgdbm-compat4_1.23-1 libgdbm6_1.23-1 libgmp10_2:6.2.1+dfsg-3ubuntu1 libgnutls30_3.7.6-2ubuntu1 libgomp1_12.1.0-4ubuntu1 libgpg-error0_1.45-2 libgprofng0_2.38.50.20220615-4ubuntu1 libgssapi-krb5-2_1.19.2-2 libhogweed6_3.7.3-1build2 libicu71_71.1-3 libidn2-0_2.3.2-2build1 libip4tc2_1.8.7-1ubuntu6 libisl23_0.24-2build1 libitm1_12.1.0-4ubuntu1 libjson-c5_0.16-1 libjsoncpp25_1.9.5-4 
libk5crypto3_1.19.2-2 libkeyutils1_1.6.1-3ubuntu1 libkmod2_29-1ubuntu1 libkrb5-3_1.19.2-2 libkrb5support0_1.19.2-2 libldap-2.5-0_2.5.12+dfsg-2ubuntu1 liblockfile-bin_1.17-1build2 liblockfile1_1.17-1build2 liblsan0_12.1.0-4ubuntu1 liblz4-1_1.9.3-2build2 liblzma5_5.2.5-2.1 libmagic-mgc_1:5.41-4 libmagic1_1:5.41-4 libmount1_2.38-4ubuntu1 libmpc3_1.2.1-2build1 libmpfr6_4.1.0-3build3 libncurses6_6.3+20220423-2 libncursesw6_6.3+20220423-2 libnettle8_3.7.3-1build2 libnghttp2-14_1.47.0-1 libnpth0_1.6-3build2 libnsl-dev_1.3.0-2build2 libnsl2_1.3.0-2build2 libp11-kit0_0.24.1-1 libpam-modules_1.4.0-13ubuntu1 libpam-modules-bin_1.4.0-13ubuntu1 libpam-runtime_1.4.0-13ubuntu1 libpam0g_1.4.0-13ubuntu1 libpcre2-8-0_10.40-1 libpcre3_2:8.39-14 libperl5.34_5.34.0-3ubuntu1 libpipeline1_1.5.6-1 libpng16-16_1.6.37-5 libprocps8_2:3.3.17-6ubuntu2 libpsl5_0.21.0-1.2build2 libquadmath0_12.1.0-4ubuntu1 libreadline8_8.1.2-1.2 librhash0_1.4.2-1ubuntu1 librtmp1_2.4+20151223.gitfa8646d.1-2build4 libsasl2-2_2.1.28+dfsg-6ubuntu1 libsasl2-modules-db_2.1.28+dfsg-6ubuntu1 libseccomp2_2.5.4-1ubuntu1 libselinux1_3.4-1 libsemanage-common_3.4-1 libsemanage2_3.4-1 libsepol2_3.4-2 libsigsegv2_2.13-1ubuntu3 libsmartcols1_2.38-4ubuntu1 libsqlite3-0_3.38.5-1 libss2_1.46.5-2ubuntu2 libssh-4_0.9.6-2build1 libssl3_3.0.3-5ubuntu3 libstdc++-11-dev_11.3.0-3ubuntu1 libstdc++6_12.1.0-4ubuntu1 libsub-override-perl_0.09-3 libsystemd0_249.11-0ubuntu4 libtasn1-6_4.18.0-4build1 libtinfo6_6.3+20220423-2 libtirpc-common_1.3.2-2build1 libtirpc-dev_1.3.2-2build1 libtirpc3_1.3.2-2build1 libtool_2.4.7-4 libtsan0_11.3.0-3ubuntu1 libubsan1_12.1.0-4ubuntu1 libuchardet0_0.0.7-1build2 libudev1_249.11-0ubuntu4 libunistring2_1.0-1 libuuid1_2.38-4ubuntu1 libuv1_1.44.1-2 libxml2_2.9.14+dfsg-1 libxxhash0_0.8.1-1 libzstd1_1.5.2+dfsg-1 linux-libc-dev_5.15.0-35.36+22.10.1 lockfile-progs_0.1.19build1 login_1:4.11.1+dfsg1-2ubuntu1 logsave_1.46.5-2ubuntu2 lsb-base_11.1.0ubuntu4 lto-disabled-list_27 m4_1.4.18-5ubuntu2 make_4.3-4.1build1 
man-db_2.10.2-1 mawk_1.3.4.20200120-3.1 mount_2.38-4ubuntu1 ncurses-base_6.3+20220423-2 ncurses-bin_6.3+20220423-2 openssl_3.0.3-5ubuntu3 optipng_0.7.7-2build1 passwd_1:4.11.1+dfsg1-2ubuntu1 patch_2.7.6-7build2 perl_5.34.0-3ubuntu1 perl-base_5.34.0-3ubuntu1 perl-modules-5.34_5.34.0-3ubuntu1 pinentry-curses_1.2.0-1ubuntu1 pkgbinarymangler_149 po-debconf_1.0.21+nmu1 policyrcd-script-zg2_0.1-3.1 procps_2:3.3.17-6ubuntu2 readline-common_8.1.2-1.2 rpcsvc-proto_1.4.2-0ubuntu6 sbuild-build-depends-main-dummy_0.invalid.0 sed_4.8-1ubuntu2 sensible-utils_0.0.17 systemd_249.11-0ubuntu4 systemd-sysv_249.11-0ubuntu4 sysvinit-utils_3.01-1ubuntu1 tar_1.34+dfsg-1build3 tzdata_2022a-0ubuntu1 ubuntu-keyring_2021.03.26 usrmerge_25ubuntu2 util-linux_2.38-4ubuntu1 util-linux-extra_2.38-4ubuntu1 xz-utils_5.2.5-2.1 zlib1g_1:1.2.11.dfsg-2ubuntu9 +------------------------------------------------------------------------------+ | Build | +------------------------------------------------------------------------------+ Unpack source ------------- -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA512 Format: 3.0 (quilt) Source: gemmlowp Binary: libgemmlowp-dev Architecture: any Version: 0.0~git20211220.e844ffd-1 Maintainer: Debian Science Maintainers Uploaders: Mo Zhou Homepage: https://github.com/google/gemmlowp Standards-Version: 4.5.0 Vcs-Browser: https://salsa.debian.org/science-team/gemmlowp Vcs-Git: https://salsa.debian.org/science-team/gemmlowp.git Build-Depends: debhelper-compat (= 13), cmake Package-List: libgemmlowp-dev deb libdevel optional arch=any Checksums-Sha1: f7b4ad36b046a404ce7f7b0df1b674d747e73430 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz c31cbd302d97bf431aed316aa8a6d4ce74a11007 3372 gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz Checksums-Sha256: b43ad962deaa41ba6727888a48ecf4dad181a70fcdd82505d9dbdcc8388a98e7 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz a21ebb88383583856d79b7cc2e04e3d2abf162801a749e9c486404088316bfd6 3372 
gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz Files: abdf0c81faac4c2340657290fb5a9354 544168 gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz ee68dd119e3f3cd7036072821b60c274 3372 gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz -----BEGIN PGP SIGNATURE----- iQJFBAEBCgAvFiEEY4vHXsHlxYkGfjXeYmRes19oaooFAmK1UhgRHGx1bWluQGRl Ymlhbi5vcmcACgkQYmRes19oaopA/Q/+PFnDxewMprNXwTj8nFzpsi/yZ7OjBvXO y00AxsZD6+7eS039S/VpbjQ43eJM56kT0p2kqZN4bueJOQI2BCcPtGN1k/fHixlv By+L2BQoXyownTD1WhfylJuySCY1tghN/zLl4G7Xr0daWyrZz9nCBXVx+JzVQg7V AFjwPDD5ox5ZwoCKlFMOsJr9HA5sEV1AbmCAvJ3lf9qOPuaD03HefGjSRyRhFSR2 zU9yHqYS/DdmyzuqL4TLV5hnmTeS7H+xIaNHW6pN46nlA3ZTGPKc/0Rn5edGhYtS PrFar/NJ0jri7qFC4ck+kWR/fz6oP9FLmr5E4Da5aGOruqU3g4ByeSmg6z9on5fe MPJoHrf7kwCPa9VopOmfI7SoI6f7tyJ9JDLe2NprXiLryuFRw3/X01RH9V0Y9BPa KgOscXbJZwl3XdbtMa2Me3UKkgQQsZB5kRyHBFNhX6a7noxEoFTOMSLhGw3tIwpd U4CG1ddZqASnWBn7RjQNeLxLMvPo2mK9IkzQm+sHpNONf1GisUAS0IHKARAuxgsk Na2r8mvIHMv4Y2nmHKUkd3U9OupXusFZOapV2PBgjI2AZ9h9OJwUPVs9SGseogbx yKF2U3bZqf2f1OJKlw0kjN+x6oO3NtA6+H3fDPczmgQN0UliIOW39dkDtqzE5t8Z tu4VkVxZLX4= =AWaR -----END PGP SIGNATURE----- gpgv: Signature made Fri Jun 24 05:56:40 2022 UTC gpgv: using RSA key 638BC75EC1E5C589067E35DE62645EB35F686A8A gpgv: issuer "lumin@debian.org" gpgv: Can't check signature: No public key dpkg-source: warning: cannot verify signature ./gemmlowp_0.0~git20211220.e844ffd-1.dsc dpkg-source: info: extracting gemmlowp in /<> dpkg-source: info: unpacking gemmlowp_0.0~git20211220.e844ffd.orig.tar.xz dpkg-source: info: unpacking gemmlowp_0.0~git20211220.e844ffd-1.debian.tar.xz dpkg-source: info: using patch list from debian/patches/series dpkg-source: info: applying 0001-cmake-build-fix.patch Check disk space ---------------- Sufficient free space for build User Environment ---------------- APT_CONFIG=/var/lib/sbuild/apt.conf DEB_BUILD_OPTIONS=parallel=4 HOME=/sbuild-nonexistent LANG=C.UTF-8 LC_ALL=C.UTF-8 LOGNAME=buildd PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games 
SCHROOT_ALIAS_NAME=build-PACKAGEBUILD-24109081 SCHROOT_CHROOT_NAME=build-PACKAGEBUILD-24109081 SCHROOT_COMMAND=env SCHROOT_GID=2501 SCHROOT_GROUP=buildd SCHROOT_SESSION_ID=build-PACKAGEBUILD-24109081 SCHROOT_UID=2001 SCHROOT_USER=buildd SHELL=/bin/sh TERM=unknown USER=buildd V=1 dpkg-buildpackage ----------------- Command: dpkg-buildpackage -us -uc -mLaunchpad Build Daemon -b -rfakeroot dpkg-buildpackage: info: source package gemmlowp dpkg-buildpackage: info: source version 0.0~git20211220.e844ffd-1 dpkg-buildpackage: info: source distribution unstable dpkg-source --before-build . dpkg-buildpackage: info: host architecture amd64 fakeroot debian/rules clean dh clean -Scmake debian/rules override_dh_auto_clean make[1]: Entering directory '/<>' rm -f CMakeLists.txt dh_auto_clean make[1]: Leaving directory '/<>' dh_clean -O-Scmake debian/rules build dh build -Scmake dh_update_autotools_config -O-Scmake dh_autoreconf -O-Scmake debian/rules override_dh_auto_configure make[1]: Entering directory '/<>' ln -s contrib/CMakeLists.txt . dh_auto_configure -- \ -DCMAKE_C_FLAGS="-g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2" \ -DCMAKE_CXX_FLAGS="-g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2" cd obj-x86_64-linux-gnu && cmake -DCMAKE_INSTALL_PREFIX=/usr -DCMAKE_BUILD_TYPE=None -DCMAKE_INSTALL_SYSCONFDIR=/etc -DCMAKE_INSTALL_LOCALSTATEDIR=/var -DCMAKE_EXPORT_NO_PACKAGE_REGISTRY=ON -DCMAKE_FIND_USE_PACKAGE_REGISTRY=OFF -DCMAKE_FIND_PACKAGE_NO_PACKAGE_REGISTRY=ON -DCMAKE_INSTALL_RUNSTATEDIR=/run -DCMAKE_SKIP_INSTALL_ALL_DEPENDENCY=ON "-GUnix Makefiles" -DCMAKE_VERBOSE_MAKEFILE=ON -DCMAKE_INSTALL_LIBDIR=lib/x86_64-linux-gnu "-DCMAKE_C_FLAGS=-g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2" "-DCMAKE_CXX_FLAGS=-g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2" .. -- The C compiler identification is GNU 11.3.0 -- The CXX compiler identification is GNU 11.3.0 -- Detecting C compiler ABI info -- Detecting C compiler ABI info - done -- Check for working C compiler: /usr/bin/cc - skipped -- Detecting C compile features -- Detecting C compile features - done -- Detecting CXX compiler ABI info -- Detecting CXX compiler ABI info - done -- Check for working CXX compiler: /usr/bin/c++ - skipped -- Detecting CXX compile features -- Detecting CXX compile features - done -- Configuring done -- Generating done CMake Warning: Manually-specified variables were not used by the project: CMAKE_EXPORT_NO_PACKAGE_REGISTRY CMAKE_FIND_PACKAGE_NO_PACKAGE_REGISTRY CMAKE_FIND_USE_PACKAGE_REGISTRY -- Build files have been written to: /<>/obj-x86_64-linux-gnu make[1]: Leaving directory '/<>' dh_auto_build -O-Scmake cd obj-x86_64-linux-gnu && make -j4 "INSTALL=install --strip-program=true" VERBOSE=1 make[1]: Entering directory '/<>/obj-x86_64-linux-gnu' /usr/bin/cmake -S"/<>" -B"/<>/obj-x86_64-linux-gnu" --check-build-system CMakeFiles/Makefile.cmake 0 /usr/bin/cmake -E cmake_progress_start "/<>/obj-x86_64-linux-gnu/CMakeFiles" "/<>/obj-x86_64-linux-gnu//CMakeFiles/progress.marks" make -f CMakeFiles/Makefile2 all make[2]: Entering directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/eight_bit_int_gemm.dir/build.make CMakeFiles/eight_bit_int_gemm.dir/depend make -f CMakeFiles/benchmark.dir/build.make CMakeFiles/benchmark.dir/depend make -f CMakeFiles/benchmark_all_sizes.dir/build.make CMakeFiles/benchmark_all_sizes.dir/depend make -f CMakeFiles/test_math_helpers.dir/build.make 
CMakeFiles/test_math_helpers.dir/depend make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/eight_bit_int_gemm.dir/DependInfo.cmake" --color= cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/benchmark_all_sizes.dir/DependInfo.cmake" --color= make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/benchmark.dir/DependInfo.cmake" --color= make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/test_math_helpers.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/eight_bit_int_gemm.dir/build.make CMakeFiles/eight_bit_int_gemm.dir/build make -f CMakeFiles/benchmark_all_sizes.dir/build.make CMakeFiles/benchmark_all_sizes.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/benchmark.dir/build.make CMakeFiles/benchmark.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/test_math_helpers.dir/build.make CMakeFiles/test_math_helpers.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' [ 5%] Building CXX 
object CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o -MF CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o.d -o CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o -c "/<>/eight_bit_int_gemm/eight_bit_int_gemm.cc" [ 11%] Building CXX object CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o [ 17%] Building CXX object CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -MF CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o.d -o CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -c "/<>/test/test_math_helpers.cc" /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -DBENCHMARK_8bit -DBENCHMARK_QUICK -std=gnu++11 -MD -MT CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -MF CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o.d -o CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -c "/<>/test/benchmark_all_sizes.cc" [ 23%] Building CXX object CMakeFiles/benchmark.dir/test/benchmark.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/benchmark.dir/test/benchmark.cc.o -MF CMakeFiles/benchmark.dir/test/benchmark.cc.o.d -o CMakeFiles/benchmark.dir/test/benchmark.cc.o -c "/<>/test/benchmark.cc" [ 29%] Linking CXX executable test_math_helpers /usr/bin/cmake -E cmake_link_script CMakeFiles/test_math_helpers.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/test_math_helpers.dir/test/test_math_helpers.cc.o -o test_math_helpers make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 29%] Built target test_math_helpers make -f CMakeFiles/test_blocking_counter.dir/build.make CMakeFiles/test_blocking_counter.dir/depend make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/test_blocking_counter.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/test_blocking_counter.dir/build.make CMakeFiles/test_blocking_counter.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' [ 35%] Building CXX object CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -MF CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o.d -o CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -c "/<>/test/test_blocking_counter.cc" [ 41%] Linking CXX executable test_blocking_counter /usr/bin/cmake -E cmake_link_script CMakeFiles/test_blocking_counter.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/test_blocking_counter.dir/test/test_blocking_counter.cc.o -o test_blocking_counter -lpthread make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 41%] Built target test_blocking_counter make -f CMakeFiles/test_allocator.dir/build.make CMakeFiles/test_allocator.dir/depend make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/test_allocator.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/test_allocator.dir/build.make CMakeFiles/test_allocator.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' [ 47%] Building CXX object CMakeFiles/test_allocator.dir/test/test_allocator.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -MF CMakeFiles/test_allocator.dir/test/test_allocator.cc.o.d -o CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -c "/<>/test/test_allocator.cc" [ 52%] Linking CXX executable benchmark /usr/bin/cmake -E cmake_link_script CMakeFiles/benchmark.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/benchmark.dir/test/benchmark.cc.o -o benchmark -lpthread [ 58%] Linking CXX executable benchmark_all_sizes /usr/bin/cmake -E cmake_link_script CMakeFiles/benchmark_all_sizes.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/benchmark_all_sizes.dir/test/benchmark_all_sizes.cc.o -o benchmark_all_sizes -lpthread [ 64%] Linking CXX executable test_allocator /usr/bin/cmake -E cmake_link_script CMakeFiles/test_allocator.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/test_allocator.dir/test/test_allocator.cc.o -o test_allocator make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 64%] Built target test_allocator make -f CMakeFiles/test_fixedpoint.dir/build.make CMakeFiles/test_fixedpoint.dir/depend make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/test_fixedpoint.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/test_fixedpoint.dir/build.make CMakeFiles/test_fixedpoint.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' [ 70%] Building CXX object CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -MF CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o.d -o CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -c "/<>/test/test_fixedpoint.cc" make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 70%] Built target benchmark make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 70%] Built target benchmark_all_sizes [ 76%] Linking CXX executable test_fixedpoint /usr/bin/cmake -E cmake_link_script CMakeFiles/test_fixedpoint.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/test_fixedpoint.dir/test/test_fixedpoint.cc.o -o test_fixedpoint [ 82%] Linking CXX static library libeight_bit_int_gemm.a /usr/bin/cmake -P CMakeFiles/eight_bit_int_gemm.dir/cmake_clean_target.cmake /usr/bin/cmake -E cmake_link_script CMakeFiles/eight_bit_int_gemm.dir/link.txt --verbose=1 /usr/bin/ar qc libeight_bit_int_gemm.a CMakeFiles/eight_bit_int_gemm.dir/eight_bit_int_gemm/eight_bit_int_gemm.cc.o /usr/bin/ranlib libeight_bit_int_gemm.a make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 82%] Built target eight_bit_int_gemm make -f CMakeFiles/test_gemmlowp.dir/build.make CMakeFiles/test_gemmlowp.dir/depend make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' cd "/<>/obj-x86_64-linux-gnu" && /usr/bin/cmake -E cmake_depends "Unix Makefiles" "/<>" "/<>" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu" "/<>/obj-x86_64-linux-gnu/CMakeFiles/test_gemmlowp.dir/DependInfo.cmake" --color= make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' make -f CMakeFiles/test_gemmlowp.dir/build.make CMakeFiles/test_gemmlowp.dir/build make[3]: Entering directory '/<>/obj-x86_64-linux-gnu' [ 88%] Building CXX object CMakeFiles/test_gemmlowp.dir/test/test.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. -flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_gemmlowp.dir/test/test.cc.o -MF CMakeFiles/test_gemmlowp.dir/test/test.cc.o.d -o CMakeFiles/test_gemmlowp.dir/test/test.cc.o -c "/<>/test/test.cc" [ 94%] Building CXX object CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -std=gnu++11 -MD -MT CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -MF CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o.d -o CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -c "/<>/test/test_data.cc" make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [ 94%] Built target test_fixedpoint /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 1>, gemmlowp::KernelSideFormat, 1> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 1>, gemmlowp::KernelSideFormat, 2> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated 
writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 4>, gemmlowp::KernelSideFormat, 5> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: 
‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 2>, gemmlowp::KernelSideFormat, 3> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 2>, gemmlowp::KernelSideFormat, 3> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), 
"MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 3>, gemmlowp::KernelSideFormat, 2> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk 
(__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 3>, gemmlowp::KernelSideFormat, 2> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 2>, gemmlowp::KernelSideFormat, 1> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from 
/usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, 1>, gemmlowp::KernelSideFormat, 1> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | 
__glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::SingleThreadGemmWrapper, gemmlowp::OperandRange<0, 255> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:123:59: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 230 [-Wformat-truncation=] 123 | snprintf(buf, sizeof(buf), "SingleThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 27 and 282 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, gemmlowp::OperandRange<0, 255> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from 
/usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::SingleThreadGemmWrapper, gemmlowp::OperandRange<0, 255> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:123:59: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 230 [-Wformat-truncation=] 123 | snprintf(buf, sizeof(buf), "SingleThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 27 and 282 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | 
~~~~~~~~~~~~~~~~~ /<>/test/test.cc: In function ‘gemmlowp::MultiThreadGemmWrapper, gemmlowp::OperandRange<0, 255> > >, unsigned char, gemmlowp::BitDepthParams, gemmlowp::OperandRange<0, 255> > >::Name()’: /<>/test/test.cc:163:58: warning: ‘%s’ directive output may be truncated writing up to 255 bytes into a region of size 231 [-Wformat-truncation=] 163 | snprintf(buf, sizeof(buf), "MultiThreadGemm, Kernel: %s", Kernel().Name()); | ^~ In file included from /usr/include/stdio.h:894, from /usr/include/c++/11/cstdio:42, from /usr/include/c++/11/ext/string_conversions.h:43, from /usr/include/c++/11/bits/basic_string.h:6608, from /usr/include/c++/11/string:55, from /usr/include/c++/11/bits/locale_classes.h:40, from /usr/include/c++/11/bits/ios_base.h:41, from /usr/include/c++/11/ios:42, from /usr/include/c++/11/ostream:38, from /usr/include/c++/11/iostream:39, from /<>/test/test.h:26, from /<>/test/test.cc:15: /usr/include/x86_64-linux-gnu/bits/stdio2.h:71:35: note: ‘__snprintf_chk’ output between 26 and 281 bytes into a destination of size 256 71 | return __builtin___snprintf_chk (__s, __n, __USE_FORTIFY_LEVEL - 1, | ~~~~~~~~~~~~~~~~~~~~~~~~~^~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 72 | __glibc_objsize (__s), __fmt, | ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 | __va_arg_pack ()); | ~~~~~~~~~~~~~~~~~ [100%] Linking CXX executable test_gemmlowp /usr/bin/cmake -E cmake_link_script CMakeFiles/test_gemmlowp.dir/link.txt --verbose=1 /usr/bin/c++ -g -O2 -ffile-prefix-map=/<>=. 
-flto=auto -ffat-lto-objects -flto=auto -ffat-lto-objects -fstack-protector-strong -Wformat -Werror=format-security -Wdate-time -D_FORTIFY_SOURCE=2 -Wl,-Bsymbolic-functions -flto=auto -ffat-lto-objects -flto=auto -Wl,-z,relro -Wl,-z,now -Wl,--as-needed CMakeFiles/test_gemmlowp.dir/test/test.cc.o CMakeFiles/test_gemmlowp.dir/test/test_data.cc.o -o test_gemmlowp libeight_bit_int_gemm.a -lpthread make[3]: Leaving directory '/<>/obj-x86_64-linux-gnu' [100%] Built target test_gemmlowp make[2]: Leaving directory '/<>/obj-x86_64-linux-gnu' /usr/bin/cmake -E cmake_progress_start "/<>/obj-x86_64-linux-gnu/CMakeFiles" 0 make[1]: Leaving directory '/<>/obj-x86_64-linux-gnu' dh_auto_test -O-Scmake cd obj-x86_64-linux-gnu && make -j4 test ARGS\+=--verbose ARGS\+=-j4 make[1]: Entering directory '/<>/obj-x86_64-linux-gnu' Running tests... /usr/bin/ctest --force-new-ctest-process --verbose -j4 UpdateCTestConfiguration from :/<>/obj-x86_64-linux-gnu/DartConfiguration.tcl Parse Config file:/<>/obj-x86_64-linux-gnu/DartConfiguration.tcl UpdateCTestConfiguration from :/<>/obj-x86_64-linux-gnu/DartConfiguration.tcl Parse Config file:/<>/obj-x86_64-linux-gnu/DartConfiguration.tcl Test project /<>/obj-x86_64-linux-gnu Constructing a list of tests Done constructing a list of tests Updating test list for fixtures Added 0 tests to meet fixture requirements Checking test dependency graph... Checking test dependency graph end test 1 Start 1: test_math_helpers 1: Test command: /<>/obj-x86_64-linux-gnu/test_math_helpers 1: Test timeout computed to be: 1500 test 2 Start 2: test_blocking_counter 2: Test command: /<>/obj-x86_64-linux-gnu/test_blocking_counter 2: Test timeout computed to be: 1500 test 3 Start 3: test_allocator 3: Test command: /<>/obj-x86_64-linux-gnu/test_allocator 3: Test timeout computed to be: 1500 test 4 Start 4: test_fixedpoint 4: Test command: /<>/obj-x86_64-linux-gnu/test_fixedpoint 4: Test timeout computed to be: 1500 1/5 Test #3: test_allocator ................... 
Passed 0.00 sec test 5 Start 5: test_gemmlowp 5: Test command: /<>/obj-x86_64-linux-gnu/test_gemmlowp 5: Test timeout computed to be: 1500 2/5 Test #1: test_math_helpers ................ Passed 0.01 sec 5: TestWithSmallData: PASS 5: number of matrix entries: 8 5: median value: 136 5: median unsigned diff: 0 (tolerating 0) 5: max unsigned diff: 0 (tolerating 0) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 
0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 
123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 
RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> 
ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 
10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 3/5 Test #2: test_blocking_counter ............ 
Passed 0.06 sec 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 4: PASS (Scalar int32) 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 4: PASS (Scalar int16) 4/5 Test #4: test_fixedpoint .................. Passed 0.18 sec 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: 
PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18
5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16
5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18
5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18
5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18
5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22
5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22
5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24
5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12
5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14
5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public
Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> 
ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor 
x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: 
PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor 
x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public 
Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, 
mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, 
shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 
57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor 
x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 
ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x 
ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x 
RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x 
ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 
cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 
10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 
16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 
1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, 
shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 
RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 
ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x1 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 
5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 
-75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 
1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x 
RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 
10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 
300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 
1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 1x2x1 RowMajor x 
ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 
1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, 
mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 
8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor 
-> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> 
ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 
RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 
64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 
5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, 
mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1000x1x1 RowMajor x ColMajor -> 
ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 
1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: 
PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x 
ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, 
shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 
ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor 
-> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> 
RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x 
RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 
RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 
16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: 
PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> 
ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, 
public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> 
ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x 
ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 
5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 
14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, 
mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor 
x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, 
public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 
-75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, 
shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 512x512x512 
RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor 
x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x 
RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor 
-> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 
300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x 
ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 
cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 
1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 
8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, 
mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 
0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, 
offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public 
Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 
1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x 
ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, 
offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 
ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 
5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor 
-> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, 
shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 
8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> 
ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> 
ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 
RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 
64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 
WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 
16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, 
shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 10 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 
RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 6 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 
1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 
256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: 
PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 
1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 
cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: 
PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x 
ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x 
ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x 
ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 
ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 
RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 
ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 
64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 
64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 
-75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), 
offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public 
Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, 
offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, 
public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x 
ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x 
ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 
8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: 
PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, 
mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: 
PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor 
-> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public 
Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 
0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, 
mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: 
PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor 
x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 
5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 
70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 
70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, 
mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 
RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x 
ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, 
SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 
cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 
1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 
16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 
WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 
1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 
cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 
4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor 
x RowMajor -> RowMajor, SingleThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 
1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, 
Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 
0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 12 5: PASS: 
7x3x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, 
Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x 
ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x16 WidthMajor, Rhs: 1 cells 16x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> 
ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x 
ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 
5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, 
shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: 
PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, 
public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 
14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> 
RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, public Gemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, 
shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: 
PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 
7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 
RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 
70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, 
offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> 
RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor 
x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> 
ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> 
ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 128x128x1 RowMajor x 
ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, 
offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x 
RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, 
shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, 
EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 
10/10/10, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, 
mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, 
shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 
10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: 
PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: 
PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 
8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, 
mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor 
-> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 
0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x 
RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, 
shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x 
ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 
-75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 
-75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: 
PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x 
RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, 
mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 
5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 
5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x1 
RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, 
shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x 
ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 
123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, 
EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 
RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x 
ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> 
ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 5x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> 
ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, 
shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 321x123x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 
70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, 
mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 300x400x1 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x1 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 
-75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 6 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 8 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 10 5: PASS: 2x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, 
shift 8 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x2x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 8 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 1x1x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, 
shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: 
PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 6x6x6 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 3x5x7 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: 
PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 12 5: PASS: 7x3x5 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 5x7x3 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 
ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 
RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x 
ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 10 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 14 5: PASS: 8x8x8 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 16x16x16 
RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x16x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 32x32x32 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, 
offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 
64x64x64 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, 
EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 
5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 64x64x64 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 16 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 128x128x128 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 16x17x16 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 12 5: PASS: 37x55x73 
RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 12 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 16 5: PASS: 37x55x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 57x87x117 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, 
EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 93x83x73 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 109x89x99 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 78x101x82 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 
-75/-91/74980, mult 123, shift 18 5: PASS: 512x512x512 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1024x1024x1024 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 567x2345x123 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 100x5000x100 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1x1000x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 1000x1x1000 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 16 5: PASS: 1000x1000x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 22 5: PASS: 777x3456x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 24 5: PASS: 4567x555x1 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 
5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 
RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, 
EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/0/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/10/0, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/10, mult 1, shift 14 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 0/0/0, mult 10, shift 16 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 10/10/10, mult 10, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 256/1/17, mult 4, shift 18 5: PASS: 70x90x110 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets 
-75/-91/74980, mult 123, shift 18 5: PASS: 300x400x500 ColMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> ColMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x ColMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 ColMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 300x400x500 RowMajor x RowMajor -> RowMajor, EightBitIntGemm, offsets -75/-91/74980, mult 123, shift 20 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 
DepthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 
cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x1 DepthMajor, Rhs: 1 cells 1x1 DepthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 4x2 DepthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, 
mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets 
-75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 4 cells 4x2 DepthMajor, Rhs: 5 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 
cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: 
reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 DepthMajor, Rhs: 3 cells 4x5 DepthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 
5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: 
PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 3x4 WidthMajor, Rhs: 3 cells 4x5 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 
cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 
2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 WidthMajor, Rhs: 2 cells 2x4 DepthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, 
MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> 
ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 3 cells 5x2 DepthMajor, Rhs: 2 cells 2x4 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 
-75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, 
shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 2 cells 8x8 Diagonal, Rhs: 1 cells 8x3 WidthMajor), offsets -75/-91/74980, mult 123, shift 24 5: PASS: 1x1x1 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 2x2x2 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 3x3x3 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 
cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 4x4x4 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 5x5x5 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 10/0/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/10/0, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/10, mult 1, shift 12 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 0/0/0, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 10/10/10, mult 10, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets 256/1/17, mult 4, shift 16 5: PASS: 50x50x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 18 5: PASS: 200x200x200 ColMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), 
offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x ColMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 ColMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 200x200x200 RowMajor x RowMajor -> RowMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 20 5: PASS: 50x5000x50 RowMajor x ColMajor -> ColMajor, MultiThreadGemm, Kernel: reference(Lhs: 1 cells 1x4 DepthMajor, Rhs: 1 cells 4x4 Diagonal), offsets -75/-91/74980, mult 123, shift 24 5: TestWithRealData: PASS with Lhs: 8 bit, Rhs: 8 bit 5: number of matrix entries: 49152 5: median value: 104 5: median unsigned diff: 0 (tolerating 0) 5: max unsigned diff: 0 (tolerating 0) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 
0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: TestWithRealData: PASS with (legacy, no longer requantizing) Lhs: 7 bit, Rhs: 5 bit 5: number of matrix entries: 49152 5: median value: 104 5: median unsigned diff: 0 (tolerating 2) 5: max unsigned diff: 0 (tolerating 10) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0.2) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: TestOutputStages: PASS with ResultOrder=RowMajor 5: TestOutputStages: PASS with ResultOrder=ColMajor 5: TestOutputStages: PASS with ResultOrder=RowMajor 5: TestOutputStages: PASS with ResultOrder=ColMajor 5: TestOutputStages: PASS with ResultOrder=RowMajor 5: TestOutputStages: PASS with ResultOrder=ColMajor 5: TestOutputStages: PASS with ResultOrder=RowMajor 5: TestOutputStages: PASS with ResultOrder=ColMajor 5: TestWithSmallDataPerChannelQuantization: PASS 5: number of matrix entries: 18 5: median value: 127 5: median unsigned diff: 0 (tolerating 0) 5: max unsigned diff: 0 (tolerating 0) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: TestWithLargeDataPerChannelQuantization: PASS 5: number 
of matrix entries: 550 5: median value: 7 5: median unsigned diff: 0 (tolerating 0) 5: max unsigned diff: 0 (tolerating 0) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: TestMultithreadedPerChannelQuantization: PASS 5: number of matrix entries: 1280 5: median value: 0 5: median unsigned diff: 0 (tolerating 0) 5: max unsigned diff: 0 (tolerating 0) 5: median signed diff: 0 (tolerating 0) 5: mean signed diff: 0 (tolerating 0) 5: No error: 100.00 % of entries 5: Error in 1..1 range: 0.00 % of entries 5: Error in 2..3 range: 0.00 % of entries 5: Error in 4..7 range: 0.00 % of entries 5: Error in 8..15 range: 0.00 % of entries 5: Error in 16..31 range: 0.00 % of entries 5: Error in 32..63 range: 0.00 % of entries 5: Error in 64..127 range: 0.00 % of entries 5: Error in 128..255 range: 0.00 % of entries 5: All tests passed. 5/5 Test #5: test_gemmlowp .................... 
Passed 106.24 sec 100% tests passed, 0 tests failed out of 5 Total Test time (real) = 106.25 sec make[1]: Leaving directory '/<>/obj-x86_64-linux-gnu' create-stamp debian/debhelper-build-stamp fakeroot debian/rules binary dh binary -Scmake dh_testroot -O-Scmake dh_prep -O-Scmake dh_auto_install --destdir=debian/libgemmlowp-dev/ -O-Scmake cd obj-x86_64-linux-gnu && make -j4 install DESTDIR=/<>/gemmlowp-0.0\~git20211220.e844ffd/debian/libgemmlowp-dev AM_UPDATE_INFO_DIR=no "INSTALL=install --strip-program=true" make[1]: Entering directory '/<>/obj-x86_64-linux-gnu' /usr/bin/cmake -S"/<>" -B"/<>/obj-x86_64-linux-gnu" --check-build-system CMakeFiles/Makefile.cmake 0 make -f CMakeFiles/Makefile2 preinstall make[2]: Entering directory '/<>/obj-x86_64-linux-gnu' make[2]: Nothing to be done for 'preinstall'. make[2]: Leaving directory '/<>/obj-x86_64-linux-gnu' Install the project... /usr/bin/cmake -P cmake_install.cmake -- Install configuration: "None" -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/eight_bit_int_gemm/eight_bit_int_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/base.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_common.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_multi_thread_gemv.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_operations_common.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/legacy_single_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_common.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/multi_thread_transform.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels.h -- Installing: 
/<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels_arm_32.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/quantized_mul_kernels_arm_64.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/single_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/single_thread_transform.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams_arm_32.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/streams_arm_64.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels_arm_32.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/meta/transform_kernels_arm_64.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/bit_depth.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/gemmlowp.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/map.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/public/output_stages.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/instrumentation.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/profiler.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/profiling/pthread_everywhere.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/allocator.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/block_params.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/common.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/compute.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/detect_platform.h -- Installing: 
/<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/dispatch_gemm_shape.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_avx.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_default.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_msa.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_neon.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_reference.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/kernel_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/multi_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_avx.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_msa.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_neon.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/output_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_avx.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_msa.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_neon.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/pack_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/platform.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_common_neon_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_msa.h -- Installing: 
/<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_neon.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/simd_wrappers_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/single_thread_gemm.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/internal/unpack.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_avx.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_msa.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_neon.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_sse.h -- Installing: /<>/debian/libgemmlowp-dev/usr/include/gemmlowp/fixedpoint/fixedpoint_wasmsimd.h -- Installing: /<>/debian/libgemmlowp-dev/usr/lib/x86_64-linux-gnu/libeight_bit_int_gemm.a -- Installing: /<>/debian/libgemmlowp-dev/usr/lib/x86_64-linux-gnu/cmake/gemmlowp/gemmlowp-config.cmake -- Installing: /<>/debian/libgemmlowp-dev/usr/lib/x86_64-linux-gnu/cmake/gemmlowp/gemmlowp-config-none.cmake make[1]: Leaving directory '/<>/obj-x86_64-linux-gnu' dh_install -O-Scmake debian/rules override_dh_installdocs make[1]: Entering directory '/<>' mkdir -p debian/libgemmlowp-dev/usr/share/doc/libgemmlowp-dev/meta/ install meta/README debian/libgemmlowp-dev/usr/share/doc/libgemmlowp-dev/meta/ dh_installdocs make[1]: Leaving directory '/<>' dh_installchangelogs -O-Scmake dh_installexamples -O-Scmake dh_installinit -O-Scmake dh_perl -O-Scmake dh_link -O-Scmake dh_strip_nondeterminism -O-Scmake dh_compress -O-Scmake dh_fixperms -O-Scmake dh_missing -O-Scmake dh_dwz -a -O-Scmake dh_strip -a -O-Scmake dh_makeshlibs -a -O-Scmake dh_shlibdeps -a -O-Scmake dh_installdeb -O-Scmake dh_gencontrol -O-Scmake dh_md5sums -O-Scmake dh_builddeb -O-Scmake INFO: pkgstriptranslations version 149 
pkgstriptranslations: processing libgemmlowp-dev (in debian/libgemmlowp-dev); do_strip: , oemstrip: pkgmaintainermangler: Maintainer field overridden to "Ubuntu Developers " pkgstripfiles: processing control file: debian/libgemmlowp-dev/DEBIAN/control, package libgemmlowp-dev, directory debian/libgemmlowp-dev pkgstripfiles: Truncating usr/share/doc/libgemmlowp-dev/changelog.Debian.gz to topmost ten records pkgstripfiles: Running PNG optimization (using 4 cpus) for package libgemmlowp-dev ... pkgstripfiles: No PNG files. dpkg-deb: building package 'libgemmlowp-dev' in '../libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb'. dpkg-genbuildinfo --build=binary -O../gemmlowp_0.0~git20211220.e844ffd-1_amd64.buildinfo dpkg-genchanges --build=binary -mLaunchpad Build Daemon -O../gemmlowp_0.0~git20211220.e844ffd-1_amd64.changes dpkg-genchanges: info: binary-only upload (no source code included) dpkg-source --after-build . dpkg-buildpackage: info: binary-only upload (no source included) -------------------------------------------------------------------------------- Build finished at 2022-06-24T11:20:11Z Finished -------- I: Built successfully +------------------------------------------------------------------------------+ | Changes | +------------------------------------------------------------------------------+ gemmlowp_0.0~git20211220.e844ffd-1_amd64.changes: ------------------------------------------------- Format: 1.8 Date: Thu, 23 Jun 2022 22:56:13 -0700 Source: gemmlowp Binary: libgemmlowp-dev Built-For-Profiles: noudeb Architecture: amd64 Version: 0.0~git20211220.e844ffd-1 Distribution: kinetic-proposed Urgency: medium Maintainer: Launchpad Build Daemon Changed-By: Mo Zhou Description: libgemmlowp-dev - small self-contained low-precision GEMM library Changes: gemmlowp (0.0~git20211220.e844ffd-1) unstable; urgency=medium . 
* New upstream version 0.0~git20211220.e844ffd Checksums-Sha1: 163cacabcb535fef70693dc62884a42dbf1dbc1b 6584 gemmlowp_0.0~git20211220.e844ffd-1_amd64.buildinfo 9125f3c9f8b3dbc8f7784e9e20817d7436a3c144 591308 libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb Checksums-Sha256: 7ff5c76aa0fdf843af57fc6bd8d93c80dcdd806e8ca166690fb4da75176d5c8a 6584 gemmlowp_0.0~git20211220.e844ffd-1_amd64.buildinfo b84d48e21995945bbe2c6a611ad1295d125e1d07ff387b6d83b66c868574e017 591308 libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb Files: 9160174d0a5a4214949f9e77d5de09d7 6584 science optional gemmlowp_0.0~git20211220.e844ffd-1_amd64.buildinfo 6ce347d390494111a75fb803be03fbdf 591308 libdevel optional libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb /<>/gemmlowp_0.0~git20211220.e844ffd-1_amd64.changes.new could not be renamed to /<>/gemmlowp_0.0~git20211220.e844ffd-1_amd64.changes: Illegal seek Distribution field may be wrong!!! +------------------------------------------------------------------------------+ | Buildinfo | +------------------------------------------------------------------------------+ Format: 1.0 Source: gemmlowp Binary: libgemmlowp-dev Architecture: amd64 Version: 0.0~git20211220.e844ffd-1 Checksums-Md5: 6ce347d390494111a75fb803be03fbdf 591308 libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb Checksums-Sha1: 9125f3c9f8b3dbc8f7784e9e20817d7436a3c144 591308 libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb Checksums-Sha256: b84d48e21995945bbe2c6a611ad1295d125e1d07ff387b6d83b66c868574e017 591308 libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb Build-Origin: Ubuntu Build-Architecture: amd64 Build-Date: Fri, 24 Jun 2022 11:20:11 +0000 Build-Path: /<> Build-Tainted-By: merged-usr-via-aliased-dirs usr-local-has-programs Installed-Build-Depends: autoconf (= 2.71-2), automake (= 1:1.16.5-1.3), autopoint (= 0.21-6), autotools-dev (= 20220109.1), base-files (= 12.2ubuntu1), base-passwd (= 3.5.52build1), bash (= 5.1-6ubuntu1), binutils (= 2.38.50.20220615-4ubuntu1), 
binutils-common (= 2.38.50.20220615-4ubuntu1), binutils-x86-64-linux-gnu (= 2.38.50.20220615-4ubuntu1), bsdextrautils (= 2.38-4ubuntu1), bsdutils (= 1:2.38-4ubuntu1), build-essential (= 12.9ubuntu3), bzip2 (= 1.0.8-5build1), cmake (= 3.23.2-1ubuntu1), cmake-data (= 3.23.2-1ubuntu1), coreutils (= 8.32-4.1ubuntu1), cpp (= 4:11.2.0-1ubuntu1), cpp-11 (= 11.3.0-3ubuntu1), dash (= 0.5.11+git20210903+057cd650a4ed-8ubuntu1), debconf (= 1.5.79ubuntu1), debhelper (= 13.7.1ubuntu1), debianutils (= 5.7-0.2), debugedit (= 1:5.0-4build1), dh-autoreconf (= 20), dh-elpa-helper (= 2.0.9ubuntu1), dh-strip-nondeterminism (= 1.13.0-1), diffutils (= 1:3.8-0ubuntu2), dpkg (= 1.21.8ubuntu1), dpkg-dev (= 1.21.8ubuntu1), dwz (= 0.14-1build2), emacsen-common (= 3.0.4), file (= 1:5.41-4), findutils (= 4.8.0-1ubuntu3), g++ (= 4:11.2.0-1ubuntu1), g++-11 (= 11.3.0-3ubuntu1), gcc (= 4:11.2.0-1ubuntu1), gcc-11 (= 11.3.0-3ubuntu1), gcc-11-base (= 11.3.0-3ubuntu1), gcc-12-base (= 12.1.0-4ubuntu1), gettext (= 0.21-6), gettext-base (= 0.21-6), grep (= 3.7-1build1), groff-base (= 1.22.4-8build1), gzip (= 1.10-4ubuntu4), hostname (= 3.23ubuntu2), init-system-helpers (= 1.63), intltool-debian (= 0.35.0+20060710.5), libacl1 (= 2.3.1-1), libarchive-zip-perl (= 1.68-1), libarchive13 (= 3.6.0-1ubuntu1), libasan6 (= 11.3.0-3ubuntu1), libatomic1 (= 12.1.0-4ubuntu1), libattr1 (= 1:2.5.1-1build1), libaudit-common (= 1:3.0.7-1build1), libaudit1 (= 1:3.0.7-1build1), libbinutils (= 2.38.50.20220615-4ubuntu1), libblkid1 (= 2.38-4ubuntu1), libbrotli1 (= 1.0.9-2build6), libbz2-1.0 (= 1.0.8-5build1), libc-bin (= 2.35-0ubuntu3), libc-dev-bin (= 2.35-0ubuntu3), libc6 (= 2.35-0ubuntu3), libc6-dev (= 2.35-0ubuntu3), libcap-ng0 (= 0.8.3-1), libcap2 (= 1:2.44-1build3), libcc1-0 (= 12.1.0-4ubuntu1), libcom-err2 (= 1.46.5-2ubuntu2), libcrypt-dev (= 1:4.4.27-1), libcrypt1 (= 1:4.4.27-1), libctf-nobfd0 (= 2.38.50.20220615-4ubuntu1), libctf0 (= 2.38.50.20220615-4ubuntu1), libcurl4 (= 7.83.1-2), libdb5.3 (= 5.3.28+dfsg1-0.9), 
libdebconfclient0 (= 0.261ubuntu1), libdebhelper-perl (= 13.7.1ubuntu1), libdpkg-perl (= 1.21.8ubuntu1), libdw1 (= 0.187-1), libelf1 (= 0.187-1), libexpat1 (= 2.4.8-1), libffi8 (= 3.4.2-4), libfile-stripnondeterminism-perl (= 1.13.0-1), libgcc-11-dev (= 11.3.0-3ubuntu1), libgcc-s1 (= 12.1.0-4ubuntu1), libgcrypt20 (= 1.10.1-2ubuntu1), libgdbm-compat4 (= 1.23-1), libgdbm6 (= 1.23-1), libgmp10 (= 2:6.2.1+dfsg-3ubuntu1), libgnutls30 (= 3.7.6-2ubuntu1), libgomp1 (= 12.1.0-4ubuntu1), libgpg-error0 (= 1.45-2), libgprofng0 (= 2.38.50.20220615-4ubuntu1), libgssapi-krb5-2 (= 1.19.2-2), libhogweed6 (= 3.7.3-1build2), libicu71 (= 71.1-3), libidn2-0 (= 2.3.2-2build1), libisl23 (= 0.24-2build1), libitm1 (= 12.1.0-4ubuntu1), libjsoncpp25 (= 1.9.5-4), libk5crypto3 (= 1.19.2-2), libkeyutils1 (= 1.6.1-3ubuntu1), libkrb5-3 (= 1.19.2-2), libkrb5support0 (= 1.19.2-2), libldap-2.5-0 (= 2.5.12+dfsg-2ubuntu1), liblsan0 (= 12.1.0-4ubuntu1), liblz4-1 (= 1.9.3-2build2), liblzma5 (= 5.2.5-2.1), libmagic-mgc (= 1:5.41-4), libmagic1 (= 1:5.41-4), libmount1 (= 2.38-4ubuntu1), libmpc3 (= 1.2.1-2build1), libmpfr6 (= 4.1.0-3build3), libncurses6 (= 6.3+20220423-2), libncursesw6 (= 6.3+20220423-2), libnettle8 (= 3.7.3-1build2), libnghttp2-14 (= 1.47.0-1), libnsl-dev (= 1.3.0-2build2), libnsl2 (= 1.3.0-2build2), libp11-kit0 (= 0.24.1-1), libpam-modules (= 1.4.0-13ubuntu1), libpam-modules-bin (= 1.4.0-13ubuntu1), libpam-runtime (= 1.4.0-13ubuntu1), libpam0g (= 1.4.0-13ubuntu1), libpcre2-8-0 (= 10.40-1), libpcre3 (= 2:8.39-14), libperl5.34 (= 5.34.0-3ubuntu1), libpipeline1 (= 1.5.6-1), libprocps8 (= 2:3.3.17-6ubuntu2), libpsl5 (= 0.21.0-1.2build2), libquadmath0 (= 12.1.0-4ubuntu1), librhash0 (= 1.4.2-1ubuntu1), librtmp1 (= 2.4+20151223.gitfa8646d.1-2build4), libsasl2-2 (= 2.1.28+dfsg-6ubuntu1), libsasl2-modules-db (= 2.1.28+dfsg-6ubuntu1), libseccomp2 (= 2.5.4-1ubuntu1), libselinux1 (= 3.4-1), libsigsegv2 (= 2.13-1ubuntu3), libsmartcols1 (= 2.38-4ubuntu1), libssh-4 (= 0.9.6-2build1), libssl3 (= 
3.0.3-5ubuntu3), libstdc++-11-dev (= 11.3.0-3ubuntu1), libstdc++6 (= 12.1.0-4ubuntu1), libsub-override-perl (= 0.09-3), libsystemd0 (= 249.11-0ubuntu4), libtasn1-6 (= 4.18.0-4build1), libtinfo6 (= 6.3+20220423-2), libtirpc-common (= 1.3.2-2build1), libtirpc-dev (= 1.3.2-2build1), libtirpc3 (= 1.3.2-2build1), libtool (= 2.4.7-4), libtsan0 (= 11.3.0-3ubuntu1), libubsan1 (= 12.1.0-4ubuntu1), libuchardet0 (= 0.0.7-1build2), libudev1 (= 249.11-0ubuntu4), libunistring2 (= 1.0-1), libuuid1 (= 2.38-4ubuntu1), libuv1 (= 1.44.1-2), libxml2 (= 2.9.14+dfsg-1), libzstd1 (= 1.5.2+dfsg-1), linux-libc-dev (= 5.15.0-35.36+22.10.1), login (= 1:4.11.1+dfsg1-2ubuntu1), lsb-base (= 11.1.0ubuntu4), lto-disabled-list (= 27), m4 (= 1.4.18-5ubuntu2), make (= 4.3-4.1build1), man-db (= 2.10.2-1), mawk (= 1.3.4.20200120-3.1), ncurses-base (= 6.3+20220423-2), ncurses-bin (= 6.3+20220423-2), patch (= 2.7.6-7build2), perl (= 5.34.0-3ubuntu1), perl-base (= 5.34.0-3ubuntu1), perl-modules-5.34 (= 5.34.0-3ubuntu1), po-debconf (= 1.0.21+nmu1), procps (= 2:3.3.17-6ubuntu2), rpcsvc-proto (= 1.4.2-0ubuntu6), sed (= 4.8-1ubuntu2), sensible-utils (= 0.0.17), sysvinit-utils (= 3.01-1ubuntu1), tar (= 1.34+dfsg-1build3), util-linux (= 2.38-4ubuntu1), util-linux-extra (= 2.38-4ubuntu1), xz-utils (= 5.2.5-2.1), zlib1g (= 1:1.2.11.dfsg-2ubuntu9) Environment: DEB_BUILD_OPTIONS="parallel=4" DEB_BUILD_PROFILES="noudeb" LANG="C.UTF-8" LC_ALL="C.UTF-8" SOURCE_DATE_EPOCH="1656050173" +------------------------------------------------------------------------------+ | Package contents | +------------------------------------------------------------------------------+ libgemmlowp-dev_0.0~git20211220.e844ffd-1_amd64.deb --------------------------------------------------- new Debian package, version 2.0. size 591308 bytes: control archive=4177 bytes. 
1136 bytes, 25 lines control 10117 bytes, 116 lines md5sums Package: libgemmlowp-dev Source: gemmlowp Version: 0.0~git20211220.e844ffd-1 Architecture: amd64 Maintainer: Ubuntu Developers Original-Maintainer: Debian Science Maintainers Installed-Size: 5929 Section: libdevel Priority: optional Multi-Arch: foreign Homepage: https://github.com/google/gemmlowp Description: small self-contained low-precision GEMM library This is not a full linear algebra library, only a GEMM library: it only does general matrix multiplication ("GEMM"). . Its performance goals differ from typical GEMM performance goals in the following ways: 1. It cares not only about speed, but also about minimizing power usage. It specifically cares about charge usage in mobile/embedded devices. 2. Most GEMMs are optimized primarily for large dense matrix sizes (>= 1000). It does care about large sizes, but it also cares specifically about the typically smaller matrix sizes encountered in various mobile applications. . Keep in mind (previous section) that gemmlowp itself is a pure-headers-only library. 
drwxr-xr-x root/root 0 2022-06-24 05:56 ./ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/eight_bit_int_gemm/ -rw-r--r-- root/root 3418 2021-12-20 17:33 ./usr/include/gemmlowp/eight_bit_int_gemm/eight_bit_int_gemm.h drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/ -rw-r--r-- root/root 35769 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint.h -rw-r--r-- root/root 11184 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_avx.h -rw-r--r-- root/root 12541 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_msa.h -rw-r--r-- root/root 9073 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_neon.h -rw-r--r-- root/root 11142 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_sse.h -rw-r--r-- root/root 11257 2021-12-20 17:33 ./usr/include/gemmlowp/fixedpoint/fixedpoint_wasmsimd.h drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/internal/ -rw-r--r-- root/root 6328 2021-12-20 17:33 ./usr/include/gemmlowp/internal/allocator.h -rw-r--r-- root/root 6768 2021-12-20 17:33 ./usr/include/gemmlowp/internal/block_params.h -rw-r--r-- root/root 6676 2021-12-20 17:33 ./usr/include/gemmlowp/internal/common.h -rw-r--r-- root/root 4299 2021-12-20 17:33 ./usr/include/gemmlowp/internal/compute.h -rw-r--r-- root/root 4996 2021-12-20 17:33 ./usr/include/gemmlowp/internal/detect_platform.h -rw-r--r-- root/root 8036 2021-12-20 17:33 ./usr/include/gemmlowp/internal/dispatch_gemm_shape.h -rw-r--r-- root/root 9218 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel.h -rw-r--r-- root/root 19165 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_avx.h -rw-r--r-- root/root 4847 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_default.h -rw-r--r-- root/root 23748 2021-12-20 17:33 
./usr/include/gemmlowp/internal/kernel_msa.h -rw-r--r-- root/root 75739 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_neon.h -rw-r--r-- root/root 4837 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_reference.h -rw-r--r-- root/root 18968 2021-12-20 17:33 ./usr/include/gemmlowp/internal/kernel_sse.h -rw-r--r-- root/root 28223 2021-12-20 17:33 ./usr/include/gemmlowp/internal/multi_thread_gemm.h -rw-r--r-- root/root 22604 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output.h -rw-r--r-- root/root 763 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_avx.h -rw-r--r-- root/root 44686 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_msa.h -rw-r--r-- root/root 35994 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_neon.h -rw-r--r-- root/root 20105 2021-12-20 17:33 ./usr/include/gemmlowp/internal/output_sse.h -rw-r--r-- root/root 17975 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack.h -rw-r--r-- root/root 11519 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_avx.h -rw-r--r-- root/root 18939 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_msa.h -rw-r--r-- root/root 15055 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_neon.h -rw-r--r-- root/root 4972 2021-12-20 17:33 ./usr/include/gemmlowp/internal/pack_sse.h -rw-r--r-- root/root 2977 2021-12-20 17:33 ./usr/include/gemmlowp/internal/platform.h -rw-r--r-- root/root 25588 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers.h -rw-r--r-- root/root 31387 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_common_neon_sse.h -rw-r--r-- root/root 5642 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_msa.h -rw-r--r-- root/root 19188 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_neon.h -rw-r--r-- root/root 4257 2021-12-20 17:33 ./usr/include/gemmlowp/internal/simd_wrappers_sse.h -rw-r--r-- root/root 5586 2021-12-20 17:33 ./usr/include/gemmlowp/internal/single_thread_gemm.h -rw-r--r-- root/root 12594 2021-12-20 17:33 
./usr/include/gemmlowp/internal/unpack.h drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/include/gemmlowp/meta/ -rw-r--r-- root/root 3960 2021-12-20 17:33 ./usr/include/gemmlowp/meta/base.h -rw-r--r-- root/root 5384 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_common.h -rw-r--r-- root/root 11396 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_gemm.h -rw-r--r-- root/root 6992 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_multi_thread_gemv.h -rw-r--r-- root/root 1850 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_operations_common.h -rw-r--r-- root/root 9600 2021-12-20 17:33 ./usr/include/gemmlowp/meta/legacy_single_thread_gemm.h -rw-r--r-- root/root 1593 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_common.h -rw-r--r-- root/root 5253 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_gemm.h -rw-r--r-- root/root 3519 2021-12-20 17:33 ./usr/include/gemmlowp/meta/multi_thread_transform.h -rw-r--r-- root/root 5759 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels.h -rw-r--r-- root/root 131368 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels_arm_32.h -rw-r--r-- root/root 130137 2021-12-20 17:33 ./usr/include/gemmlowp/meta/quantized_mul_kernels_arm_64.h -rw-r--r-- root/root 25668 2021-12-20 17:33 ./usr/include/gemmlowp/meta/single_thread_gemm.h -rw-r--r-- root/root 2957 2021-12-20 17:33 ./usr/include/gemmlowp/meta/single_thread_transform.h -rw-r--r-- root/root 11049 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams.h -rw-r--r-- root/root 390785 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams_arm_32.h -rw-r--r-- root/root 410715 2021-12-20 17:33 ./usr/include/gemmlowp/meta/streams_arm_64.h -rw-r--r-- root/root 7317 2021-12-20 17:33 ./usr/include/gemmlowp/meta/transform_kernels.h -rw-r--r-- root/root 247365 2021-12-20 17:33 ./usr/include/gemmlowp/meta/transform_kernels_arm_32.h -rw-r--r-- root/root 260838 2021-12-20 17:33 
./usr/include/gemmlowp/meta/transform_kernels_arm_64.h drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/ -rw-r--r-- root/root 6543 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/instrumentation.h -rw-r--r-- root/root 11853 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/profiler.h -rw-r--r-- root/root 3313 2021-12-20 17:33 ./usr/include/gemmlowp/profiling/pthread_everywhere.h drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/include/gemmlowp/public/ -rw-r--r-- root/root 2616 2021-12-20 17:33 ./usr/include/gemmlowp/public/bit_depth.h -rw-r--r-- root/root 4314 2021-12-20 17:33 ./usr/include/gemmlowp/public/gemmlowp.h -rw-r--r-- root/root 4422 2021-12-20 17:33 ./usr/include/gemmlowp/public/map.h -rw-r--r-- root/root 11283 2021-12-20 17:33 ./usr/include/gemmlowp/public/output_stages.h drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/cmake/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/cmake/gemmlowp/ -rw-r--r-- root/root 944 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/cmake/gemmlowp/gemmlowp-config-none.cmake -rw-r--r-- root/root 4181 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/cmake/gemmlowp/gemmlowp-config.cmake -rw-r--r-- root/root 1065980 2022-06-24 05:56 ./usr/lib/x86_64-linux-gnu/libeight_bit_int_gemm.a drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/ drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/ -rw-r--r-- root/root 390 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/AUTHORS -rw-r--r-- root/root 1977 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/CONTRIBUTING -rw-r--r-- root/root 1208 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/CONTRIBUTORS -rw-r--r-- root/root 3774 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/README.md.gz -rw-r--r-- root/root 988 2022-06-24 05:56 
./usr/share/doc/libgemmlowp-dev/changelog.Debian.gz
-rw-r--r-- root/root 1864 2022-06-24 05:51 ./usr/share/doc/libgemmlowp-dev/copyright
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/doc/
-rw-r--r-- root/root 2479 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/design.md.gz
-rw-r--r-- root/root 2399 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/kernel.md.gz
-rw-r--r-- root/root 5820 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/less-than-8-bit.md.gz
-rw-r--r-- root/root 3106 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/low-precision.md.gz
-rw-r--r-- root/root 2022 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/output.md
-rw-r--r-- root/root 3499 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/packing.md.gz
-rw-r--r-- root/root 2650 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/public.md.gz
-rw-r--r-- root/root 4673 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/quantization.md.gz
-rw-r--r-- root/root 4572 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/doc/quantization_example.cc.gz
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/examples/
-rw-r--r-- root/root 15511 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/eight_bit_int_gemm.cc
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/
-rw-r--r-- root/root 12806 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark.cc
-rw-r--r-- root/root 11382 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark_all_sizes.cc
-rw-r--r-- root/root 10839 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/benchmark_meta_gemm.cc
-rw-r--r-- root/root 12064 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/correctness_meta_gemm.cc
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test.xcodeproj/
-rw-r--r-- root/root 29285 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test.xcodeproj/project.pbxproj
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/
-rw-r--r-- root/root 279 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/AppDelegate.h
-rw-r--r-- root/root 2149 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/AppDelegate.mm
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/
-rw-r--r-- root/root 3708 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/LaunchScreen.xib
-rw-r--r-- root/root 1575 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Base.lproj/Main.storyboard
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/
drwxr-xr-x root/root 0 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/AppIcon.appiconset/
-rw-r--r-- root/root 1077 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Images.xcassets/AppIcon.appiconset/Contents.json
-rw-r--r-- root/root 1511 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/Info.plist
-rw-r--r-- root/root 219 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/ViewController.h
-rw-r--r-- root/root 492 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/ViewController.m
-rw-r--r-- root/root 334 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/ios/gemmlowp_test/main.m
-rw-r--r-- root/root 75539 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test.cc
-rw-r--r-- root/root 4448 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test.h
-rw-r--r-- root/root 2115 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_allocator.cc
-rw-r--r-- root/root 4408 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_blocking_counter.cc
-rw-r--r-- root/root 2296359 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_data.cc
-rw-r--r-- root/root 1288 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_data.h
-rw-r--r-- root/root 22285 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_fixedpoint.cc
-rw-r--r-- root/root 4144 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/examples/test/test_math_helpers.cc
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/meta/
-rw-r--r-- root/root 3657 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/meta/README
drwxr-xr-x root/root 0 2022-06-24 05:56 ./usr/share/doc/libgemmlowp-dev/todo/
-rw-r--r-- root/root 1605 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/armv8-64bit-kernel-for-less-than-8-bit.txt
-rw-r--r-- root/root 3277 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/error-diffusion-experiments.txt
-rw-r--r-- root/root 6232 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/fast-gemv.txt.gz
-rw-r--r-- root/root 962 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/less-than-8-bit-without-requantization.txt
-rw-r--r-- root/root 2338 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/multi-threading-experiments.txt.gz
-rw-r--r-- root/root 1114 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/neon-depth-major-sources-packing.txt
-rw-r--r-- root/root 802 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/remove-default-template-param-values.txt
-rw-r--r-- root/root 1666 2021-12-20 17:33 ./usr/share/doc/libgemmlowp-dev/todo/x86-kernels.txt

+------------------------------------------------------------------------------+
| Post Build                                                                   |
+------------------------------------------------------------------------------+

+------------------------------------------------------------------------------+
| Cleanup                                                                      |
+------------------------------------------------------------------------------+

Purging /<>
Not removing build depends: as requested

+------------------------------------------------------------------------------+
| Summary                                                                      |
+------------------------------------------------------------------------------+

Build Architecture: amd64
Build Type: binary
Build-Space: 138252
Build-Time: 204
Distribution: kinetic-proposed
Host Architecture: amd64
Install-Time: 7
Job: gemmlowp_0.0~git20211220.e844ffd-1.dsc
Machine Architecture: amd64
Package: gemmlowp
Package-Time: 212
Source-Version: 0.0~git20211220.e844ffd-1
Space: 138252
Status: successful
Version: 0.0~git20211220.e844ffd-1
--------------------------------------------------------------------------------
Finished at 2022-06-24T11:20:11Z
Build needed 00:03:32, 138252k disk space
Adding user buildd to group lxd
RUN: /usr/share/launchpad-buildd/bin/in-target scan-for-processes --backend=chroot --series=kinetic --arch=amd64 PACKAGEBUILD-24109081
Scanning for processes to kill in build PACKAGEBUILD-24109081