[ICX]ICL/ICX Fast Short Rep Movs

Bug #1883630 reported by quanxian
This bug affects 1 person
Affects   Status         Importance   Assigned to   Milestone
intel     Fix Released   Undecided    Unassigned
Ubuntu    New            Undecided    Unassigned

Bug Description

[Feature Description]
Starting with Ice Lake client and server (ICL/ICX), the "rep movsb" copy instruction is faster for short strings.

From the optimization guide:

3.7.6.1 Fast Short REP MOVSB
Beginning with processors based on Ice Lake Client microarchitecture,
REP MOVSB performance of short operations is enhanced. The enhancement
applies to string lengths between 1 and 128 bytes long. Support for
fast-short REP MOVSB is enumerated by the CPUID feature flag: CPUID
(EAX=7H, ECX=0H).EDX.FAST_SHORT_REP_MOVSB[bit 4] = 1. There is no change
in the REP STOS performance.
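For reference, the enumeration quoted above can be checked from user space. This is only a minimal sketch: it assumes GCC or Clang (for <cpuid.h> and __get_cpuid_count), and the helper names fsrm_bit_set and cpu_has_fsrm are made up for illustration; they are not kernel or libc APIs.

```c
#include <stdbool.h>
#if defined(__x86_64__) || defined(__i386__)
#include <cpuid.h>   /* GCC/Clang helper header, assumed available */
#endif

/* Hypothetical helper: test bit 4 of the EDX value returned by
 * CPUID leaf 7, subleaf 0 (the FSRM bit per the quoted guide text). */
static bool fsrm_bit_set(unsigned int edx)
{
    return ((edx >> 4) & 1u) != 0;
}

/* Hypothetical helper: query CPUID (EAX=7H, ECX=0H) and report FSRM.
 * Returns false on non-x86 builds or when leaf 7 is not supported. */
static bool cpu_has_fsrm(void)
{
#if defined(__x86_64__) || defined(__i386__)
    unsigned int eax = 0, ebx = 0, ecx = 0, edx = 0;

    /* __get_cpuid_count returns 0 if the requested leaf is unsupported */
    if (!__get_cpuid_count(7, 0, &eax, &ebx, &ecx, &edx))
        return false;
    return fsrm_bit_set(edx);
#else
    return false;
#endif
}
```

Calling cpu_has_fsrm() once at startup and caching the result would probably be enough, since the bit does not change at run time.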

Linux could check for this CPUID bit and make the flag "fsrm" appear
in /proc/cpuinfo flags.

#define X86_FEATURE_FSRM (18*32+ 4) /* Fast short rep movsb */
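Once a kernel exposes the flag, user space could look for "fsrm" in /proc/cpuinfo. The sketch below is illustrative only: flags_line_has and cpuinfo_has_flag are hypothetical names, and it assumes each "flags" line fits in the 8 KiB read buffer.

```c
#include <stdbool.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical helper: true if `flag` appears in `line` as a whole
 * whitespace-delimited token, so "fsrm" does not match "xfsrm". */
static bool flags_line_has(const char *line, const char *flag)
{
    size_t n = strlen(flag);
    const char *p = line;

    if (n == 0)
        return false;
    while ((p = strstr(p, flag)) != NULL) {
        bool start_ok = (p == line) || p[-1] == ' ' || p[-1] == '\t';
        bool end_ok = p[n] == '\0' || p[n] == ' ' ||
                      p[n] == '\t' || p[n] == '\n';
        if (start_ok && end_ok)
            return true;
        p++;   /* keep scanning after a partial match */
    }
    return false;
}

/* Hypothetical helper: 1 if any "flags" line in /proc/cpuinfo lists
 * `flag`, 0 if none does, -1 if the file cannot be opened.
 * Assumption: a flags line is shorter than the buffer below. */
static int cpuinfo_has_flag(const char *flag)
{
    char line[8192];
    int found = 0;
    FILE *f = fopen("/proc/cpuinfo", "r");

    if (f == NULL)
        return -1;
    while (fgets(line, sizeof line, f) != NULL) {
        if (strncmp(line, "flags", 5) == 0 && flags_line_has(line, flag)) {
            found = 1;
            break;
        }
    }
    fclose(f);
    return found;
}
```

For example, cpuinfo_has_flag("fsrm") should return 1 on an Ice Lake system running a kernel with the feature bit defined, and 0 on older kernels or CPUs.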

At some future point, copy functions could check for this feature and use an alternate copy loop for short strings on CPUs that do NOT have FSRM (benchmarking is needed to see whether this actually helps).
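To make the idea concrete, here is a rough user-space sketch of such a dispatch. It is not the kernel's implementation: copy_sketch, rep_movsb and SHORT_COPY_MAX are hypothetical names, the 128-byte cutoff is simply the upper bound quoted from the optimization guide, and the plain byte loop stands in for whatever alternate loop benchmarking might favor.

```c
#include <stdbool.h>
#include <stddef.h>
#include <string.h>

#define SHORT_COPY_MAX 128   /* assumed cutoff, taken from the quoted guide */

/* Copy n bytes with "rep movsb" on x86 (GCC/Clang inline asm);
 * falls back to memcpy on other architectures. */
static inline void rep_movsb(void *dst, const void *src, size_t n)
{
#if defined(__x86_64__) || defined(__i386__)
    __asm__ volatile("rep movsb"
                     : "+D"(dst), "+S"(src), "+c"(n)
                     :
                     : "memory");
#else
    memcpy(dst, src, n);
#endif
}

/* Hypothetical dispatch: without FSRM, short copies use a simple byte
 * loop; everything else goes through rep movsb. The caller passes in
 * has_fsrm (for example, from a CPUID check done once at startup). */
static void *copy_sketch(void *dst, const void *src, size_t n, bool has_fsrm)
{
    if (!has_fsrm && n <= SHORT_COPY_MAX) {
        unsigned char *d = dst;
        const unsigned char *s = src;

        while (n--)
            *d++ = *s++;
        return dst;
    }
    rep_movsb(dst, src, n);
    return dst;
}
```

Whether the byte loop beats "rep movsb" on pre-FSRM CPUs is exactly the open question above, so a sketch like this would mainly be useful as a benchmarking harness.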

Target Kernel: 5.6
Target Release: 20.10

[HW/SW Information]
IceLake Server

[Business Justification]
performance

quanxian (quanxian-wang) wrote :

f444a5ff95dce07cf4353cbb85fc3e785019d430 v5.6

quanxian (quanxian-wang)
information type: Public → Private
quanxian (quanxian-wang)
Changed in intel:
status: New → Fix Committed
information type: Private → Public
quanxian (quanxian-wang)
Changed in intel:
status: Fix Committed → Fix Released