SBCL

Float truncate generates redundant machine code

Bug #1189568 reported by Lutz Euler on 2013-06-10

This bug affects 1 person

Affects		Status	Importance	Assigned to	Milestone
	SBCL	Fix Released	Undecided	Unassigned

Bug Description

In SBCL 1.1.8.53, under x86-64, the following function

(defun f (x)
  (declare (optimize speed)
           (type (double-float 1.0d0 4.0d0) x))
  (nth-value 0 (truncate x)))

compiles to:

CVTTSD2SI RCX, XMM0 ; no-arg-parsing entry point
SHL RCX, 1
MOV RAX, RCX
SAR RAX, 1
XORPD XMM0, XMM0
CVTSI2SD XMM0, RAX
MOV RDX, RCX
MOV RSP, RBP
CLC
POP RBP
RET

The integer result of the truncation is redundantly converted to a double float in XMM0.

In 1.0.27.46 (just some old version I happen to have readily compiled) this worked better:

CVTTSD2SI RDX, XMM0 ; no-arg-parsing entry point
SHL RDX, 3
MOV RSP, RBP
CLC
POP RBP
RET

In some SBCL versions inbetween these two a comparable function accidentally used generic arithmetic (see bug #489388). This was fixed in 1.0.34.11 (and a few following commits) and since then this redundant code is generated.

Could the redundant instructions be optimized away?

Revision history for this message

Stas Boukarev (stassats) wrote on 2013-06-10:

(values (truncate (the double-float x))) also generates suboptimal code, because SB-KERNEL:%DOUBLE-FLOAT is not marked as flushable (because it can signal errors) and the transform tries to be smart, but fails.

Revision history for this message

Paul Khuong (pvk) wrote on 2013-06-10: Re: [Bug 1189568] [NEW] Float truncate generates redundant machine code

There's a frankly dodgy looking test to determine if the call to
TRUNCATE is in a single-value context; I don't think the test can ever
fire in safe code. The test could be improved, locally, e.g. by looking
at the receiving lvar's dest node. However, I'm pretty sure the right
thing is to find a way to enable dead code elimination for (some) FP
arithmetic inserted by transforms.

Another optimisation policy, and a new fndb attribute, a tweak to
flushable-combination-p, and then we can mark a lot of FP functions as
transformly-flushable.

Stas Boukarev (stassats) on 2023-08-08

Changed in sbcl:
status:	New → Fix Released

Report a bug

This report contains Public information

Everyone can see this information.

You are

Subscribing...

Edit bug mail

Other bug subscribers

Remote bug watches

Bug watches keep track of this bug in other bug trackers.