[PATCH] Restore the sqrt -> llvm.sqrt mapping in fast-math mode

Hal Finkel hfinkel at anl.gov
Thu Sep 12 15:55:41 PDT 2013


----- Original Message -----
> 
> On Thu, Sep 12, 2013 at 2:44 PM, Hal Finkel < hfinkel at anl.gov >
> wrote:
> 
> 
> 
> 
> 
> ----- Original Message -----
> > 
> > On Thu, Sep 12, 2013 at 2:03 PM, Hal Finkel < hfinkel at anl.gov >
> > wrote:
> > 
> > 
> > 
> > 
> > Hello,
> > 
> > Please review the attached patch which restores the libm sqrt* ->
> > @llvm.sqrt* mapping, but only in fast-math mode (specifically, when
> > the UnsafeFPMath or NoNaNsFPMath CodeGen options are enabled). The
> > @llvm.sqrt* intrinsics have slightly different semantics from the
> > libm call, specifically, they are undefined when given a non-zero
> > negative number (the libm calls will always return NaN for any
> > negative number).
> > 
> > This mapping was removed in r100613, and replaced with a TODO, but
> > at
> > that time the fast-math flags were not yet implemented. Now that we
> > have these, restoring this mapping is important because it will
> > enable autovectorization of sqrt calls in loops (at least in
> > fast-math mode).
> > 
> > 
> > 
> > 
> > This is dangerous, if LangRef is actually correct. People don't
> > associate -ffast-math with "my program will crash at random". :) Of
> > course, LangRef is probably overstating the issue.
> 
> I agree, and the LangRef does indeed say "undefined behavior", but I
> assume that should really mean, "returns an undefined value." Do you
> agree?
> 
> 
> 
> Well, if we map llvm.sqrt to sqrt and sqrt sets errno, we really do
> mean "undefined behavior"... or at least something more that
> "returns an undefined value".

Agreed, but this possibility-of-setting-errno problem exists for all LLVM libm-style intrinsics, and so also exists for pow() [for which we currently do this exact kind of replacement whenever -fmath-errno=0]. So, FWIW, this is not without precedent.

> 
> 
> 
> 
> > 
> > 
> > That said, there's actually a general issue here: if we map the
> > LLVM
> > intrinsics to libc functions, and the libc functions set errno, we
> > could break code that depends on errno for non-math calls (e.g.
> > fopen().)
> 
> Perhaps, but I'm not changing that here. For one thing, if the
> mapping does, in effect, sqrt -> llvm.sqrt -> sqrt, and only when
> -fmath-errno=0. Are you worried about cases where the libm functions
> actually do set errno (even though we have -fmath-errno=0)?
> 
> 
> 
> 
> I'm not sure our implementation of -fno-math-errno is safe: according
> to the gcc manual, it isn't equivalent to marking the math functions
> with attribute((const)). (For example, the gcc manual's definition
> allows transforming a call to sqrt() into the SSE sqrt instruction,
> but it doesn't allow hoisting a call to sqrt out of arbitrary loops
> on a machine where the sqrt() call could set errno.)

I'm sure it is not safe, for the very reason that you highlight. Nevertheless, this is a long-standing problem, affecting the implementation of -fmath-errno=0 on all systems for which libm math functions actually do set errno, and will require a general solution (fairly orthogonal to this patch).

I recommend that we:

 1. Commit this change (so that we can autovectorize calls to sqrt()).
 2. Have a discussion about how to actually solve this problem: I think that it involves making a specific function attribute for setting errno, and teaching the alias analysis infrastructure to do something sensible with it.

 -Hal

> 
> 
> -Eli

-- 
Hal Finkel
Assistant Computational Scientist
Leadership Computing Facility
Argonne National Laboratory



More information about the cfe-commits mailing list