[llvm-commits] [llvm-gcc-4.2] r82640 - /llvm-gcc-4.2/trunk/gcc/llvm-convert.cpp

Dale Johannesen dalej at apple.com
Fri Sep 25 14:02:07 PDT 2009


I haven't tried to touch clang.  I think the rest of it is done.
(Should probably mention sqrt, and other math intrinsics, are readonly  
not readnone because of:
// These functions do not actually read memory, but they are sensitive  
to the
// rounding mode.  This needs to be modelled separately; in the meantime
// declaring them as reading memory is conservatively correct.)

On Sep 23, 2009, at 11:04 PM, Chris Lattner wrote:

> On Sep 23, 2009, at 6:10 PM, Evan Cheng wrote:
> On Sep 23, 2009, at 3:21 PM, Dale Johannesen wrote:
>>> On Sep 23, 2009, at 3:18 PMPDT, Evan Cheng wrote:
>>>
>>>> Are we generating a call to sqrt now? If so, that's bad. We should
>>>> be using SSE sqrts* instructions.
>>>
>>> Agreed.  My proposed semantic change would fix that.
>>
>> You mean change to llvm.sqrt and then llvm-gcc can switch bad to
>> generating the intrinsic? It seems like the current fix is not what  
>> we
>> want. Perhaps we should revert it first?
>>
>> According to Chris, the semantics of sqrt of negative value is  
>> defined
>> and this is just some optimization bug.
>
> Sorry, I'm just catching up on this now.  There are a couple of  
> things that confuse the issue, but I'll just try to keep it "to the  
> point" instead of rambling about history.
>
> 1. I did tell Evan that sqrt is defined on negative number, but I  
> misunderstood and didn't think about llvm.sqrt.  Please disregard my  
> comment Evan.  llvm.sqrt should be undefined on negative numbers as  
> langref says, and llvm-gcc/clang should only transform sqrt to  
> llvm.sqrt if the appropriate "I don't care about fp semantics" flag  
> is set.
>
> 2. Dale's patch to llvm-gcc is ok, but it would be better to still  
> do the transformation when -ffast-math is specified or whatever the  
> more precise "nan's aren't generated" flag is.  We should do the  
> same thing for clang as well.
>
> 3. Please make sure that llvm-gcc/clang on the mac (and other  
> targets with -fno-math-errno) are producing a call to sqrt that is  
> marked as readnone.  Given this, the mid-level optimizer should  
> hoist and cse the calls to sqrt just as well as it did calls to  
> llvm.sqrt.
>
> 4. The constant folding of llvm.sqrt(-123) -> 0 is ok because the  
> intrinsic really is undefined on negative.  The constant folding of  
> sqrt(-123) doesn't fold if the input is negative, so it will just  
> not optimize the curious case.
>
> 5. Please make the X86 backend compile calls to readonly/readnone  
> "sqrt" produce a sqrtsd (etc) instruction.  We really don't want to  
> get a function call on the mac (or other x86 target with -fno-math- 
> errno).  Like malloc, if someone cares about -fno-builtin-sqrt, they  
> can solve the general problem.  We already constant fold "real sqrt"  
> calls in Analysis/ConstantFolding.cpp anyway.
>
> 6. Please add handling of "real sqrt" to llvm::CannotBeNegativeZero  
> to match the handling of llvm.sqrt.
>
> Thanks!
>
> -Chris




More information about the llvm-commits mailing list