[LLVMdev] NEON vector instructions and the fast math IR flags
    David Tweed 
    david.tweed at arm.com
       
    Fri Jun  7 01:01:12 PDT 2013
    
    
  
>> Darwin uses NEON for floating point, but does *not* (and should not).
>> globally enable fast math flags.  Use of NEON for FP needs to remain
>> achievable without globally setting the fast math flags.  Fast math may
>> imply reasonably imply NEON, but the opposite direction is not accurate.
| Good point. Fast math is probably a too tough requirement. I need to
| look into what are the ways NEON does not comply with IEEE 754. For now
| the only difference I see is that it may round denormals to zero.
Yes, I've gone on record before as saying that fast-math enables far too
many
different things for it to be "the canonical switch" for just about any
transformation.
Rather, it should be what I think it is in gcc which is an effectively a
short-cut
for invoking of several individual math-option flags.
[snip]
|I just looked again at the +neonfp flag. Compiling with and without 
|+neonfp flag seems to only affect scalar types in the attached test 
|case. If e.g. the LLVM vectorizer introduces vector instructions on 
|LLVM-IR level floating point vectors still yield NEON assembly even if 
|compiled with "-mattr=+neon,-neonfp". Is this expected?
I'm virtually certain that's a problem since there are codebases out there
which use that to effectively specify "integer neon but use VFP for floats".
If the vectorizer is producing neon floating point from scalar code
in the presence of that flag then it's a (minor) issue waiting to happen. 
Cheers,
Dave
    
    
More information about the llvm-dev
mailing list