[llvm-dev] how to simplify FP ops with an undef operand?
Nuno Lopes via llvm-dev
llvm-dev at lists.llvm.org
Thu Mar 1 02:08:02 PST 2018
We can do "add %x, undef" => "undef" because for any value of %x, we can
always find a value that when added to %x produces any value in the domain
of integers.
This is not the case with floats since with some inputs, e.g., NaNs, we are
not able to produce some values in the domain (e.g., there's no value of %x
that makes "fadd NaN, %x" return 42.0).
In summary, since there's no NaN constant in the IR, all we can do is to
canonicalize to some instruction that yields NaN (and treat that as a NaN
constant throughout the pipeline, I guess).
Nuno
P.S.: With poison you can always fold operations with a poison input to
poison (e.g., "fadd %x, poison" => poison).
-----Original Message-----
From: Kaylor, Andrew
Sent: Wednesday, February 28, 2018 11:29 PM
Subject: RE: how to simplify FP ops with an undef operand?
For the first part of Sanjay’s question, I think the answer is, “Yes, we can
fold all of these to NaN in the general case.” For the second part, which
the nnan FMF is present, I’m not sure. The particulars of the semantics of
nnan are unclear to me.
But let me explore what Eli is saying. It sounds reasonable, but I have a
question about it.
Suppose we have the nnan FMF set, and we encounter this:
%y = fdiv float %x, undef
If I’ve understood Eli’s interpretation correctly, any of the following
transformations would be legal and safe:
%y = 0.0
%y = -1.0
%y = inf
%y = NaN
And so on, covering all possible concrete values, right? Now suppose I don’t
change it at all. Now I might have IR that looks like this.
%y = fdiv float %x, undef
%z = fmul float %q, %y
At this point, while working on %z, I could reconsider and say “If I had
transformed the fdiv into ‘%y = 0.0’ I could optimize this fmul away too.”
So at that point I can choose to do that, right? And in general I can
“retroactively” choose any concrete value that would be convenient for the
next transformation. You see where I’m going with this?
How is that different from just folding the fdiv into undef to begin with?
Is it because I can’t choose different values on different code paths?
-Andy
From: Friedman, Eli [mailto:efriedma at codeaurora.org]
Sent: Wednesday, February 28, 2018 3:07 PM
To: Sanjay Patel <spatel at rotateright.com>; Kaylor, Andrew
<andrew.kaylor at intel.com>
Cc: llvm-dev <llvm-dev at lists.llvm.org>; Nuno Lopes <nunoplopes at sapo.pt>;
Stephen Canon <scanon at apple.com>; David Majnemer <david.majnemer at gmail.com>;
John Regehr <regehr at cs.utah.edu>; Sanjoy Das
<sanjoy at playingwithpointers.com>; Matt Arsenault <arsenm2 at gmail.com>;
Kreitzer, David L <david.l.kreitzer at intel.com>
Subject: Re: how to simplify FP ops with an undef operand?
I'm pretty sure that isn't what nnan is supposed to mean. If the result of
nnan math were undefined in the sense of "undef", programs using nnan could
have undefined behavior if the result is used in certain ways which would
not be undefined for any actual float value (e.g. converting the result to a
string), which seems like a surprising result. And I don't think we gain
any useful optimization power from saying we can fold to undef instead of
something else.
So I think it's supposed to say "the result is not specified" or something
(so an nnan operation which would produce a nan can instead produce any
value that isn't undef/poison).
-Eli
On 2/28/2018 2:45 PM, Sanjay Patel wrote:
Ah, thanks for explaining. So given that any of these ops will return NaN
with a NaN operand, let's choose the undef operand value to be NaN. That
means we can fold all of these to a NaN constant in the general case.
But if we have 'nnan' FMF, then we can fold harder to undef?
nnan - Allow optimizations to assume the arguments and result are not NaN.
Such optimizations are required to retain defined behavior over NaNs, but
the value of the result is undefined.
On Wed, Feb 28, 2018 at 3:25 PM, Kaylor, Andrew <andrew.kaylor at intel.com>
wrote:
What I’m saying is that if we have one operand that is not an undef value
then that operand might be NaN and if it is then the result must be NaN. So
while it may be true that we don’t have a NaN, it is not true that we
definitely do not have a NaN in the example. This is analogous to the
example in the language reference where it says “%A = or %X, undef” -> “%A =
undef” is unsafe because any bits that are set in %A must be set in the
result. If any floating point operand is NaN dynamically, then the result
must be NaN.
I don’t believe it’s accurate to say that NaN is “morally equivalent” to
undef. There are some similarities, but the important difference is that NaN
always has well defined behavior with a specific correct result. There is,
perhaps, a sense in which it is analogous to a poison value, but for the
purposes of reasoning about the correctness of floating point operations I
think it’s best to be pedantic about treating it as the specific value that
it is.
Finally, I was pretty sure you knew that fdiv by zero wasn’t undefined. I
just wanted to clarify that the “?” in your comment was indicating that the
assertion in the language reference was questionable as opposed to this
point being in any way actually uncertain.
From: Sanjay Patel [mailto:spatel at rotateright.com]
Sent: Wednesday, February 28, 2018 1:05 PM
To: Kaylor, Andrew <andrew.kaylor at intel.com>
Cc: llvm-dev <llvm-dev at lists.llvm.org>; Nuno Lopes <nunoplopes at sapo.pt>;
Stephen Canon <scanon at apple.com>; David Majnemer <david.majnemer at gmail.com>;
John Regehr <regehr at cs.utah.edu>; Sanjoy Das
<sanjoy at playingwithpointers.com>; Friedman, Eli <efriedma at codeaurora.org>;
Matt Arsenault <arsenm2 at gmail.com>; Kreitzer, David L
<david.l.kreitzer at intel.com>
Subject: Re: how to simplify FP ops with an undef operand?
Correct - NaN is not undef in IR. But we don't have a NaN in this example.
We have its moral equivalent in LLVM - an uninitialized value, undef.
So we're not introducing any extra uncertainty by propagating the undef. The
backend can choose whatever encoding of undef makes sense when lowering?
And yes, I don't know why FP-div-by-zero would ever be UB. I think that text
in the LangRef should be removed regardless of any other outcome here.
On Wed, Feb 28, 2018 at 1:18 PM, Kaylor, Andrew <andrew.kaylor at intel.com>
wrote:
Why is NaN “just ‘undef’ in IR”? NaN is a specific value with well-defined
behavior. I would think that unless the no-NaNs flag is used we need to
preserve the behavior of NaNs.
From: Sanjay Patel [mailto:spatel at rotateright.com]
Sent: Wednesday, February 28, 2018 12:08 PM
To: Kaylor, Andrew <andrew.kaylor at intel.com>
Cc: llvm-dev <llvm-dev at lists.llvm.org>; Nuno Lopes <nunoplopes at sapo.pt>;
Stephen Canon <scanon at apple.com>; David Majnemer <david.majnemer at gmail.com>;
John Regehr <regehr at cs.utah.edu>; Sanjoy Das
<sanjoy at playingwithpointers.com>; Friedman, Eli <efriedma at codeaurora.org>;
Matt Arsenault <arsenm2 at gmail.com>
Subject: Re: how to simplify FP ops with an undef operand?
Yes, if %x is a NaN, we should expect that NaN is propagated.
I'm still not sure what to do here. We can take comfort in knowing that
whatever we do is likely an improvement over the current situation though.
:)
That's because the code in InstSimplify is inconsistent with the LangRef:
http://llvm.org/docs/LangRef.html#undefined-values (UB for fdiv by 0?)
...and both of those are inconsistent with undef handling in SDAG.
Let me propose an alternate interpretation:
1. The meaning of snan as written in IEEE754-2008 is: "Signaling NaNs afford
representations for uninitialized variables..."
2. That matches our intent with 'undef' here in IR as written in the
LangRef: "unspecified bit-pattern".
3. The current fdiv transform is actually correct (any SNaN UB/trapping
commentary is irrelevant because we assume exceptions are off by default).
The undef operand represents an uninitialized variable, and the result of
any FP op with that uninitialized variable is well-defined: it's another NaN
which is just 'undef' in IR.
On Wed, Feb 28, 2018 at 11:43 AM, Kaylor, Andrew <andrew.kaylor at intel.com>
wrote:
I’m not sure the transformation happening with fdiv is correct. If we have
“%y = fdiv float %x, undef” and %x is a NaN then the result will be NaN for
any value of the undef, right? So if I understand the undef rules correctly
(never a certainty) then we can’t safely replace the expression with undef.
We could, I think, replace it with “%y = %x” though. I think the same is
true for fadd, fsub, fmul, and frem.
-Andy
%y = fadd float %x, undef
Can we simplify this?
Currently in IR, we do nothing for fadd/fsub/fmul. For fdiv/frem, we
propagate undef. The code comment for fdiv/frem says:
"the undef could be a snan"
If that's correct, then shouldn't it be the same for fadd/fsub/fmul? But
this can't be correct because we support targets that don't raise
exceptions...and even targets that raise exceptions do not trap by default
on snan?
-- Employee of Qualcomm Innovation Center, Inc.Qualcomm Innovation Center,
Inc. is a member of Code Aurora Forum, a Linux Foundation Collaborative
Project
More information about the llvm-dev
mailing list