[PATCH] D11678: [CodeGen] Fixes *absdiff* intrinsic: LangRef doc/test case improvement and corresponding code change
James Molloy
james at jamesmolloy.co.uk
Mon Aug 3 13:09:06 PDT 2015
Hi Michael,
I think the complexity comes from the subtraction having as its domain two
unsigned integers- so it's range must be a larger signed integer.
Signed comparison for unsigned values is clearly wrong as you say, but I
could contruct a testcase that shows incorrect behaviour with an unsigned
comparison too. I think the only correct behaviour is to extend the inputs
first and truncate the result.
But I've been wrong before :)
James
On Mon, 3 Aug 2015 at 21:05, Michael Zolotukhin <mzolotukhin at apple.com>
wrote:
> mzolotukhin added inline comments.
>
> ================
> Comment at: docs/LangRef.rst:10387-10390
> @@ -10386,6 +10386,6 @@
>
> %sub = sub nsw <4 x i32> %a, %b
> - %ispos = icmp sgt <4 x i32> %sub, <i32 -1, i32 -1, i32 -1, i32 -1>
> + %ispos = icmp sge <4 x i32> %sub, zeroinitializer
> %neg = sub nsw <4 x i32> zeroinitializer, %sub
> %1 = select <4 x i1> %ispos, <4 x i32> %sub, <4 x i32> %neg
>
> ----------------
> ashahid wrote:
> > mzolotukhin wrote:
> > > What's the difference between `llvm.uabsdiff` and `llvm.sabsdiff` then?
> > The difference is the presence of NSW flag in case of llvm.sabsdiff.
> I still don't think it's correct. NSW is just a hint to optimizers, but it
> doesn't add any additional logic. It does assert that the expression won't
> overflow, but the operations we execute are still the same. That is,
> currently the only difference between signed and unsigned version is that
> for signed version we could get an undefined behavior in some cases. This
> is clearly incorrect, because we should get different results without
> undefined behavior in some cases (e.g. `<-1,-1,-1,-1>` and `<1,1,1,1>` - it
> should give `<254,254,254,254>` for `uabsdiff.v4i8` and `<2,2,2,2>` for
> `sabsdiff.v4i8`).
>
> What really should be the difference, as far is I understand, is condition
> code in the comparison:
> ```
> %ispos = icmp sge <4 x i32> %sub, zeroinitializer
> ```
> As far as I understand, we should use `uge` for unsigned and `sge` for
> signed case.
>
>
>
> Repository:
> rL LLVM
>
> http://reviews.llvm.org/D11678
>
>
>
>
> _______________________________________________
> llvm-commits mailing list
> llvm-commits at cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20150803/8d8e4749/attachment.html>
More information about the llvm-commits
mailing list