[llvm] r297650 - AMDGPU: Fold icmp/fcmp into icmp intrinsic
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Fri Mar 31 11:37:11 PDT 2017
> On Mar 31, 2017, at 03:03, Nicolai Hähnle via llvm-commits <llvm-commits at lists.llvm.org> wrote:
>
> Hi Matt,
>
> this commit causes the attached shader to miscompile. Specifically:
>
> opt -S -instcombine < $shader
>
> causes the logical and in the endif8 block to disappear and one of the branch conditions to be come constant.
>
> I haven't looked in more detail yet, just found this via a bisect, and reverting the commit fixes the issue.
>
> Thanks,
> Nicolai
This testcase only has a use with constant inputs: %25 = call i64 @llvm.amdgcn.icmp.i32(i32 -1, i32 0, i32 33) #2 which folds to true, so I would expect somewhere a branch condition would fold to a constant?
-Matt
More information about the llvm-commits
mailing list