[PATCH] D21190: [InstCombine] allow more than one use for vector cast folding with selects
Eli Friedman via llvm-commits
llvm-commits at lists.llvm.org
Thu Jun 16 10:48:20 PDT 2016
eli.friedman accepted this revision.
eli.friedman added a comment.
This revision is now accepted and ready to land.
This patch doesn't only apply to bitcasts, which can lead to gigantic codegen changes in some cases. That isn't a new problem, though; consider:
define void @min_max_trunc(<4 x float> %a, <4 x float> %b, <4 x i64> %c, <4 x i64> %d, <4 x i32>* %ptr1, <4 x i32>* %ptr2) {
%cmp = fcmp olt <4 x float> %a, %b
%bc1 = trunc <4 x i64> %c to <4 x i32>
%bc2 = trunc <4 x i64> %d to <4 x i32>
%sel1 = select <4 x i1> %cmp, <4 x i32> %bc1, <4 x i32> %bc2
store <4 x i32> %sel1, <4 x i32>* %ptr1
ret void
}
instcombine makes this generate much worse code for SSE2. Although, I'm pretty sure we can blame SelectionDAG for some part of that because it's generating absolutely terrible code. Feel free to just file a bug for this, but I'm pretty sure we need some sort of target-hook for this.
It would be nice to throw together a few testcases for zext/sext/trunc/sitofp/fptoui just to make sure we have coverage.
Otherwise LGTM.
http://reviews.llvm.org/D21190
More information about the llvm-commits
mailing list