[PATCH] D89697: * [x86] Implement smarter instruction lowering for FP_TO_UINT from vXf32/vXf64 to vXi32 for SSE2 and AVX2 by using the exact semantic of the CVTTPS2SI instruction.
Simon Pilgrim via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 12 13:24:35 PDT 2021
RKSimon updated this revision to Diff 358048.
RKSimon retitled this revision from "* [x86] Implement smarter instruction lowering for FP_TO_UINT from vXf32 to vXi32 for SSE2 and AVX2 by using the exact semantic of the CVTTPS2SI instruction." to "* [x86] Implement smarter instruction lowering for FP_TO_UINT from vXf32/vXf64 to vXi32 for SSE2 and AVX2 by using the exact semantic of the CVTTPS2SI instruction.".
RKSimon edited the summary of this revision.
RKSimon added a comment.
Updated patch to also handle vXf64->vXi32 vector and f32/f64->u64/u32 scalar fptoui cases.
I've also added AVX1 support for v8f32->v8i32 where we still need a VBLENDPS (instead of a VSRAI ymm).
Refreshed costs using the helper script from D103695 <https://reviews.llvm.org/D103695> for reference.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D89697/new/
https://reviews.llvm.org/D89697
Files:
llvm/lib/Target/X86/X86ISelLowering.cpp
llvm/lib/Target/X86/X86TargetTransformInfo.cpp
llvm/test/Analysis/CostModel/X86/fptoui.ll
llvm/test/CodeGen/X86/concat-cast.ll
llvm/test/CodeGen/X86/fptoui-sat-scalar.ll
llvm/test/CodeGen/X86/ftrunc.ll
llvm/test/CodeGen/X86/half.ll
llvm/test/CodeGen/X86/scalar-fp-to-i32.ll
llvm/test/CodeGen/X86/scalar-fp-to-i64.ll
llvm/test/CodeGen/X86/vec_cast3.ll
llvm/test/CodeGen/X86/vec_fp_to_int.ll
llvm/test/Transforms/SLPVectorizer/X86/fptoui.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D89697.358048.patch
Type: text/x-patch
Size: 122946 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210712/9c2bfde7/attachment.bin>
More information about the llvm-commits
mailing list