[PATCH] D68557: PR41162 Implement LKK remainder and divisibility algorithms [srem]
Tim Gymnich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Oct 6 15:48:57 PDT 2019
TG908 created this revision.
TG908 added a project: LLVM.
Herald added subscribers: llvm-commits, jsji, MaskRay, hiraditya, nemanjai.
This patch implements the LKK algorithm for optimizing remainder computations with a constant divisor.
LKK is an improvement on the previously used Granlund-Montgomery-Warren approach.
UREM Patch <https://reviews.llvm.org/D68360>
PR41162 <https://bugs.llvm.org/show_bug.cgi?id=41162>
LKK Paper <https://arxiv.org/pdf/1902.01961.pdf>
**Notes:**
- I added functions `foldSREM` and `foldUREM` in `DAGCombiner.cpp` to handle signed and unsigned remainders.
- Tests have been performed on x86 for every 8 bit signed and unsigned remainder operation.
- Tests for 32 bit signed remainders have been performed on a large set of random integers.
- I also added some llc code gen tests.
- I am a bit unsure whether my `isOperationLegalOrCustomOrPromote` checks are too strict.
Repository:
rL LLVM
https://reviews.llvm.org/D68557
Files:
llvm/include/llvm/CodeGen/TargetLowering.h
llvm/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
llvm/test/CodeGen/AArch64/srem-lkk.ll
llvm/test/CodeGen/AArch64/srem-seteq.ll
llvm/test/CodeGen/AArch64/srem-vector-lkk.ll
llvm/test/CodeGen/ARM/urem-opt-size.ll
llvm/test/CodeGen/PowerPC/machine-pre.ll
llvm/test/CodeGen/PowerPC/srem-vector-lkk.ll
llvm/test/CodeGen/X86/load-scalar-as-vector.ll
llvm/test/CodeGen/X86/pr14088.ll
llvm/test/CodeGen/X86/srem-lkk.ll
llvm/test/CodeGen/X86/srem-vector-lkk.ll
llvm/test/CodeGen/X86/urem-vector-lkk.ll
llvm/test/CodeGen/X86/vector-idiv-sdiv-128.ll
llvm/test/CodeGen/X86/vector-idiv-sdiv-256.ll
llvm/test/CodeGen/X86/vector-idiv-sdiv-512.ll
llvm/test/CodeGen/X86/vector-intrinsics.ll
llvm/test/CodeGen/X86/vector-rem.ll
llvm/test/CodeGen/X86/vector-truncate-combine.ll
llvm/test/CodeGen/X86/vector-variable-idx.ll
llvm/test/CodeGen/X86/vector-variable-idx2.ll
llvm/test/CodeGen/X86/vector-width-store-merge.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D68557.223447.patch
Type: text/x-patch
Size: 144607 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20191006/17bce26a/attachment-0001.bin>
More information about the llvm-commits
mailing list