[PATCH] D129431: [TargetLowering][RISCV] Make expandCTLZ work for non-power of 2 types.
Craig Topper via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sat Jul 9 16:49:20 PDT 2022
craig.topper created this revision.
craig.topper added reviewers: RKSimon, spatel, asb, luismarques, reames.
Herald added subscribers: sunshaoce, VincentWu, luke957, StephenFan, vkmr, frasercrmck, evandro, apazos, sameer.abuasal, s.egerton, Jim, benna, psnobl, jocewei, PkmX, the_o, brucehoult, MartinMosbeck, rogfer01, edward-jones, zzheng, jrtc27, shiva0217, kito-cheng, niosHD, sabuasal, simoncook, johnrusso, rbar, hiraditya, arichardson.
Herald added a project: All.
craig.topper requested review of this revision.
Herald added subscribers: pcwang-thead, eopXD, MaskRay.
Herald added a project: LLVM.
To convert CTLZ to popcount we do
x = x | (x >> 1);
x = x | (x >> 2);
...
x = x | (x >>16);
x = x | (x >>32); // for 64-bit input
return popcount(~x);
This smears the most significant set bit across all of the bits
below it then inverts the remaining 0s and does a population count.
To support non-power of 2 types, the last shift amount must be
more than half of the size of the type. For i15, the last shift
was previously a shift by 4, with this patch we add another shift
of 8.
Fixes PR56457.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D129431
Files:
llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
llvm/test/CodeGen/RISCV/pr56457.ll
Index: llvm/test/CodeGen/RISCV/pr56457.ll
===================================================================
--- llvm/test/CodeGen/RISCV/pr56457.ll
+++ llvm/test/CodeGen/RISCV/pr56457.ll
@@ -18,6 +18,9 @@
; CHECK-NEXT: slli a1, a0, 49
; CHECK-NEXT: srli a1, a1, 53
; CHECK-NEXT: or a0, a0, a1
+; CHECK-NEXT: slli a1, a0, 49
+; CHECK-NEXT: srli a1, a1, 57
+; CHECK-NEXT: or a0, a0, a1
; CHECK-NEXT: not a0, a0
; CHECK-NEXT: slli a0, a0, 49
; CHECK-NEXT: srli a0, a0, 49
Index: llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
===================================================================
--- llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
+++ llvm/lib/CodeGen/SelectionDAG/TargetLowering.cpp
@@ -7856,7 +7856,7 @@
// return popcount(~x);
//
// Ref: "Hacker's Delight" by Henry Warren
- for (unsigned i = 0; (1U << i) <= (NumBitsPerElt / 2); ++i) {
+ for (unsigned i = 0; (1U << i) < NumBitsPerElt; ++i) {
SDValue Tmp = DAG.getConstant(1ULL << i, dl, ShVT);
Op = DAG.getNode(ISD::OR, dl, VT, Op,
DAG.getNode(ISD::SRL, dl, VT, Op, Tmp));
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D129431.443465.patch
Type: text/x-patch
Size: 1120 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20220709/be2a30a4/attachment.bin>
More information about the llvm-commits
mailing list