[PATCH] D140024: [RISCV] Allow conversion of fp divisions to fp multiplications by the reciprocal
Anton Sidorenko via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 14 06:31:01 PST 2022
asi-sc created this revision.
Herald added subscribers: sunshaoce, VincentWu, vkmr, frasercrmck, evandro, luismarques, apazos, sameer.abuasal, s.egerton, Jim, benna, psnobl, jocewei, PkmX, the_o, brucehoult, MartinMosbeck, rogfer01, edward-jones, zzheng, jrtc27, shiva0217, kito-cheng, niosHD, sabuasal, simoncook, johnrusso, rbar, asb, hiraditya, arichardson.
Herald added a project: All.
asi-sc requested review of this revision.
Herald added subscribers: llvm-commits, pcwang-thead, eopXD, MaskRay.
Herald added a project: LLVM.
If the divisor is repeated at least twice, we will convert the FDIVs to the
calculation of the reciprocal and FMULs.
We perform the transformation only under fast-math mode. FDIVs must have
'arcp' flag.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D140024
Files:
llvm/lib/Target/RISCV/RISCVISelLowering.cpp
llvm/lib/Target/RISCV/RISCVISelLowering.h
llvm/test/CodeGen/RISCV/repeated-fp-divisors.ll
Index: llvm/test/CodeGen/RISCV/repeated-fp-divisors.ll
===================================================================
--- llvm/test/CodeGen/RISCV/repeated-fp-divisors.ll
+++ llvm/test/CodeGen/RISCV/repeated-fp-divisors.ll
@@ -1,6 +1,7 @@
; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -mtriple=riscv64 -mattr=+d -verify-machineinstrs < %s | FileCheck %s
+; Negative test
define void @single_fdiv(double %a0, double %a1, ptr %res) {
; CHECK-LABEL: single_fdiv:
; CHECK: # %bb.0: # %entry
@@ -16,10 +17,13 @@
define void @two_fdivs(double %a0, double %a1, double %a2, ptr %res) {
; CHECK-LABEL: two_fdivs:
; CHECK: # %bb.0: # %entry
-; CHECK-NEXT: fdiv.d ft0, fa1, fa0
-; CHECK-NEXT: fdiv.d ft1, fa2, fa0
-; CHECK-NEXT: fsd ft0, 0(a0)
-; CHECK-NEXT: fsd ft1, 8(a0)
+; CHECK-NEXT: lui a1, %hi(.LCPI1_0)
+; CHECK-NEXT: fld ft0, %lo(.LCPI1_0)(a1)
+; CHECK-NEXT: fdiv.d ft0, ft0, fa0
+; CHECK-NEXT: fmul.d ft1, fa1, ft0
+; CHECK-NEXT: fmul.d ft0, fa2, ft0
+; CHECK-NEXT: fsd ft1, 0(a0)
+; CHECK-NEXT: fsd ft0, 8(a0)
; CHECK-NEXT: ret
entry:
%div = fdiv arcp double %a1, %a0
@@ -30,6 +34,7 @@
ret void
}
+; Negative test
define void @no_arcp(double %a0, double %a1, double %a2, ptr %res) {
; CHECK-LABEL: no_arcp:
; CHECK: # %bb.0: # %entry
Index: llvm/lib/Target/RISCV/RISCVISelLowering.h
===================================================================
--- llvm/lib/Target/RISCV/RISCVISelLowering.h
+++ llvm/lib/Target/RISCV/RISCVISelLowering.h
@@ -734,6 +734,10 @@
bool shouldNormalizeToSelectSequence(LLVMContext &, EVT) const override {
return false;
};
+
+ /// For available scheduling models FDIV + two independent FMULs are much
+ /// faster than two FDIVs.
+ unsigned combineRepeatedFPDivisors() const override;
};
namespace RISCVVIntrinsicsTable {
Index: llvm/lib/Target/RISCV/RISCVISelLowering.cpp
===================================================================
--- llvm/lib/Target/RISCV/RISCVISelLowering.cpp
+++ llvm/lib/Target/RISCV/RISCVISelLowering.cpp
@@ -59,6 +59,12 @@
"VWADD_W) with splat constants"),
cl::init(false));
+static cl::opt<unsigned> NumRepeatedDivisors(
+ DEBUG_TYPE "-fp-repeated-divisors", cl::Hidden,
+ cl::desc("Set the minimum number of repetitions of a divisor to allow "
+ "transformation to multiplications by the reciprocal"),
+ cl::init(2));
+
RISCVTargetLowering::RISCVTargetLowering(const TargetMachine &TM,
const RISCVSubtarget &STI)
: TargetLowering(TM), Subtarget(STI) {
@@ -1726,6 +1732,10 @@
return false;
}
+unsigned RISCVTargetLowering::combineRepeatedFPDivisors() const {
+ return NumRepeatedDivisors;
+}
+
static SDValue getVLOperand(SDValue Op) {
assert((Op.getOpcode() == ISD::INTRINSIC_WO_CHAIN ||
Op.getOpcode() == ISD::INTRINSIC_W_CHAIN) &&
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D140024.482843.patch
Type: text/x-patch
Size: 2989 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20221214/f4dd70fb/attachment.bin>
More information about the llvm-commits
mailing list