[PATCH] D73588: AMDGPU: Enhancement on FDIV lowering in AMDGPUCodeGenPrepare
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jan 31 15:03:43 PST 2020
arsenm added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:611
//
-// 1/x -> rcp(x) when fast unsafe rcp is legal or fpmath >= 2.5ULP with
-// denormals flushed.
+// 1/x -> rcp(x) when fdiv is allowed to be re-associated or rcp is accurate.
//
----------------
This has nothing to do with reassociation
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:614
+// a/b -> a*rcp(b) when fdiv is allowed to be re-associated.
+static Value *lowerUsingRcp (Value *Num, Value *Den, bool CanReassociateFDiv,
+ bool RcpIsAccurate, IRBuilder<> Builder,
----------------
This should not be referred to ass lowering
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:624
if (const ConstantFP *CLHS = dyn_cast<ConstantFP>(Num)) {
- if (FastUnsafeRcpLegal || Ty->isFloatTy() || Ty->isHalfTy()) {
+ if (CanReassociateFDiv || RcpIsAccurate) {
if (CLHS->isExactlyValue(1.0)) {
----------------
We aren't fdiv here. We're handling an fdiv, and not splitting it into a multiple and rcp
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D73588/new/
https://reviews.llvm.org/D73588
More information about the llvm-commits
mailing list