[PATCH] D73588: AMDGPU: Enhancement on FDIV lowering in AMDGPUCodeGenPrepare
Changpeng Fang via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Jan 29 14:53:40 PST 2020
cfang marked 2 inline comments as done.
cfang added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:714
const bool UseFDivFast = Ty->isFloatTy() && !NeedHighAccuracy &&
- !FastUnsafeRcpLegal;
+ !CanReassociateFDiv;
----------------
arsenm wrote:
> fdiv.fast doesn't' care about the reassociation
You are right. This is just the optimization priority issue.
If we can reassociate fdiv, x/y -> x * rcp(y) is faster than fdiv.fast so we don't do fdiv.fast.
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D73588/new/
https://reviews.llvm.org/D73588
More information about the llvm-commits
mailing list