[PATCH] D156677: AMDGPU: Remove pointless libcall optimization of fma/mad
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 31 05:16:51 PDT 2023
arsenm created this revision.
arsenm added reviewers: AMDGPU, rampitec, jhuber6, vpykhtin, dfukalov, foad, yaxunl.
Herald added subscribers: StephenFan, kerbowa, hiraditya, tpr, dstuttard, jvesely, kzhuravl.
Herald added a project: All.
arsenm requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.
After the library is linked and trivially inlined, the generic fma and
fmuladd intrinsics already handle these cases, and do so with precise
fast-math flag handling. The libcall optimization required all fast-math
flags, when only nsz is needed for the fma(a, b, 0) case.
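
The nsz requirement can be illustrated outside of LLVM with a small
standalone sketch. It is not code from the patch, and the helper names
are hypothetical. Assuming IEEE 754 doubles, the default round-to-nearest
mode, and no -ffast-math, fma(a, b, +0.0) and a * b differ only when the
product is -0.0, because (-0.0) + (+0.0) is +0.0. Ignoring the sign of
zero is what nsz permits, so no other fast-math flag should be needed for
that fold.

```cpp
#include <cmath>

// Hypothetical helpers for illustration only; none of this is from the patch.

// True when x and y compare equal and carry the same sign bit, so +0.0 and
// -0.0 count as different results. NaN inputs are not considered here.
bool sameValueAndSign(double x, double y) {
  return x == y && std::signbit(x) == std::signbit(y);
}

// Compares fma(a, b, +0.0) with the plain product a * b.
// These differ only in the sign of a zero result.
bool fmaPlusZeroMatchesMul(double a, double b) {
  return sameValueAndSign(std::fma(a, b, 0.0), a * b);
}

// Compares fma(a, b, -0.0) with the plain product a * b.
// Adding -0.0 leaves both the value and the sign of the product unchanged.
bool fmaMinusZeroMatchesMul(double a, double b) {
  return sameValueAndSign(std::fma(a, b, -0.0), a * b);
}
```

For example, fmaPlusZeroMatchesMul(-1.0, 0.0) is false, since the product
is -0.0 while the fma result is +0.0, whereas
fmaMinusZeroMatchesMul(-1.0, 0.0) is true.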
https://reviews.llvm.org/D156677
Files:
llvm/lib/Target/AMDGPU/AMDGPULibCalls.cpp
llvm/test/CodeGen/AMDGPU/simplify-libcalls.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D156677.545605.patch
Type: text/x-patch
Size: 5566 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230731/57e6bc21/attachment.bin>