[PATCH] D158127: AMDGPU: Permit more rsq formation in AMDGPUCodeGenPrepare

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Aug 16 15:58:07 PDT 2023


arsenm created this revision.
arsenm added reviewers: AMDGPU, rampitec, foad.
Herald added subscribers: StephenFan, kerbowa, hiraditya, tpr, dstuttard, yaxunl, jvesely, kzhuravl.
Herald added a project: All.
arsenm requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.

We were deciding whether to defer the fast case to codegen based on the
fdiv itself, and not looking for a foldable sqrt input.
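
For readers unfamiliar with the pattern, the following is a minimal sketch of the arithmetic involved, not of the pass logic itself. It assumes "rsq" means a reciprocal square root operation; the function names are hypothetical and do not come from the patch.

```cpp
#include <cmath>

// Hypothetical helper names for illustration only; this models the
// arithmetic identity, not AMDGPUCodeGenPrepare's decision logic.

// The pattern as it appears before folding: an fdiv whose denominator
// is the result of a sqrt.
float div_by_sqrt(float x, float y) { return x / std::sqrt(y); }

// Software stand-in for a single reciprocal-square-root operation.
float rsq(float y) { return 1.0f / std::sqrt(y); }

// The folded form: one rsq followed by a multiply.
float mul_by_rsq(float x, float y) { return x * rsq(y); }
```

The two forms are not bit-identical for every input, which is presumably why the fold depends on what the fdiv and its sqrt input permit rather than being applied unconditionally.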


https://reviews.llvm.org/D158127

Files:
  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
  llvm/test/CodeGen/AMDGPU/fdiv_flags.f32.ll
  llvm/test/CodeGen/AMDGPU/rsq.f32.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D158127.550921.patch
Type: text/x-patch
Size: 12083 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230816/1a321240/attachment.bin>

