[PATCH] D155593: AMDGPU: Overhaul and improve rcp and rsq f32 formation

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Jul 19 06:24:33 PDT 2023


arsenm updated this revision to Diff 541988.
arsenm added a comment.

Defer the afn/unsafe-fp-math case to codegen for now, although it's really aggressive


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D155593/new/

https://reviews.llvm.org/D155593

Files:
  llvm/docs/ReleaseNotes.rst
  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
  llvm/test/CodeGen/AMDGPU/GlobalISel/fdiv.f32.ll
  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
  llvm/test/CodeGen/AMDGPU/fdiv.ll
  llvm/test/CodeGen/AMDGPU/fdiv32-to-rcp-folding.ll
  llvm/test/CodeGen/AMDGPU/fdiv_flags.f32.ll
  llvm/test/CodeGen/AMDGPU/rcp-pattern.ll
  llvm/test/CodeGen/AMDGPU/rsq.f32.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D155593.541988.patch
Type: text/x-patch
Size: 450206 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20230719/040562b8/attachment-0001.bin>


More information about the llvm-commits mailing list