[PATCH] D73588: AMDGPU: Enhancement on FDIV lowering in AMDGPUCodeGenPrepare

Changpeng Fang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jan 31 14:26:00 PST 2020


cfang updated this revision to Diff 241807.
cfang added a comment.

Updated based on review comments.

1. Use a 2.5 ulp threshold for fdiv.fast and a 1.0 ulp threshold for rcp.
2. Introduce a function, lowerUsingFDivFast, that lowers to fdiv.fast. This function should be called after lowerUsingRcp because rcp is the preference.
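
The ordering described in the two points above can be sketched roughly as follows. This is only an illustration, not the code in the patch: the enum, the constants, chooseFDivLowering, and the RcpApplicable flag are hypothetical names. It assumes the threshold is compared against the maximum error (in ulp) that the fdiv is allowed to have, and that the rcp path has some additional applicability condition not spelled out in this message.

```cpp
// Hypothetical sketch of the selection order; NOT the code from D73588.
enum class FDivLowering { Rcp, FDivFast, Default };

// Thresholds taken from the update note above.
constexpr float RcpMinUlp = 1.0f;      // rcp needs at least 1.0 ulp of slack
constexpr float FDivFastMinUlp = 2.5f; // fdiv.fast needs at least 2.5 ulp

// Assumption: AllowedUlp is the maximum error the division may have, and
// RcpApplicable stands in for whatever extra precondition the rcp path has.
constexpr bool canLowerUsingRcp(float AllowedUlp, bool RcpApplicable) {
  return RcpApplicable && AllowedUlp >= RcpMinUlp;
}

constexpr bool canLowerUsingFDivFast(float AllowedUlp) {
  return AllowedUlp >= FDivFastMinUlp;
}

// rcp is tried first because it is the preference; fdiv.fast is the fallback.
constexpr FDivLowering chooseFDivLowering(float AllowedUlp, bool RcpApplicable) {
  if (canLowerUsingRcp(AllowedUlp, RcpApplicable))
    return FDivLowering::Rcp;
  if (canLowerUsingFDivFast(AllowedUlp))
    return FDivLowering::FDivFast;
  return FDivLowering::Default; // leave the fdiv for the normal lowering
}
```

Under these assumptions, a division that allows 2.5 ulp of error still picks rcp when rcp is applicable, and falls back to fdiv.fast only when it is not.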

This update should clear up some confusion.


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D73588/new/

https://reviews.llvm.org/D73588

Files:
  llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp
  llvm/lib/Target/AMDGPU/SIISelLowering.cpp
  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-fdiv.ll
  llvm/test/CodeGen/AMDGPU/fdiv.f16.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D73588.241807.patch
Type: text/x-patch
Size: 18194 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20200131/e1c0de3e/attachment.bin>

