[PATCH] D71293: AMDGPU: Generate the correct sequence of code for FDIV32 when correctly-rounded-divide-sqrt is set

Changpeng Fang via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue Jan 21 09:38:44 PST 2020


cfang marked 3 inline comments as done.
cfang added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUCodeGenPrepare.cpp:603
   bool UnsafeDiv = HasUnsafeFPMath || FMF.isFast() ||
                                       FMF.allowReciprocal();
   // With UnsafeDiv node will be optimized to just rcp and mul.
----------------
arsenm wrote:
> I don't think just allow reciprocal is sufficient without either checking FPMath or afn. I think this needs to be something more like
> UnsafeFP || isFast || (allowReciprocal && (denormal hasLowAccuracy || approximateFunction))
Can you explain what is exactly "denormal hasLowAccuracy" here?


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D71293/new/

https://reviews.llvm.org/D71293





More information about the llvm-commits mailing list