[PATCH] D120150: Constant folding of llvm.amdgcn.trig.preop

Fri Feb 18 23:38:31 PST 2022

Ravi added inline comments.

================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp:1009
+      Thi >>= 11;
+      double Res = (double)Thi;
+      int Scale = -53 - Shift;
----------------
arsenm wrote:
> Should stick with apfloat operations
Yes, will do that.

================
Comment at: llvm/lib/Target/AMDGPU/AMDGPUInstCombineIntrinsic.cpp:1013
+        Scale += 128;
+      Res = ldexp(Res, Scale);
+      return IC.replaceInstUsesWith(II, ConstantFP::get(Src->getType(), Res));
----------------
arsenm wrote:
> You can use scalbn for APFloat instead of relying on host ldexp
Yes..will try it out and check for any precision difference. Should be good as long as the internal implementation of APFloat is within 0.5 ULP.

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D120150/new/

https://reviews.llvm.org/D120150