[PATCH] D48573: [AMDGPU] Add llvm.amdgcn.fmad.ftz intrinsic
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 26 07:19:57 PDT 2018
rampitec added inline comments.
================
Comment at: include/llvm/IR/IntrinsicsAMDGPU.td:364
+// v_mad_f32/v_mac_f32, selected regardless of denorm support.
+def int_amdgcn_fmad_ftz :
+ Intrinsic<[llvm_float_ty],
----------------
arsenm wrote:
> rampitec wrote:
> > arsenm wrote:
> > > Does v_mad_f16 flush denormals? I don't think it does
> > It does. It cannot preserve denormals. It is even lowered into FMAD_FTZ, which is lowered to mad. The new part is this intrinsic sitting above the existing SDNode.
> In that case I would make this intrinsic type mangled and make it work for f16 as well
AFAIR f16 does not flush.
https://reviews.llvm.org/D48573