[PATCH] D43414: AMDGPU: Define FP_FAST_FMA{F} macros for amdgcn

Brian Sumner via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Feb 16 15:19:00 PST 2018


b-sumner added inline comments.


================
Comment at: lib/Basic/Targets/AMDGPU.cpp:345-348
+  if (getTriple().getArch() == llvm::Triple::amdgcn) {
+    Builder.defineMacro("FP_FAST_FMA");
+    Builder.defineMacro("FP_FAST_FMAF");
+  }
----------------
t-tye wrote:
> Do all amdgcn targets have fast FMA? @b-sumner can you clarify?
No.  All targets that support double precision should report FAST_FMA.  Only targets with full rate v_fma_f32 should report FAST_FMAF



https://reviews.llvm.org/D43414





More information about the llvm-commits mailing list