[PATCH] D32084: AMDGPU/GFX9: Set +fast-fmaf for >=gfx900 unless -cl-denorms-are-zero is set

Matt Arsenault via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Fri Apr 21 10:57:31 PDT 2017


arsenm added inline comments.


================
Comment at: lib/Basic/Targets.cpp:2208-2210
+      TargetOpts.Features.push_back(
+          (Twine(hasFullSpeedFMAF32(TargetOpts.CPU) &&
+          !CGOpts.FlushDenorm ? '+' : '-') + Twine("fast-fmaf")).str());
----------------
We glue fast-fmaf to the specific subtargets, so I don't think you can actually turn it on and off. It has this nonsensical behavior where, if you disable a subtarget feature that belongs to a processor, it assumes you aren't actually targeting that processor at all. I think a new dedicated feature might be needed.
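
For reference, the selection logic in the quoted hunk boils down to roughly the following. This is a standalone sketch only: it uses std::string instead of Twine, fastFmafFeature is a made-up name, and the body of hasFullSpeedFMAF32 is a guess based on the ">=gfx900" wording in the patch title, not the patch's actual implementation.

```cpp
#include <cstdlib>
#include <string>

// Hypothetical stand-in for the patch's hasFullSpeedFMAF32(). Assumption:
// any "gfxNNN" name with a numeric suffix of 900 or more qualifies.
static bool hasFullSpeedFMAF32(const std::string &CPU) {
  if (CPU.compare(0, 3, "gfx") != 0)
    return false;
  return std::atoi(CPU.c_str() + 3) >= 900;
}

// Hypothetical helper that builds the feature string the hunk pushes onto
// TargetOpts.Features: "+fast-fmaf" only when the processor has full-rate
// f32 FMA and denormals are not being flushed, "-fast-fmaf" otherwise.
static std::string fastFmafFeature(const std::string &CPU, bool FlushDenorm) {
  const bool Enable = hasFullSpeedFMAF32(CPU) && !FlushDenorm;
  return std::string(1, Enable ? '+' : '-') + "fast-fmaf";
}
```

The "-fast-fmaf" result is the case the concern above applies to, since the feature is tied to the subtarget definitions.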


https://reviews.llvm.org/D32084




