[PATCH] D37325: [AMDGPU] Use v_pm_max_f16 for fcanonicalize

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Sep 1 15:59:00 PDT 2017


rampitec added inline comments.


================
Comment at: lib/Target/AMDGPU/SIInstructions.td:1289
+  (fcanonicalize (v2f16 (VOP3PMods v2f16:$src, i32:$src_mods))),
+  (V_PK_MUL_F16 SRCMODS.OP_SEL_1, (i32 CONST.V2FP16_ONE), $src_mods, $src, DSTCLAMP.NONE)
+>;
----------------
arsenm wrote:
> This won't work. For now it's probably easier to just throw an S_MOV_B32 of the constant. This won't encode correctly as a direct immediate because you need to manipulate op_sel
It was here before, this is the old code and the fix does not belong to the current change.


https://reviews.llvm.org/D37325





More information about the llvm-commits mailing list