[PATCH] D133723: [AMDGPU][GFX11] Use VGPR_32_F128 for VOP1,2,C

Dmitry Preobrazhensky via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Sep 14 11:20:50 PDT 2022


dp added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:928
+    (f32 (f16_to_fp i32:$src0)),
+    (cvt32to16_e64 SRCMODS.NONE, $src0)
+  >;
----------------
The names `cvt16to32_e64` and `cvt32to16_e64` are misleading. They should be the other way around.


================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:987
+let SubtargetPredicate = HasTrue16BitInsts in
+defm : f16_fp_Pats<V_CVT_F16_F32_T16_e64, V_CVT_F32_F16_T16_e64>;
 
----------------
Why are these patterns special? `_E64` variants have no VGPR limitations. Is this required for future changes?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D133723/new/

https://reviews.llvm.org/D133723



More information about the llvm-commits mailing list