[PATCH] D133723: [AMDGPU][GFX11] Use VGPR_32_F128 for VOP1,2,C
Dmitry Preobrazhensky via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Sep 14 11:20:50 PDT 2022
dp added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:928
+ (f32 (f16_to_fp i32:$src0)),
+ (cvt32to16_e64 SRCMODS.NONE, $src0)
+ >;
----------------
The names `cvt16to32_e64` and `cvt32to16_e64` are misleading. They should be the other way around.
================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:987
+let SubtargetPredicate = HasTrue16BitInsts in
+defm : f16_fp_Pats<V_CVT_F16_F32_T16_e64, V_CVT_F32_F16_T16_e64>;
----------------
Why are these patterns special? `_E64` variants have no VGPR limitations. Is this required for future changes?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D133723/new/
https://reviews.llvm.org/D133723
More information about the llvm-commits
mailing list