[PATCH] D124734: [AMDGPU] Fix scalar_to_vector for v8i16/v8f16
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon May 2 11:10:39 PDT 2022
rampitec added inline comments.
================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:2710
+ (v8i16 (scalar_to_vector i16:$src0)),
+ (INSERT_SUBREG (IMPLICIT_DEF), $src0, sub0)
+>;
----------------
arsenm wrote:
> I don’t think these should be legal. We don’t naturally have 8 X 16 operations. A lowering that splits the vector would avoid introducing the wider registers and may combine better
We actually do have these operands:
```
v_smfmac_f32_16x16x32_f16
v_smfmac_f32_32x32x16_f16
v_smfmac_f32_16x16x32_bf16
v_smfmac_f32_32x32x16_bf16
```
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D124734/new/
https://reviews.llvm.org/D124734
More information about the llvm-commits
mailing list