[PATCH] D124734: [AMDGPU] Fix scalar_to_vector for v8i16/v8f16

Stanislav Mekhanoshin via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon May 2 11:10:39 PDT 2022


rampitec added inline comments.


================
Comment at: llvm/lib/Target/AMDGPU/SIInstructions.td:2710
+  (v8i16 (scalar_to_vector i16:$src0)),
+  (INSERT_SUBREG (IMPLICIT_DEF), $src0, sub0)
+>;
----------------
arsenm wrote:
> I don’t think these should be legal. We don’t naturally have 8 X 16 operations. A lowering that splits the vector would avoid introducing the wider registers and may combine better 
We actually do have these operands:
```
v_smfmac_f32_16x16x32_f16
v_smfmac_f32_32x32x16_f16
v_smfmac_f32_16x16x32_bf16
v_smfmac_f32_32x32x16_bf16
```


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D124734/new/

https://reviews.llvm.org/D124734



More information about the llvm-commits mailing list