[Mlir-commits] [mlir] [mlir][AMDGPU] Add gfx950 MFMAs to the amdgpu.mfma op (PR #133553)
Krzysztof Drewniak
llvmlistbot at llvm.org
Mon Mar 31 12:44:53 PDT 2025
================
@@ -650,10 +650,12 @@ def AMDGPU_MFMAPermBAttr : EnumAttr<AMDGPU_Dialect, AMDGPU_MFMAPermB,
// mfma
def MFMAInTypes : AnyTypeOf<[F32, F64, I32, I64,
VectorOfLengthAndType<[2], [F32]>,
- VectorOfLengthAndType<[4], [F16]>,
- VectorOfLengthAndType<[2, 4], [BF16]>,
- VectorOfLengthAndType<[4, 8], [I8]>,
- VectorOfLengthAndType<[8], [F8E5M2FNUZ, F8E4M3FNUZ, F8E5M2, F8E4M3FN]>]>;
+ VectorOfLengthAndType<[4, 8], [F16]>,
+ VectorOfLengthAndType<[2, 4, 8], [BF16]>,
----------------
krzysz00 wrote:
The new instructions double the width of the fp16 and fp8 inputs. Note that `(32 * 16) / 64` is 8: a 32x16 operand tile split across 64 lanes gives 8 elements per lane.
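
To spell out that arithmetic, here is a minimal Python sketch. It is an illustration only: `elems_per_lane` and `WAVE_SIZE` are hypothetical names, not part of MLIR, and it assumes each operand tile is divided evenly across a 64-lane wavefront. The 32x8 and 16x16 tiles used for comparison are assumed shapes for the older, narrower fp16 variants.

```python
# Assumption: a 64-lane wavefront, with each operand tile split evenly
# across the lanes. These names are illustrative, not an MLIR API.
WAVE_SIZE = 64


def elems_per_lane(rows: int, k: int, wave_size: int = WAVE_SIZE) -> int:
    """Return how many elements of a rows x k operand tile each lane holds."""
    total = rows * k
    if total % wave_size != 0:
        raise ValueError(f"{rows}x{k} tile does not divide evenly over {wave_size} lanes")
    return total // wave_size


if __name__ == "__main__":
    # The case from the comment: (32 * 16) / 64
    print(elems_per_lane(32, 16))
    # Assumed narrower tile with half the K dimension, for comparison
    print(elems_per_lane(32, 8))
```

Under these assumptions the 32x16 tile gives 8 elements per lane and the 32x8 tile gives 4, which matches the diff widening the allowed F16 vector lengths from `[4]` to `[4, 8]`.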
https://github.com/llvm/llvm-project/pull/133553