[PATCH] D153002: [X86][AMX] set Stride to Tile's Col when doing combine amxcast and store into tilestore

LuoYuanke via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Jun 19 02:22:11 PDT 2023


LuoYuanke added inline comments.


================
Comment at: llvm/test/CodeGen/X86/AMX/amx-combine.ll:145
+  %tile = call x86_amx @llvm.x86.tileloadd64.internal(i16 8, i16 32, i8* %src_ptr, i64 64)
+  %vec = call <256 x i8> @llvm.x86.cast.tile.to.vector.v256i8(x86_amx %tile)
+  store <256 x i8> %vec, <256 x i8>* %dst_ptr, align 256
----------------
Could you add comments in the amx-cast definition that the vector size can be smaller than AMX register size (1024 bytes)? I think vector size can NOT be larger than AMX register size, am I right?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D153002/new/

https://reviews.llvm.org/D153002



More information about the llvm-commits mailing list