[PATCH] D153002: [X86][AMX] set Stride to Tile's Col when doing combine amxcast and store into tilestore
LuoYuanke via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jun 19 02:22:11 PDT 2023
LuoYuanke added inline comments.
================
Comment at: llvm/test/CodeGen/X86/AMX/amx-combine.ll:145
+ %tile = call x86_amx @llvm.x86.tileloadd64.internal(i16 8, i16 32, i8* %src_ptr, i64 64)
+ %vec = call <256 x i8> @llvm.x86.cast.tile.to.vector.v256i8(x86_amx %tile)
+ store <256 x i8> %vec, <256 x i8>* %dst_ptr, align 256
----------------
Could you add comments in the amx-cast definition that the vector size can be smaller than AMX register size (1024 bytes)? I think vector size can NOT be larger than AMX register size, am I right?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D153002/new/
https://reviews.llvm.org/D153002
More information about the llvm-commits
mailing list