[PATCH] D93594: [X86] Pass to transform amx intrinsics to scalar operation.
Bing Yu via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Fri Feb 19 21:46:36 PST 2021
yubing added inline comments.
================
Comment at: llvm/lib/Target/X86/X86LowerAMXIntrinsics.cpp:211-212
+ IRBuilderBase &B, DomTreeUpdater &DTU,
+ LoopInfo &LI, Value *Row, Value *Col,
+ Value *K, Value *Acc, Value *LHS,
+ Value *RHS) {
----------------
xiangzhangllvm wrote:
> In fact, no need handle Row, Col, K here, just use fix size 16x16, the result of calculation is some in effective area. (just need tileload "keep" the "unused" area is 0).
> Then can use vector to handle all of the them, let type legalization to split the type.
We should keep the code here. In bf16, since +0.0(0x0000) * negative float is equal to -0.0(0x8000), following your solution is not able to ensure outer edge is allzero.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D93594/new/
https://reviews.llvm.org/D93594
More information about the cfe-commits
mailing list