[PATCH] D93594: [X86] Pass to transform amx intrinsics to scalar operation.
Bing Yu via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Sun Feb 28 21:09:12 PST 2021
yubing added inline comments.
================
Comment at: llvm/lib/Target/X86/X86LowerAMXIntrinsics.cpp:99
+ Loop *RowLoop = LI.AllocateLoop();
+ Loop *ColLoop = LI.AllocateLoop();
+ RowLoop->addChildLoop(ColLoop);
----------------
pengfei wrote:
> Not sure about the arithmetic intrinsics, but at least for the load and store intrinsics we can use the LLVM intrinsics `llvm.masked.load/store` to eliminate the inner loop.
I think we can put together a follow-up patch for this optimization.
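To make the trade-off concrete, here is a minimal standalone C++ sketch of the two shapes being discussed. It is not the IR the patch emits: `Tile`, `loadTileScalar`, `loadTileRowMasked`, and the 16 x 64-byte tile bounds are hypothetical names and assumptions chosen for illustration. The first function mirrors the row loop plus column loop structure; the second keeps only the row loop and stands in for one masked vector load per row, with inactive lanes left at zero.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical tile model (assumption): at most 16 rows of 64 bytes each.
constexpr std::size_t kMaxRows = 16;
constexpr std::size_t kMaxColBytes = 64;

struct Tile {
  std::uint8_t data[kMaxRows][kMaxColBytes] = {};
};

// Shape 1: nested row loop and column loop, one element per iteration.
Tile loadTileScalar(const std::uint8_t *base, std::size_t stride,
                    std::size_t rows, std::size_t colBytes) {
  assert(rows <= kMaxRows && colBytes <= kMaxColBytes);
  Tile t;
  for (std::size_t r = 0; r < rows; ++r)
    for (std::size_t c = 0; c < colBytes; ++c)
      t.data[r][c] = base[r * stride + c];
  return t;
}

// Shape 2: row loop only. The memcpy of colBytes stands in for a single
// masked load whose first colBytes lanes are active; the remaining lanes
// are never read from memory and keep the zero pass-through value.
Tile loadTileRowMasked(const std::uint8_t *base, std::size_t stride,
                       std::size_t rows, std::size_t colBytes) {
  assert(rows <= kMaxRows && colBytes <= kMaxColBytes);
  Tile t;
  for (std::size_t r = 0; r < rows; ++r)
    std::memcpy(t.data[r], base + r * stride, colBytes);
  return t;
}
```

Both functions should produce the same tile contents for the same inputs, which is the property a follow-up patch would need to preserve when dropping the inner loop.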
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D93594/new/
https://reviews.llvm.org/D93594