[PATCH] D72919: [AArch64] Add custom store lowering for 256 bit non-temporal stores.
Florian Hahn via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Jan 19 20:27:48 PST 2020
fhahn added inline comments.
================
Comment at: llvm/lib/Target/AArch64/AArch64InstrInfo.td:2727
+def : Pat<(AArch64stnp FPR128:$Rt, FPR128:$Rt2, (am_indexed7s64 GPR64sp:$Rn, simm7s8:$offset)),
+ (STNPQi FPR128:$Rt, FPR128:$Rt2, GPR64sp:$Rn, simm7s8:$offset)>;
----------------
dmgreen wrote:
> Should this be simm7s16? Same for am_indexed7s128
Yes! I've updated that and now the offsets should be correct.
================
Comment at: llvm/test/CodeGen/AArch64/nontemporal.ll:361
+; CHECK-NEXT: stnp d1, d2, [x0, #16]
+; CHECK-NEXT: stnp d0, d3, [x0]
+; CHECK-NEXT: ret
----------------
dmgreen wrote:
> Should halves work too?
Yes, I've added the missing handler for v16f16.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D72919/new/
https://reviews.llvm.org/D72919
More information about the llvm-commits
mailing list