[PATCH] D128975: [AArc64] Add support for insert/extract for nxv1i1 types.
David Sherwood via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 4 07:13:54 PDT 2022
david-arm added inline comments.
================
Comment at: llvm/test/CodeGen/AArch64/sve-insert-vector.ll:992
+; CHECK-NEXT: punpklo p2.h, p0.b
+; CHECK-NEXT: str p4, [sp, #7, mul vl] // 2-byte Folded Spill
+; CHECK-NEXT: punpklo p3.h, p2.b
----------------
david-arm wrote:
> I don't think this is a problem with your patch, but the spill and fill in this output looks unnecessary. We have enough registers to support this without spilling I think?
>
> Also, something weird seems to be happening with the offset for the spill/fill, i.e. "#7, mul vl". I assume that translates to offset = 7 x vscale x 2? It seems to fit into the stack space we've allocated, but I wonder if this is just pure luck?
Ah, perhaps we're actually just storing it into the top of the temporary stack space we've allocated, i.e. the top part of a "vscale x 16" byte object.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D128975/new/
https://reviews.llvm.org/D128975
More information about the llvm-commits
mailing list