[PATCH] D69044: [X86] Allow up to 4 loads per inline memcmp()
David Zarzycki via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sat Oct 19 08:23:01 PDT 2019
davezarzycki added a comment.
The staging sounds fine. As an aside, this patch is exposing a bug in EVEX address/displacement generation:
length256_eq:
  vmovdqu64 -128(%rdi), %zmm0
  vmovdqu64 -64(%rdi), %zmm1
  vmovdqu64 (%rdi), %zmm2
  vmovdqu64 64(%rdi), %zmm3
The above code should be:
length256_eq:
  vmovdqu64 (%rdi), %zmm0
  vmovdqu64 64(%rdi), %zmm1
  vmovdqu64 128(%rdi), %zmm2
  vmovdqu64 192(%rdi), %zmm3
Any tips on how to debug this?
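For reference while debugging, here is a small sketch of the arithmetic involved. It is not a diagnosis. EVEX encodings can compress an 8-bit displacement as disp8*N, and the sketch assumes N = 64 for a full 64-byte zmm load such as vmovdqu64 without broadcast. The helper name compress_disp8 is hypothetical and is not an LLVM function. The sketch checks how the expected and observed displacements above would scale, and how far apart they are.

```python
# Sketch of EVEX disp8*N compression arithmetic (assumption: N = 64 for a
# full 64-byte zmm load). compress_disp8 is a hypothetical helper, not LLVM code.

def compress_disp8(disp, n):
    """Return the scaled disp8 value if disp fits disp8*N, else None."""
    if disp % n != 0:
        return None  # not a multiple of N, so disp32 would be needed
    scaled = disp // n
    if -128 <= scaled <= 127:
        return scaled
    return None  # out of signed 8-bit range after scaling

N = 64
expected = [0, 64, 128, 192]    # displacements in the intended code
observed = [-128, -64, 0, 64]   # displacements in the emitted code

# Difference between intended and emitted displacement for each load.
deltas = [e - o for e, o in zip(expected, observed)]

# Scaled disp8 values each displacement would compress to.
scaled_expected = [compress_disp8(d, N) for d in expected]
scaled_observed = [compress_disp8(d, N) for d in observed]

print(deltas)
print(scaled_expected)
print(scaled_observed)
```

Every load is off by the same 128 bytes, which is 2 in scaled units, so the error looks like a constant shift rather than a per-load miscalculation.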
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D69044/new/
https://reviews.llvm.org/D69044