[PATCH] D69044: [X86] Allow up to 4 loads per inline memcmp()

David Zarzycki via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sat Oct 19 08:23:01 PDT 2019


davezarzycki added a comment.

The staging sounds fine. As an aside, this patch exposes what looks like a bug in EVEX address/displacement generation. The current output is:

  length256_eq:
      vmovdqu64 -128(%rdi), %zmm0
      vmovdqu64 -64(%rdi), %zmm1
      vmovdqu64 (%rdi), %zmm2
      vmovdqu64 64(%rdi), %zmm3

Every displacement above is 128 bytes lower than it should be. The code should be:

  length256_eq:
      vmovdqu64 (%rdi), %zmm0
      vmovdqu64 64(%rdi), %zmm1
      vmovdqu64 128(%rdi), %zmm2
      vmovdqu64 192(%rdi), %zmm3
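
For reference, here is a minimal sketch of the arithmetic behind the expected output, not the actual LLVM code. The helper names `expectedOffsets` and `compressDisp8` are hypothetical. It assumes the 256-byte compare is split into four 64-byte loads, and that the EVEX compressed displacement (disp8*N) uses N = 64 for a full 512-bit vmovdqu64 load, so a displacement is stored as a signed 8-bit multiple of 64.

```cpp
#include <cassert>
#include <cstdint>
#include <optional>
#include <vector>

// Hypothetical helper: byte offsets of the loads for an inline compare of
// `size` bytes done with `loadBytes`-wide loads, starting at offset 0.
std::vector<int64_t> expectedOffsets(int64_t size, int64_t loadBytes) {
  assert(loadBytes > 0);
  std::vector<int64_t> offsets;
  for (int64_t off = 0; off + loadBytes <= size; off += loadBytes)
    offsets.push_back(off);
  return offsets;
}

// Hypothetical helper: EVEX disp8*N compression. Returns the scaled 8-bit
// displacement if `disp` is a multiple of `n` and the quotient fits in a
// signed byte; otherwise returns nullopt (a 32-bit displacement is needed).
std::optional<int8_t> compressDisp8(int64_t disp, int64_t n) {
  assert(n > 0);
  if (disp % n != 0)
    return std::nullopt;
  int64_t scaled = disp / n;
  if (scaled < -128 || scaled > 127)
    return std::nullopt;
  return static_cast<int8_t>(scaled);
}
```

Under these assumptions, `expectedOffsets(256, 64)` yields 0, 64, 128, 192, and `compressDisp8` maps those to scaled bytes 0, 1, 2, 3. The displacements in the bad output (-128, -64, 0, 64) would correspond to scaled bytes -2, -1, 0, 1. This only restates the symptom; it does not identify where the offset goes wrong.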

Any tips on how to debug this?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D69044/new/

https://reviews.llvm.org/D69044




