[PATCH] D144086: [AArch64] Load into zero vector patterns
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Mar 8 04:35:43 PST 2023
dmgreen added a comment.
Hi - thanks for the report. It sounds like the offset might be wrong from the look at the assembly. This instructions specifically:
10534: 40 f4 7f 3d ldr b0, [x2, #4093]
vs
10534: 46 0c 00 d1 sub x6, x2, #3
1053c: c0 00 40 0d ld1 { v0.b }[0], [x6]
I think I see the problem - It looks like it should be using an LDUR for those instructions. When printing assembly it will produce an `ldr b0, [x2, -3]` instruction, but emitting obj files gives the large positive offset. I will put a fix in for that issue now.
I've been unable to produce the same output from https://gist.github.com/bjacob/2ed1bce14ae4d67b4261adee70089e29 though - I probably don't know the right set of commands and was just using mlir-translate to convert the file to llvm-ir. Do you know what commands are needed to compile it to assembly?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D144086/new/
https://reviews.llvm.org/D144086
More information about the llvm-commits
mailing list