[PATCH] D116166: [AArch64] Add a tablegen pattern for RADDHN/RADDHN2
Dave Green via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Dec 26 03:24:57 PST 2021
dmgreen added a comment.
Hmm. Are you sure this one is a great idea? The "Setting up the zero register adds no latency" won't be true on any in-order cpu, and still has some frontend cost on an out of order cpu. The codesize will be larger in any case, so this probably shouldn't be done at -Os/-Oz.
The idea with these kind of transforms is that they are OK to do so long as they make some cpu better without making anything else worse. This is intrinsic only, but it may be best to only do it for specific cpus when not under minsize. Or do it at a different point where we know the movi can be pulled out of a loop. (If we really want to do it at all and not just leave it to the programmer if they need it).
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D116166/new/
https://reviews.llvm.org/D116166
More information about the llvm-commits
mailing list