[PATCH] D116166: [AArch64] Add a tablegen pattern for RADDHN/RADDHN2

Dave Green via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Dec 26 03:24:57 PST 2021


dmgreen added a comment.

Hmm. Are you sure this one is a great idea? The "Setting up the zero register adds no latency" won't be true on any in-order cpu, and still has some frontend cost on an out of order cpu. The codesize will be larger in any case, so this probably shouldn't be done at -Os/-Oz.

The idea with these kind of transforms is that they are OK to do so long as they make some cpu better without making anything else worse. This is intrinsic only, but it may be best to only do it for specific cpus when not under minsize. Or do it at a different point where we know the movi can be pulled out of a loop. (If we really want to do it at all and not just leave it to the programmer if they need it).


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D116166/new/

https://reviews.llvm.org/D116166



More information about the llvm-commits mailing list