[PATCH] D33938: [x86] use vperm2f128 rather than vinsertf128 when there's a chance to fold a 32-byte load
Sanjay Patel via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 6 07:24:42 PDT 2017
spatel added inline comments.
================
Comment at: test/CodeGen/X86/avx-vperm2x128.ll:55
+; ALL: ## BB#0: ## %entry
+; ALL-NEXT: vperm2f128 {{.*#+}} ymm0 = mem[0,1,0,1]
+; ALL-NEXT: retq
----------------
RKSimon wrote:
> I wonder what's preventing this from using VBROADCASTF128 ?
I think it's just that we don't have the code to do the load shrinking + address offset. Ie, this is a 32-byte load even though we're only using half of it.
https://reviews.llvm.org/D33938
More information about the llvm-commits
mailing list