[PATCH] D46954: [X86][SSE] Support v4i32 rotations (PR37426)

Simon Pilgrim via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed May 16 08:36:50 PDT 2018


RKSimon added inline comments.


================
Comment at: test/CodeGen/X86/vector-rotate-128.ll:1599
+; AVX1-NEXT:    vpmuludq %xmm2, %xmm3, %xmm2
+; AVX1-NEXT:    vpmuludq %xmm1, %xmm0, %xmm0
+; AVX1-NEXT:    vpshufd {{.*#+}} xmm1 = xmm0[1,1,3,3]
----------------
It might be better to take the cost of 2 loads to remove the 2xPSHUFD and fold directly into the PMULUDQs? There is a small increase in codesize.


Repository:
  rL LLVM

https://reviews.llvm.org/D46954





More information about the llvm-commits mailing list