[PATCH] [X86][SSE] Missing SSE/AVX1 memory folding integer instructions

Quentin Colombet qcolombet at apple.com
Wed Jan 21 13:46:19 PST 2015


REPOSITORY
  rL LLVM

================
Comment at: test/CodeGen/X86/psubus.ll:29
@@ -28,3 +28,3 @@
 ; SSE2: @test1
-; SSE2: psubusw LCPI0_0(%rip), %xmm0
+; SSE2: psubusw %xmm0, %xmm1
 
----------------
RKSimon wrote:
> qcolombet wrote:
> > Why is this not folded anymore?
> Oddly the addition of the folding patterns has allowed the load of the constant (to %xmm0) to be pulled out of the loop.
Thanks for checking.
Could we get rid of the loop to have a test on the folding?

http://reviews.llvm.org/D7094

EMAIL PREFERENCES
  http://reviews.llvm.org/settings/panel/emailpreferences/






More information about the llvm-commits mailing list