[PATCH] Code Generator for Gather and Scatter Intrinsics.

Ahmed Bougacha ahmed.bougacha at gmail.com
Thu Apr 30 07:45:06 PDT 2015


================
Comment at: test/CodeGen/X86/masked_memop.ll:10-11
@@ -9,4 +9,4 @@
 ; AVX2-LABEL: test1
-; AVX2: vpmaskmovd      32(%rdi)
-; AVX2: vpmaskmovd      (%rdi)
+; AVX2: vpmaskmovd      {{.*}}(%rdi)
+; AVX2: vpmaskmovd      {{.*}}(%rdi)
 ; AVX2-NOT: blend
----------------
rengolin wrote:
> ab wrote:
> > This is surprising, do we know why this happens?
> I'm guessing the order has changed. Using AVX2-DAG on both and keeping the 32 as before should work, too, and be more explicit.
> 
>     ; AVX2-DAG: vpmaskmovd      32(%rdi)
>     ; AVX2-DAG: vpmaskmovd      (%rdi)
> 
> Then, the 32 can come in any order.
Right, -DAG would work.  What I'm saying is: the order change is a symptom - innocuous, yes -  of something else - probably innocuous as well.  We're just legalizing entirely new nodes and intrinsics.  Why would any of the existing codegen ever change?

Elena, if you could spare some time to investigate this, it'd be much appreciated!

http://reviews.llvm.org/D7665

EMAIL PREFERENCES
  http://reviews.llvm.org/settings/panel/emailpreferences/






More information about the llvm-commits mailing list