[llvm-dev] Instruction selection problems due to SelectionDAGBuilder

Alex Susu via llvm-dev llvm-dev at lists.llvm.org
Tue Aug 2 09:00:57 PDT 2016


   Hello.
     I'm having problems at instruction selection with my back end with the following 
basic-block due to a vector add with immediate constant vector (obtained by vectorizing a 
simple C program doing vector sum map):
     vector.ph:                                        ; preds = %vector.memcheck50
       %.splatinsert = insertelement <8 x i64> undef, i64 %i.07.unr, i32 0
       %.splat = shufflevector <8 x i64> %.splatinsert, <8 x i64> undef, <8 x i32> 
zeroinitializer
       %induction = add <8 x i64> %.splat, <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 
6, i64 7>
       %.splatinsert56 = insertelement <8 x i64> undef, i64 %xtraiter, i32 0
       %.splat57 = shufflevector <8 x i64> %.splatinsert56, <8 x i64> undef, <8 x i32> 
zeroinitializer
       %induction58 = add <8 x i64> %.splat57, <i64 0, i64 -1, i64 -2, i64 -3, i64 -4, i64 
-5, i64 -6, i64 -7>
       br label %vector.body25

     The exact problem reported is:
         Selecting: t51: v8i64,ch = load<LD64[ConstantPool]> t0, ConstantPool:i64<<8 x 
i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7>> 0, undef:i64
         ISEL: Starting pattern match on root node: t51: v8i64,ch = 
load<LD64[ConstantPool]> t0, ConstantPool:i64<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64 
4, i64 5, i64 6, i64 7>> 0, undef:i64
           Initial Opcode index to 268
           Match failed at index 277
           Continuing at 396
           Match failed at index 398
           Continuing at 422
         LLVM ERROR: Cannot select: t51: v8i64,ch = load<LD64[ConstantPool]> t0, 
ConstantPool:i64<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7>> 0, 
undef:i64
           t50: i64 = ConstantPool<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, 
i64 6, i64 7>> 0
           t48: i64 = undef
         In function: foo

     The reason is that for the basic-block my back end generates the following Selection DAG:
         (From 201_LoopVectorize/25_GOOD_map/NEW/6/1/NEW/STDerr3_wo_getSetCCResultType)
         Initial selection DAG: BB#15 'foo:vector.ph'
         SelectionDAG has 41 nodes:
           t0: ch = EntryToken
           t4: i32 = Constant<0>
                       t3: i64,ch = CopyFromReg t0, Register:i64 %vreg12
                     t6: v8i64 = insert_vector_elt undef:v8i64, t3, Constant:i64<0>
                   t7: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t6, undef:v8i64
                   t15: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<1>, 
Constant:i64<2>, Constant:i64<3>, Constant:i64<4>, Constant:i64<5>, Constant:i64<6>, 
Constant:i64<7>
                 t16: v8i64 = add t7, t15
               t18: ch = CopyToReg t0, Register:v8i64 %vreg16, t16
                         t20: i64,ch = CopyFromReg t0, Register:i64 %vreg5
                       t22: i64 = AssertSext t20, ValueType:ch:i8
                     t23: v8i64 = insert_vector_elt undef:v8i64, t22, Constant:i64<0>
                   t24: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t23, undef:v8i64
                   t32: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<-1>, 
Constant:i64<-2>, Constant:i64<-3>, Constant:i64<-4>, Constant:i64<-5>, Constant:i64<-6>, 
Constant:i64<-7>
                 t33: v8i64 = add t24, t32
               t35: ch = CopyToReg t0, Register:v8i64 %vreg17, t33
               t37: ch = CopyToReg t0, Register:i64 %vreg117, Constant:i64<0>
             t39: ch = TokenFactor t18, t35, t37
           t40: ch = br t39, BasicBlock:ch<vector.body25 0x1d07660>

     However, when using the mips64 back end (subtarget) we get this correct selection DAG:
         (From 201_LoopVectorize/25_GOOD_map/NEW/6/1/NEW/Mips64/STDerr_llc_mips64)
         Initial selection DAG: BB#15 'foo:vector.ph'
         SelectionDAG has 87 nodes:
           t0: ch = EntryToken
           t4: i32 = Constant<0>
                 t3: i64,ch = CopyFromReg t0, Register:i64 %vreg12
               t6: v8i64 = insert_vector_elt undef:v8i64, t3, Constant:i64<0>
             t7: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t6, undef:v8i64
             t15: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<1>, Constant:i64<2>, 
Constant:i64<3>, Constant:i64<4>, Constant:i64<5>, Constant:i64<6>, Constant:i64<7>
           t16: v8i64 = add t7, t15
                   t43: i64,ch = CopyFromReg t0, Register:i64 %vreg5
                 t45: i64 = AssertSext t43, ValueType:ch:i8
               t46: v8i64 = insert_vector_elt undef:v8i64, t45, Constant:i64<0>
             t47: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t46, undef:v8i64
             t55: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<-1>, 
Constant:i64<-2>, Constant:i64<-3>, Constant:i64<-4>, Constant:i64<-5>, Constant:i64<-6>, 
Constant:i64<-7>
           t56: v8i64 = add t47, t55
                   t17: i64 = extract_vector_elt t16, Constant:i64<0>
                 t26: ch = CopyToReg t0, Register:i64 %vreg16, t17
                   t18: i64 = extract_vector_elt t16, Constant:i64<1>
                 t28: ch = CopyToReg t0, Register:i64 %vreg17, t18
                   t19: i64 = extract_vector_elt t16, Constant:i64<2>
                 t30: ch = CopyToReg t0, Register:i64 %vreg18, t19
                   t20: i64 = extract_vector_elt t16, Constant:i64<3>
                 t32: ch = CopyToReg t0, Register:i64 %vreg19, t20
                   t21: i64 = extract_vector_elt t16, Constant:i64<4>
                 t34: ch = CopyToReg t0, Register:i64 %vreg20, t21
                   t22: i64 = extract_vector_elt t16, Constant:i64<5>
                 t36: ch = CopyToReg t0, Register:i64 %vreg21, t22
                   t23: i64 = extract_vector_elt t16, Constant:i64<6>
                 t38: ch = CopyToReg t0, Register:i64 %vreg22, t23
                   t24: i64 = extract_vector_elt t16, Constant:i64<7>
                 t40: ch = CopyToReg t0, Register:i64 %vreg23, t24
               t41: ch = TokenFactor t26, t28, t30, t32, t34, t36, t38, t40
                   t57: i64 = extract_vector_elt t56, Constant:i64<0>
                 t66: ch = CopyToReg t0, Register:i64 %vreg24, t57
                   t58: i64 = extract_vector_elt t56, Constant:i64<1>
                 t68: ch = CopyToReg t0, Register:i64 %vreg25, t58
                   t59: i64 = extract_vector_elt t56, Constant:i64<2>
                 t70: ch = CopyToReg t0, Register:i64 %vreg26, t59
                   t60: i64 = extract_vector_elt t56, Constant:i64<3>
                 t72: ch = CopyToReg t0, Register:i64 %vreg27, t60
                   t61: i64 = extract_vector_elt t56, Constant:i64<4>
                 t74: ch = CopyToReg t0, Register:i64 %vreg28, t61
                   t62: i64 = extract_vector_elt t56, Constant:i64<5>
                 t76: ch = CopyToReg t0, Register:i64 %vreg29, t62
                   t63: i64 = extract_vector_elt t56, Constant:i64<6>
                 t78: ch = CopyToReg t0, Register:i64 %vreg30, t63
                   t64: i64 = extract_vector_elt t56, Constant:i64<7>
                 t80: ch = CopyToReg t0, Register:i64 %vreg31, t64
               t81: ch = TokenFactor t66, t68, t70, t72, t74, t76, t78, t80
               t83: ch = CopyToReg t0, Register:i64 %vreg209, Constant:i64<0>
             t85: ch = TokenFactor t41, t81, t83
           t86: ch = br t85, BasicBlock:ch<vector.body25 0x1bd35f0>

     I am curious what is wrong - I've tried to match the Mips' back end: I have put most 
of the vector splat instructions and the vextract and INSERT_D_DESC instruction, etc .
     I also don't get enough DEBUG information to understand where exactly the problem 
comes from (probably I missed some TableGen record).

     Please let me know if you have any idea.

   Thank you very much,
     Alex


More information about the llvm-dev mailing list