[llvm-dev] Instruction selection problems due to SelectionDAGBuilder
Alex Susu via llvm-dev
llvm-dev at lists.llvm.org
Tue Aug 2 09:00:57 PDT 2016
Hello.
I'm having problems at instruction selection with my back end with the following
basic-block due to a vector add with immediate constant vector (obtained by vectorizing a
simple C program doing vector sum map):
vector.ph: ; preds = %vector.memcheck50
%.splatinsert = insertelement <8 x i64> undef, i64 %i.07.unr, i32 0
%.splat = shufflevector <8 x i64> %.splatinsert, <8 x i64> undef, <8 x i32>
zeroinitializer
%induction = add <8 x i64> %.splat, <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64
6, i64 7>
%.splatinsert56 = insertelement <8 x i64> undef, i64 %xtraiter, i32 0
%.splat57 = shufflevector <8 x i64> %.splatinsert56, <8 x i64> undef, <8 x i32>
zeroinitializer
%induction58 = add <8 x i64> %.splat57, <i64 0, i64 -1, i64 -2, i64 -3, i64 -4, i64
-5, i64 -6, i64 -7>
br label %vector.body25
The exact problem reported is:
Selecting: t51: v8i64,ch = load<LD64[ConstantPool]> t0, ConstantPool:i64<<8 x
i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7>> 0, undef:i64
ISEL: Starting pattern match on root node: t51: v8i64,ch =
load<LD64[ConstantPool]> t0, ConstantPool:i64<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64
4, i64 5, i64 6, i64 7>> 0, undef:i64
Initial Opcode index to 268
Match failed at index 277
Continuing at 396
Match failed at index 398
Continuing at 422
LLVM ERROR: Cannot select: t51: v8i64,ch = load<LD64[ConstantPool]> t0,
ConstantPool:i64<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5, i64 6, i64 7>> 0,
undef:i64
t50: i64 = ConstantPool<<8 x i64> <i64 0, i64 1, i64 2, i64 3, i64 4, i64 5,
i64 6, i64 7>> 0
t48: i64 = undef
In function: foo
The reason is that for the basic-block my back end generates the following Selection DAG:
(From 201_LoopVectorize/25_GOOD_map/NEW/6/1/NEW/STDerr3_wo_getSetCCResultType)
Initial selection DAG: BB#15 'foo:vector.ph'
SelectionDAG has 41 nodes:
t0: ch = EntryToken
t4: i32 = Constant<0>
t3: i64,ch = CopyFromReg t0, Register:i64 %vreg12
t6: v8i64 = insert_vector_elt undef:v8i64, t3, Constant:i64<0>
t7: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t6, undef:v8i64
t15: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<1>,
Constant:i64<2>, Constant:i64<3>, Constant:i64<4>, Constant:i64<5>, Constant:i64<6>,
Constant:i64<7>
t16: v8i64 = add t7, t15
t18: ch = CopyToReg t0, Register:v8i64 %vreg16, t16
t20: i64,ch = CopyFromReg t0, Register:i64 %vreg5
t22: i64 = AssertSext t20, ValueType:ch:i8
t23: v8i64 = insert_vector_elt undef:v8i64, t22, Constant:i64<0>
t24: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t23, undef:v8i64
t32: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<-1>,
Constant:i64<-2>, Constant:i64<-3>, Constant:i64<-4>, Constant:i64<-5>, Constant:i64<-6>,
Constant:i64<-7>
t33: v8i64 = add t24, t32
t35: ch = CopyToReg t0, Register:v8i64 %vreg17, t33
t37: ch = CopyToReg t0, Register:i64 %vreg117, Constant:i64<0>
t39: ch = TokenFactor t18, t35, t37
t40: ch = br t39, BasicBlock:ch<vector.body25 0x1d07660>
However, when using the mips64 back end (subtarget) we get this correct selection DAG:
(From 201_LoopVectorize/25_GOOD_map/NEW/6/1/NEW/Mips64/STDerr_llc_mips64)
Initial selection DAG: BB#15 'foo:vector.ph'
SelectionDAG has 87 nodes:
t0: ch = EntryToken
t4: i32 = Constant<0>
t3: i64,ch = CopyFromReg t0, Register:i64 %vreg12
t6: v8i64 = insert_vector_elt undef:v8i64, t3, Constant:i64<0>
t7: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t6, undef:v8i64
t15: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<1>, Constant:i64<2>,
Constant:i64<3>, Constant:i64<4>, Constant:i64<5>, Constant:i64<6>, Constant:i64<7>
t16: v8i64 = add t7, t15
t43: i64,ch = CopyFromReg t0, Register:i64 %vreg5
t45: i64 = AssertSext t43, ValueType:ch:i8
t46: v8i64 = insert_vector_elt undef:v8i64, t45, Constant:i64<0>
t47: v8i64 = vector_shuffle<0,0,0,0,0,0,0,0> t46, undef:v8i64
t55: v8i64 = BUILD_VECTOR Constant:i64<0>, Constant:i64<-1>,
Constant:i64<-2>, Constant:i64<-3>, Constant:i64<-4>, Constant:i64<-5>, Constant:i64<-6>,
Constant:i64<-7>
t56: v8i64 = add t47, t55
t17: i64 = extract_vector_elt t16, Constant:i64<0>
t26: ch = CopyToReg t0, Register:i64 %vreg16, t17
t18: i64 = extract_vector_elt t16, Constant:i64<1>
t28: ch = CopyToReg t0, Register:i64 %vreg17, t18
t19: i64 = extract_vector_elt t16, Constant:i64<2>
t30: ch = CopyToReg t0, Register:i64 %vreg18, t19
t20: i64 = extract_vector_elt t16, Constant:i64<3>
t32: ch = CopyToReg t0, Register:i64 %vreg19, t20
t21: i64 = extract_vector_elt t16, Constant:i64<4>
t34: ch = CopyToReg t0, Register:i64 %vreg20, t21
t22: i64 = extract_vector_elt t16, Constant:i64<5>
t36: ch = CopyToReg t0, Register:i64 %vreg21, t22
t23: i64 = extract_vector_elt t16, Constant:i64<6>
t38: ch = CopyToReg t0, Register:i64 %vreg22, t23
t24: i64 = extract_vector_elt t16, Constant:i64<7>
t40: ch = CopyToReg t0, Register:i64 %vreg23, t24
t41: ch = TokenFactor t26, t28, t30, t32, t34, t36, t38, t40
t57: i64 = extract_vector_elt t56, Constant:i64<0>
t66: ch = CopyToReg t0, Register:i64 %vreg24, t57
t58: i64 = extract_vector_elt t56, Constant:i64<1>
t68: ch = CopyToReg t0, Register:i64 %vreg25, t58
t59: i64 = extract_vector_elt t56, Constant:i64<2>
t70: ch = CopyToReg t0, Register:i64 %vreg26, t59
t60: i64 = extract_vector_elt t56, Constant:i64<3>
t72: ch = CopyToReg t0, Register:i64 %vreg27, t60
t61: i64 = extract_vector_elt t56, Constant:i64<4>
t74: ch = CopyToReg t0, Register:i64 %vreg28, t61
t62: i64 = extract_vector_elt t56, Constant:i64<5>
t76: ch = CopyToReg t0, Register:i64 %vreg29, t62
t63: i64 = extract_vector_elt t56, Constant:i64<6>
t78: ch = CopyToReg t0, Register:i64 %vreg30, t63
t64: i64 = extract_vector_elt t56, Constant:i64<7>
t80: ch = CopyToReg t0, Register:i64 %vreg31, t64
t81: ch = TokenFactor t66, t68, t70, t72, t74, t76, t78, t80
t83: ch = CopyToReg t0, Register:i64 %vreg209, Constant:i64<0>
t85: ch = TokenFactor t41, t81, t83
t86: ch = br t85, BasicBlock:ch<vector.body25 0x1bd35f0>
I am curious what is wrong - I've tried to match the Mips' back end: I have put most
of the vector splat instructions and the vextract and INSERT_D_DESC instruction, etc .
I also don't get enough DEBUG information to understand where exactly the problem
comes from (probably I missed some TableGen record).
Please let me know if you have any idea.
Thank you very much,
Alex
More information about the llvm-dev
mailing list