For the following code fragment, <br><br>; <label>:27 ; preds = %27, %entry<br> %28 = load volatile i32* inttoptr (i64 2149581832 to i32*), align 8<br> %29 = icmp slt i32 %28, 0<br>
br i1 %29, label %27, label %loop.exit<br><br>loop.exit: ; preds = %27<br><br>llc will generate following MIPS code,<br><br>$BB0_1: <br> lui $3, 32800<br> ori $3, $3, 1032<br>
lw $3, 0($3)<br> bltz $3, $BB0_1<br> nop<br># BB#2:<br><br><br>The two operation lui and ori which are used to calculate memory address actually are loop invariants. They supposed to be moved out of the loop. I thought it might be a limitation of the MIPS backend. Then I tried the ARM backend,<br>
<br> .LBB1_1: <br> ldr r2, .LCPI1_2<br> ldr r2, [r2]<br> cmp r2, #0<br> blt .LBB1_1<br>@ BB#2: <br><br>The first ldr instruction is to load the address from constant pool. It also should be outside the loop.<br>
<br>I'm not sure if this is because of the optimisations are not enough in the common SelectionDAG optimisation phase, or should this kind of optimisation be implemented by the SelectionDAG instruction lowering phase for each target? <br>