[llvm] r274843 - Code size optimisation: don't expand a div to a mul and and a shift sequence.
Sjoerd Meijer via llvm-commits
llvm-commits at lists.llvm.org
Fri Jul 8 05:54:44 PDT 2016
Author: sjoerdmeijer
Date: Fri Jul 8 07:54:43 2016
New Revision: 274843
URL: http://llvm.org/viewvc/llvm-project?rev=274843&view=rev
Log:
Code size optimisation: don't expand a div to a mul and and a shift sequence.
As a result, the urem instruction will not be expanded to a sequence of umull,
lsrs, muls and sub instructions, but just a call to __aeabi_uidivmod.
Differential Revision: http://reviews.llvm.org/D22131
Added:
llvm/trunk/test/CodeGen/ARM/urem-opt-size.ll
Modified:
llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
Modified: llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp?rev=274843&r1=274842&r2=274843&view=diff
==============================================================================
--- llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp (original)
+++ llvm/trunk/lib/CodeGen/SelectionDAG/DAGCombiner.cpp Fri Jul 8 07:54:43 2016
@@ -14516,6 +14516,11 @@ SDValue DAGCombiner::BuildSDIVPow2(SDNod
/// number.
/// Ref: "Hacker's Delight" or "The PowerPC Compiler Writer's Guide".
SDValue DAGCombiner::BuildUDIV(SDNode *N) {
+ // when optimising for size, we don't want to expand a div to a mul and
+ // and a shift.
+ if (ForCodeSize)
+ return SDValue();
+
ConstantSDNode *C = isConstOrConstSplat(N->getOperand(1));
if (!C)
return SDValue();
Added: llvm/trunk/test/CodeGen/ARM/urem-opt-size.ll
URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/ARM/urem-opt-size.ll?rev=274843&view=auto
==============================================================================
--- llvm/trunk/test/CodeGen/ARM/urem-opt-size.ll (added)
+++ llvm/trunk/test/CodeGen/ARM/urem-opt-size.ll Fri Jul 8 07:54:43 2016
@@ -0,0 +1,25 @@
+; When optimising for size, we don't want to expand a div to a mul and
+; and a shift sequence. As a result, the urem instruction will not be
+; expanded to a sequence of umull, lsrs, muls and sub instructions, but
+; just a call to __aeabi_uidivmod.
+;
+; RUN: llc -mtriple=armv7a-eabi -mattr=-neon -verify-machineinstrs %s -o - | FileCheck %s
+
+target datalayout = "e-m:e-p:32:32-i64:64-v128:64:128-a:0:32-n32-S64"
+target triple = "thumbv7m-arm-none-eabi"
+
+define i32 @foo() local_unnamed_addr #0 {
+entry:
+; CHECK-LABEL: foo:
+; CHECK: __aeabi_uidivmod
+; CHECK-NOT: umull
+ %call = tail call i32 bitcast (i32 (...)* @GetValue to i32 ()*)()
+ %rem = urem i32 %call, 1000000
+ %cmp = icmp eq i32 %rem, 0
+ %conv = zext i1 %cmp to i32
+ ret i32 %conv
+}
+
+declare i32 @GetValue(...) local_unnamed_addr
+
+attributes #0 = { minsize nounwind optsize }
More information about the llvm-commits
mailing list