[llvm-commits] [llvm] r151429 - in /llvm/trunk: include/llvm/Transforms/IPO.h include/llvm/Transforms/IPO/InlinerPass.h include/llvm/Transforms/Utils/Cloning.h lib/Transforms/IPO/InlineAlways.cpp lib/Transforms/IPO/InlineSimple.cpp lib/Transforms/IPO/Inliner.cpp lib/Transforms/Utils/InlineFunction.cpp
Duncan Sands
baldrick at free.fr
Mon Feb 27 00:08:25 PST 2012
Hi Chad,
>>>>> Add support for disabling llvm.lifetime intrinsics in the AlwaysInliner. These
>>>>> are optimization hints, but at -O0 we're not optimizing. This becomes a problem
>>>>> when the alwaysinline attribute is abused.
>>>>> rdar://10921594
>>>>
>>>> can you please explain some more for those of us who can't access rdar. Why is
>>>> it a problem, and what alwaysinline abuse do you have in mind?
>>>
>>> At -O0 the AlwaysInliner pass is run to honor the always_inline attribute. By default the @llvm.lifetime.start and @llvm.lifetime.end intrinsics were being emitted. These are compiler hints; at -O0 they're never used and thus should not be emitted. The particular test case I came across was compiling in 283s at -O0 (fast, huh). Now it compiles in 1.35s at -O0.
>>
>> thanks for the explanation. Why was it taking so long? Just the insertion of
>> these intrinsics or something else? It seems strange to me that compilation
>> should be slowed down so much - maybe it is a sign that something else is wrong,
>> some kind of inefficient handling of these intrinsics?
>
> Basically, the target-independent and target-dependent fast-isel implementations were unable to handle these intrinsics and were falling back to the selection DAG selector. Eric fixed the target-independent selector by ignoring these intrinsics (see: http://llvm.org/viewvc/llvm-project?view=rev&revision=150848). However, Eric and I both agree that this was an incomplete fix, which is what r151429/r151430 were all about.
>
> This was such a problem because of how fast-isel handles calls. In most instances, when fast-isel fails to select an instruction it bails and falls back to the selection DAG selector for the remainder of the basic block. Calls are treated differently, however. When fast-isel fails to select a call, it falls back to the selection DAG for that call only and then returns to fast-isel to finish selecting the remainder of the block. This is for historical reasons, which Dan Gohman could probably best explain. This context switching between fast-isel and selection DAG isel is most likely the culprit. You also have to keep in mind that this was a very extreme case. Of the 250K lines of IR, 50K were llvm.lifetime intrinsics (at -O0!!!). IMHO there isn't any fundamental issue with how fast-isel is working.
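
As a rough illustration of the fallback policy described in the quoted paragraph, here is a toy C++ model. It is only a sketch: `Kind`, `countDagHandoffs`, and the `skipUnsupportedCalls` flag are hypothetical names made up for this example, not LLVM APIs, and it counts handoffs to the slow selector rather than modelling real instruction selection.

```cpp
#include <cstddef>
#include <vector>

// Toy model only: these names are hypothetical and are not LLVM APIs.
enum class Kind {
  Simple,           // the fast selector can handle it
  UnsupportedCall,  // a call the fast selector cannot handle
  UnsupportedOther  // any other instruction the fast selector cannot handle
};

// Counts how many times one basic block is handed from the fast selector to
// the slower DAG selector under the policy described in the paragraph above.
// skipUnsupportedCalls models simply ignoring such calls (assumption: they
// are pure hints, so dropping them is harmless).
std::size_t countDagHandoffs(const std::vector<Kind> &block,
                             bool skipUnsupportedCalls) {
  std::size_t handoffs = 0;
  for (Kind k : block) {
    switch (k) {
    case Kind::Simple:
      break; // selected by the fast path, no handoff
    case Kind::UnsupportedCall:
      if (!skipUnsupportedCalls)
        ++handoffs; // slow path selects this call only, then fast path resumes
      break;
    case Kind::UnsupportedOther:
      return handoffs + 1; // slow path takes over the rest of the block
    }
  }
  return handoffs;
}
```

Under this model a block with N unsupported calls triggers N separate handoffs, whereas a single unsupported non-call instruction triggers just one, which is the asymmetry the paragraph points to.
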
thanks for the explanation. Can't the SelectionDAGBuilder just lower these
intrinsics to nothing/undef at -O0?
Ciao, Duncan.