[llvm] r250032 - [LoopVectorize] Shrink integer operations into the smallest type possible

Michael Zolotukhin via llvm-commits llvm-commits at lists.llvm.org
Mon Nov 9 09:09:59 PST 2015


> On Nov 9, 2015, at 6:34 AM, James Molloy <james.molloy at arm.com> wrote:
> 
> Hi Michael,
> 
> I’ve addressed as many of these as possible in r252469.
Thanks, James!

Michael
> 
> Cheers,
> 
> James
> 
>> On 6 Nov 2015, at 17:38, Michael Zolotukhin <mzolotukhin at apple.com <mailto:mzolotukhin at apple.com>> wrote:
>> 
>> Hi James,
>> 
>>> On Nov 6, 2015, at 7:16 AM, James Molloy <james.molloy at arm.com <mailto:james.molloy at arm.com>> wrote:
>>> 
>>> Hi Michael,
>>> 
>>> I’ll get around to implementing your comments on Monday, if that’s OK?
>> Sure!
>>> 
>>>> Should we have the same issue in the SLP vectorizer as well?
>>> 
>>> Yes; that’s why I made this a utility and decoupled it from the loop vectoriser.
>> Cool, thanks!
>>> 
>>>> Would it be better to return the map by reference to avoid unnecessary copying
>>> 
>>> Silviu mentioned this in the review. I believe this should be move-constructed as we’re returning a temporary with a move operator, so zero cost.
>> Got it.
>>> 
>>>> Why can’t we just use Instruction* instead of Value* in all these sets? It looks like it could save some casts in the code
>>> 
>>> SGTM, I’ll do this.
>>> 
>>>> Unneeded whitespace. There are also some other spots formatted badly - maybe reformat it while the code is still relatively fresh?
>>> 
>>> Will do.
>>> 
>>>> We look for I->getOperand(0) in MinBWs, and then use MinBWs[I] (not operand 0) - is it intentional? If not, we can also avoid double querying MinBWs by using MinBWs.find.
>>> 
>>> Sounds reasonable to me. Will do.
>>> 
>>>> Do we really need to run all these passes to check loop-vectorizer?
>>> 
>>> We do need instcombine as we rely on it to sort out our mess. I’ll take a look at paring down the pass list and see what we actually need. I agree it does look a bit overkill at the moment…
>> 
>> Thanks for doing this!
>> 
>> Michael
>>> 
>>> Cheers,
>>> 
>>> James
>>> 
>>>> On 6 Nov 2015, at 08:13, Michael Zolotukhin <mzolotukhin at apple.com <mailto:mzolotukhin at apple.com>> wrote:
>>>> 
>>>> Hi James,
>>>> 
>>>> Thanks for working on this! Should we have the same issue in the SLP vectorizer as well?
>>>> 
>>>> Also, some remarks inline:
>>>> 
>>>>> @@ -434,3 +437,130 @@ llvm::Value *llvm::getSplatValue(Value *
>>>>> 
>>>>> return InsertEltInst->getOperand(1);
>>>>> }
>>>>> +
>>>>> +DenseMap<Instruction*, uint64_t> llvm::computeMinimumValueSizes(
>>>>> +  ArrayRef<BasicBlock*> Blocks, DemandedBits &DB,
>>>>> +  const TargetTransformInfo *TTI) {
>>>> Would it be better to return the map by reference to avoid unnecessary copying?
>>>> 
>>>>> +
>>>>> +  // DemandedBits will give us every value's live-out bits. But we want
>>>>> +  // to ensure no extra casts would need to be inserted, so every DAG
>>>>> +  // of connected values must have the same minimum bitwidth.
>>>>> +  EquivalenceClasses<Value*> ECs;
>>>>> +  SmallVector<Value*,16> Worklist;
>>>>> +  SmallPtrSet<Value*,4> Roots;
>>>>> +  SmallPtrSet<Value*,16> Visited;
>>>>> +  DenseMap<Value*,uint64_t> DBits;
>>>> Why can’t we just use Instruction* instead of Value* in all these sets? It looks like it could save some casts in the code.
>>>> 
>>>>> @@ -2009,6 +2034,7 @@ InnerLoopVectorizer::getVectorValue(Valu
>>>>> // If this scalar is unknown, assume that it is a constant or that it is
>>>>> // loop invariant. Broadcast V and save the value for future uses.
>>>>> Value *B = getBroadcastInstrs(V);
>>>>> +
>>>> Unneeded whitespace. There are also some other spots formatted badly - maybe reformat it while the code is still relatively fresh?
>>>> 
>>>>> @@ -5168,6 +5314,8 @@ LoopVectorizationCostModel::getInstructi
>>>>> case Instruction::ICmp:
>>>>> case Instruction::FCmp: {
>>>>>  Type *ValTy = I->getOperand(0)->getType();
>>>>> +    if (VF > 1 && MinBWs.count(dyn_cast<Instruction>(I->getOperand(0))))
>>>>> +      ValTy = IntegerType::get(ValTy->getContext(), MinBWs[I]);
>>>> We look for I->getOperand(0) in MinBWs, and then use MinBWs[I] (not operand 0) - is it intentional? If not, we can also avoid double querying MinBWs by using MinBWs.find.
>>>> 
>>>>> --- llvm/trunk/test/Transforms/LoopVectorize/AArch64/loop-vectorization-factors.ll (added)
>>>>> +++ llvm/trunk/test/Transforms/LoopVectorize/AArch64/loop-vectorization-factors.ll Mon Oct 12 07:34:45 2015
>>>>> @@ -0,0 +1,243 @@
>>>>> +; RUN: opt -S < %s -basicaa -loop-vectorize -simplifycfg -instsimplify -instcombine -licm -force-vector-interleave=1 2>&1 | FileCheck %s
>>>> Do we really need to run all these passes to check loop-vectorizer?
>>>> 
>>>> 
>>>> Thanks,
>>>> Michael
>>>> 
>>> 
>>> 
>>> ________________________________
>>> 
>>> -- IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.
> 
> 
> 
> -- IMPORTANT NOTICE: The contents of this email and any attachments are confidential and may also be privileged. If you are not the intended recipient, please notify the sender immediately and do not disclose the contents to any other person, use it for any purpose, or store or copy the information in any medium. Thank you.

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20151109/287ba106/attachment.html>


More information about the llvm-commits mailing list