[llvm-commits] [llvm] r151877 - in /llvm/trunk: lib/Target/X86/X86ISelLowering.cpp test/CodeGen/X86/sibcall-5.ll
Evan Cheng
evan.cheng at apple.com
Mon Mar 5 10:50:34 PST 2012
On Mar 5, 2012, at 10:30 AM, Chad Rosier <mcrosier at apple.com> wrote:
>
> On Mar 1, 2012, at 8:13 PM, Evan Cheng wrote:
>
>>
>> On Mar 1, 2012, at 6:50 PM, Chad Rosier wrote:
>>
>>>
>>>
>>> +  // If anything is glued to the copy, then we can't safely perform a tail call.
>>> +  if (Copy->getOpcode() == ISD::CopyToReg &&
>>> +      Copy->getNumOperands() == 4)
>>> +    return false;
>>
>> Hi Chad,
>>
>> This is not the right way to check whether anything is glued to the copy. It checks whether the CopyToReg has a 4th operand, and the use of a magic number is fragile. You want to check whether the CopyToReg has a second value and whether that value has any uses.
>
> If I understand the comments in SelectionDAG.h correctly, the 4th operand is the glue value. Thus, this seems to be the correct check. I do agree, however, that using a magic number is error-prone. Is there a more generic way (i.e., an accessor function) to check for glue?
Are you checking whether 1) something is glued to the copy, or 2) the copy has a glue operand?
If it's #1, then you want to do something like Copy->hasAnyUseOfValue(1). If it's #2, then do Copy->getOperand(Copy->getNumOperands()-1).getValueType() == MVT::Glue (since glue has to be the last operand).
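To make the difference between the two cases concrete, here is a minimal, self-contained sketch. It does not use the real SelectionDAG classes: ToyNode, VT, hasGlueOperand and hasGluedUser are hypothetical stand-ins invented for illustration, and the use counts are filled in by hand.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Toy stand-ins for SelectionDAG concepts; these are NOT the real LLVM types.
enum class VT { Other, Glue, I32 };

struct ToyNode {
  std::vector<VT> OperandTypes;   // value type of each operand
  std::vector<VT> ResultTypes;    // value type of each result
  std::vector<unsigned> UseCount; // number of users of each result
};

// Case #2: the node consumes glue, i.e. its last operand has glue type.
bool hasGlueOperand(const ToyNode &N) {
  return !N.OperandTypes.empty() && N.OperandTypes.back() == VT::Glue;
}

// Case #1: something is glued to the node, i.e. its glue result has a user.
bool hasGluedUser(const ToyNode &N) {
  for (std::size_t i = 0;
       i < N.ResultTypes.size() && i < N.UseCount.size(); ++i)
    if (N.ResultTypes[i] == VT::Glue && N.UseCount[i] > 0)
      return true;
  return false;
}
```

In this model, a copy with three operands (chain, register, value) whose glue result is used has no glue operand, yet something is glued to it. That is the situation an operand-count check would miss.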
Evan
>
> Here's the exact comment and relevant code:
> ----------
> // This version of the getCopyToReg method takes an extra operand, which
> // indicates that there is potentially an incoming glue value (if Glue is not
> // null) and that there should be a glue result.
> SDValue getCopyToReg(SDValue Chain, DebugLoc dl, unsigned Reg, SDValue N,
>                      SDValue Glue) {
>   SDVTList VTs = getVTList(MVT::Other, MVT::Glue);
>   SDValue Ops[] = { Chain, getRegister(Reg, N.getValueType()), N, Glue };
>   return getNode(ISD::CopyToReg, dl, VTs, Ops, Glue.getNode() ? 4 : 3);
> }
> ----------
>
>>
>> Also, since the preceding code fragment is:
>>
>> if (Copy->getOpcode() != ISD::CopyToReg &&
>>     Copy->getOpcode() != ISD::FP_EXTEND)
>>   return false;
>>
>> You want to write something like
>>
>> if (Copy->getOpcode() != ISD::CopyToReg) {
>
> Don't you mean:
>
> if (Copy->getOpcode() == ISD::CopyToReg) {
>   // If anything is glued to the copy...
>   if (Glue)
>     return false;
> }
>
> Specifically, this should be an equality comparison, not an inequality comparison.
Right.
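Put together, the control flow would have roughly the following shape. This is only a sketch under assumptions: Opcode and copyAllowsTailCall are hypothetical names, and GlueResultHasUse stands in for whatever check is used to tell whether something is glued to the copy (e.g. the hasAnyUseOfValue(1) suggestion above).

```cpp
#include <cassert>

// Toy opcode enum; a stand-in for the ISD opcodes, not the real LLVM enum.
enum class Opcode { CopyToReg, FP_EXTEND, Other };

// Hypothetical helper: returns false when the copy node alone already
// rules out the tail call.
bool copyAllowsTailCall(Opcode Opc, bool GlueResultHasUse) {
  if (Opc == Opcode::CopyToReg) {
    // If anything is glued to the copy, a tail call is not safe.
    if (GlueResultHasUse)
      return false;
  } else if (Opc != Opcode::FP_EXTEND) {
    // Neither a CopyToReg nor an FP_EXTEND: reject.
    return false;
  }
  // The real code would go on to inspect the uses of the copy here.
  return true;
}
```

The glue check applies only to CopyToReg; an FP_EXTEND passes through unchanged, and any other opcode is rejected as before.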
Evan
>
> Chad
>
>>   // If anything is glued to the copy ...
>> } else if (Copy->getOpcode() != ISD::FP_EXTEND)
>>   return false;
>>
>> Evan
>>
>>> +
>>> bool HasRet = false;
>>> for (SDNode::use_iterator UI = Copy->use_begin(), UE = Copy->use_end();
>>> UI != UE; ++UI) {
>>>
>>> Modified: llvm/trunk/test/CodeGen/X86/sibcall-5.ll
>>> URL: http://llvm.org/viewvc/llvm-project/llvm/trunk/test/CodeGen/X86/sibcall-5.ll?rev=151877&r1=151876&r2=151877&view=diff
>>> ==============================================================================
>>> --- llvm/trunk/test/CodeGen/X86/sibcall-5.ll (original)
>>> +++ llvm/trunk/test/CodeGen/X86/sibcall-5.ll Thu Mar 1 20:50:46 2012
>>> @@ -1,5 +1,6 @@
>>> ; RUN: llc < %s -mtriple=i386-apple-darwin8 -mattr=+sse2 | FileCheck %s --check-prefix=X32
>>> ; RUN: llc < %s -mtriple=x86_64-apple-darwin -mattr=+sse2 | FileCheck %s --check-prefix=X64
>>> +; RUN: llc < %s -mtriple=x86_64-apple-darwin -mattr=-sse3 | FileCheck %s --check-prefix=X64_BAD
>>>
>>> ; Sibcall optimization of expanded libcalls.
>>> ; rdar://8707777
>>> @@ -29,3 +30,31 @@
>>> declare float @sinf(float) nounwind readonly
>>>
>>> declare double @sin(double) nounwind readonly
>>> +
>>> +; rdar://10930395
>>> +%0 = type opaque
>>> +
>>> +@"\01L_OBJC_SELECTOR_REFERENCES_2" = external hidden global i8*, section "__DATA, __objc_selrefs, literal_pointers, no_dead_strip"
>>> +
>>> +define hidden { double, double } @foo2(%0* %self, i8* nocapture %_cmd) uwtable optsize ssp {
>>> +; X64_BAD: foo
>>> +; X64_BAD: call
>>> +; X64_BAD: call
>>> +; X64_BAD: call
>>> + %1 = load i8** @"\01L_OBJC_SELECTOR_REFERENCES_2", align 8, !invariant.load !0
>>> + %2 = bitcast %0* %self to i8*
>>> + %3 = tail call { double, double } bitcast (i8* (i8*, i8*, ...)* @objc_msgSend to { double, double } (i8*, i8*)*)(i8* %2, i8* %1) optsize
>>> + %4 = extractvalue { double, double } %3, 0
>>> + %5 = extractvalue { double, double } %3, 1
>>> + %6 = tail call double @floor(double %4) optsize
>>> + %7 = tail call double @floor(double %5) optsize
>>> + %insert.i.i = insertvalue { double, double } undef, double %6, 0
>>> + %insert5.i.i = insertvalue { double, double } %insert.i.i, double %7, 1
>>> + ret { double, double } %insert5.i.i
>>> +}
>>> +
>>> +declare i8* @objc_msgSend(i8*, i8*, ...)
>>> +
>>> +declare double @floor(double) optsize
>>> +
>>> +!0 = metadata !{}
>>>
>>>
>>> _______________________________________________
>>> llvm-commits mailing list
>>> llvm-commits at cs.uiuc.edu
>>> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
>>
>