[cfe-dev] SymbolRef and SVal confusion

Jordan Rose jordan_rose at apple.com
Fri Jan 4 18:30:53 PST 2013


VisitCast handles the "decay" in the AST from a raw function name to a function pointer; all C function calls are actually calls to function pointers according to the standard. But the actual code that figures out the function to call is in CallEventManager::getSimpleCall, which...huh, doesn't actually look at the callee's SVal if it's known at compile time. Which means only calls through weak function pointers would lose out. I would actually be okay with this since these are (a) rare, and (b) probably not calls we do much special processing for anyway.

If you want to try hacking this in, I'd suggest using a conjured symbol with no Expr and no block count (so it's the same all across the program) and the appropriate pointer-to-function type:

QualType Ty = Ctx.getPointerType(FD->getType());
SVB.conjureSymbol(/*Stmt=*/0, /*LCtx=*/0, Ty, /*VisitCount=*/0, /*Tag=*/FD);

And then come up with a bunch of test cases and make sure that if you, say, define "malloc" as weak that we still treat it like "malloc". If everything works, send it back and I'll commit it to SVN.

Thanks for working on this!
Jordan


On Jan 3, 2013, at 13:15 , Richard <tarka.t.otter at googlemail.com> wrote:

> Hey Jordan,
> 
> I realise SymbolExtent is the wrong symbol class to use, it was just a quick hack to see how much more work was involved in getting the analyser to assume false on function decls. Not very much it turned out. I guess a new SymExpr subclass is needed. 
> 
> The bit I am not clear on is where the analyser calls a function, where I would need to add code to handle this new symbol type. Apologies if this is a stupid question, I had a dig through ExprEngine, but did not find what I was looking for. Is it VisitCast?
> 
> Ta.
> 
> On 3 Jan 2013, at 20:22, Jordan Rose <jordan_rose at apple.com> wrote:
> 
>> SymbolExtent isn't really meant for this; it's supposed to represent the metadata of how large an allocation is in memory. Doing this is basically like changing "return func" to "return sizeof(*func)", except that functions don't really have valid sizes anyway. You really can't put an extent symbol (type size_t) into a loc::MemRegionVal (some kind of pointer-ish thing).
>> 
>> In practice, this lets you do the null test, but won't actually let the analyzer call the function, which is no good.
>> 
>> I don't have any other immediate insights to offer. We just don't have values that can represent either null or a specific function at this time. You might be able to fake it for now by adding a pre-visit check for CastExprs of type CK_FunctionToPointerDecay, and eagerly splitting the path whenever someone references a weak function.
>> 
>> Jordan
>> 
>> 
>> On Jan 3, 2013, at 10:03 , Richard <tarka.t.otter at googlemail.com> wrote:
>> 
>>> I had a quick attempt at this, by creating a SymbolExtent of a weak function decl code region and creating a SymbolicRegion with that. This actually fixes the checker I was writing, which is nice. I am not sure if I understand fully the implications of doing this however. Where does the SymbolicRegion need to be constrained back to a FunctionTextRegion?
>>> 
>>> Index: Core/SValBuilder.cpp
>>> ===================================================================
>>> --- Core/SValBuilder.cpp	(revision 171384)
>>> +++ Core/SValBuilder.cpp	(working copy)
>>> @@ -190,7 +190,13 @@
>>>  }
>>>  
>>>  DefinedSVal SValBuilder::getFunctionPointer(const FunctionDecl *func) {
>>> -  return loc::MemRegionVal(MemMgr.getFunctionTextRegion(func));
>>> +  const FunctionTextRegion *Region = MemMgr.getFunctionTextRegion(func);
>>> +  if (func->isWeak()) {
>>> +    const SymbolExtent *Sym = SymMgr.getExtentSymbol(Region);
>>> +    return loc::MemRegionVal(MemMgr.getSymbolicRegion(Sym));
>>> +  }
>>> +    
>>> +  return loc::MemRegionVal(Region);
>>>  }
>>>  
>>>  DefinedSVal SValBuilder::getBlockPointer(const BlockDecl *block,
>>> 
>>> On 20 Dec 2012, at 19:31, Ted Kremenek <kremenek at apple.com> wrote:
>>> 
>>>> On Dec 20, 2012, at 10:14 AM, Jordan Rose <jordan_rose at apple.com> wrote:
>>>> 
>>>>> The problem is that functions are represented by FunctionTextRegions. As you noticed, our design is that only SymbolicRegions can represent NULL—all other regions are known to have an address. However, this is not true for weak symbols (functions or otherwise). In order to get this right, we probably need to enhance the analyzer to treat weak extern symbols like references, and then automatically dereference them upon use.
>>>> 
>>>> I don't think the "references" analogy is quite right.  Functions are already modeled in the AST using function pointers, and they are dereferenced during a function call.  We could possibly model weak-linked functions using SymbolicRegions, that are then later constrained to alias a specific FunctionTextRegion.  Aliasing is something we need to handle better anyway, and I think this would nicely fit into that model.
>>> 
>> 

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20130104/8d93ae17/attachment.html>


More information about the cfe-dev mailing list