[cfe-dev] [analyzer] MallocChecker & checker dependencies

Sat Mar 17 07:25:27 PDT 2018

>
> This should already be supported by MallocChecker - i.e. it warns when you
> allocate memory with, say, new() and release it with free() or delete[]().
>
> If methods of a smart pointer are "inlined" during analysis, these bugs
> will already be caught automatically.

I took a look at your recent work & status report posted there in mailing
lists. And I have a question about how it would work in particular cases,
e.g.:
1. Returning smart_ptr by value from function that has it's implementation
in analyzed translation unit but it isn't called from any other function in
the same TU.

std::shared_ptr<S> createInstance() {
   return std::shared_ptr<S>(new S, free);
}

If I understand correctly, compiler is free to apply copy-elison
optimization on shared_ptr which is returned by value. So destructor call
won't be modeled properly because it's not inlined during analysis. And bug
won't be caught.

2. As you mentioned before, report would be fired only if constructor and
destructor have been inlined (which is not guaranteed to be done). As the
result such check won't work in general case.

Actually my point is about modeling smart pointers behavior specifically
and don't inline any calls to constructor / destructor & methods those
modeled by checker. And the question immediately arises: is there way to
control inline policy?
And sorry for any possible misunderstanding from my side as I'm not quite
familiar with CSA internals.

Regards, Alexey K

2018-03-17 5:21 GMT+03:00 Artem Dergachev <noqnoqneo at gmail.com>:

>
>
> On 16/03/2018 12:30 PM, Alexey Knyshev wrote:
>
>> Sorry, accidentally removed part of message before sending.
>>
>> Aleksei,
>>
>>     Could you share your ideas of checker design? It is possible that
>>     problems you met can be solved in different (maybe even better)
>>     way if we know the whole picture.
>>
>>
>> I'm currently have no clear idea how to implement it in the "right
>> manner". But first of all I would aim to track the origin (malloc, new, new
>> [], etc) of SVal and check if Deleter is the expected corresponding way to
>> deallocate such SVal.
>>
>
> This should already be supported by MallocChecker - i.e. it warns when you
> allocate memory with, say, new() and release it with free() or delete[]().
>
> If methods of a smart pointer are "inlined" during analysis, these bugs
> will already be caught automatically.
>
>
>> Regards, Alexey K
>>
>> 2018-03-16 22:22 GMT+03:00 Alexey Knyshev <alexey.knyshev at gmail.com
>> <mailto:alexey.knyshev at gmail.com>>:
>>
>>     Hello guys,
>>
>>     And thanks for your attention
>>
>>     Aleksei,
>>
>>         Could you share your ideas of checker design? It is possible
>>         that problems you met can be solved in different (maybe even
>>         better) way if we know the whole picture.
>>
>>
>>     As I said, I'm interested in improving current state of dynamic
>>     memory modeling. Especially stuff related to smart pointers (e.g.
>>     shared, unique) which also mentioned in list of potential checkers
>>     as smartptr.SmartPtrInit
>>     <https://clang-analyzer.llvm.org/potential_checkers.html>*
>>
>>     *
>>
>>         **You can see an example of how state trait is exported in
>>         GenericTaintChecker or MPIChecker. Generally, you just create
>>         a named ProgramStateTrait in the header.
>>
>>
>>     Briefly looked at MPITypes.h. Does it mean we should move
>>     RegionState to separate file and register it via Traits in the
>>     same manner to make it avaliable from other checkers (not other TU
>>     as mentioned in ento::mpi::RequestMap)?
>>
>>
> GenericTaintChecker is made in a fairly intrusive manner by putting stuff
> into the program state class directly, which isn't very sane. I'd
> definitely prefer something similar to dynamic type propagation (see
> DynamicTypeMap.cpp, DynamicTypeMap.h). You can use the usual
> REGISTER_MAP_WITH_PROGRAMSTATE (etc.) macros in the .cpp file as long as
> you provide accessor methods declared in the .h file that other checkers
> could include.
>
>
>>     Artem,
>>
>>         you need to keep all of this in mind when writing a checker.
>>
>>
>>     Sure, but on the other hand I think it's possible to implement and
>>     improve modeling of various API calls' effects step-by-step. Let's
>>     say, it case of SmartPtrInit checker mentioned before the bare
>>     minimum would be modeling of construction (and destruction to
>>     avoid leak report from existing related checkers).
>>
>>         which is why the few experimental C++ checkers that we
>>         currently have required heavy hacks to become possible. But
>>         for now you'd rather keep an eye on these problems.
>>
>>
>>     Does it mean it's better to wait a bit for Core improvements from
>>     main contributors? Does it make sense to make an efforts to
>>     implement SmartPtrItnit checker in current state of things?
>>
>>
> You should feel free to start working on the checker in an incremental
> manner (everything is better in an incremental manner!), but in some cases
> i may prefer improving the checker API or fixing core modeling bugs instead
> of writing large portions of checker code to work around inconvenient APIs
> or bugs, and this is something i might not be immediately capable of doing
> myself (due to being busy with other things), which may potentially delay
> your progress.
>
>     Thanks in advance and regards,
>>     Alexey K
>>
>>     2018-03-16 21:47 GMT+03:00 Artem Dergachev <noqnoqneo at gmail.com
>>     <mailto:noqnoqneo at gmail.com>>:
>>
>>
>>         Dynamic memory management is pretty much done at this point -
>>         we have very good support for malloc()-like stuff and
>>         reasonably good support for operator new() (support for
>>         operator new[]() is currently missing because it requires
>>         calling an indefinite number of constructors which is not yet
>>         implemented). There is also a known issue with custom operator
>>         new(...) returning null pointers - in this case we should not
>>         be calling the constructor but for now we evaluate the
>>         constructor conservatively.
>>
>>         Your real problem will be managing smart pointers themselves
>>         because they are C++ objects that have a myriad of methods,
>>         they can be copied, moved, assigned, move-assigned, they
>>         destroyed, they lifetime-extended, you need to keep all of
>>         this in mind when writing a checker. This is slowly getting
>>         easier because i'm currently working on that, but until
>>         recently it wasn't working correctly in the core, let alone
>>         checkers, which is why the few experimental C++ checkers that
>>         we currently have required heavy hacks to become possible. But
>>         for now you'd rather keep an eye on these problems.
>>
>>         On 16/03/2018 3:57 AM, Aleksei Sidorin via cfe-dev wrote:
>>
>>             Hi Alexey!
>>
>>             Could you share your ideas of checker design? It is
>>             possible that problems you met can be solved in different
>>             (maybe even better) way if we know the whole picture.
>>             Regarding your questions, you can find some answers below.
>>
>>             1. a) The first way for checker communication is to share
>>             their program state trait. You can see an example of how
>>             state trait is exported in GenericTaintChecker or
>>             MPIChecker. Generally, you just create a named
>>             ProgramStateTrait in the header. You can take a look at
>>             TaintManager.h and MPITypes.h and how they are used.
>>             b) To set a dependency from another checker, you can just
>>             register it while registering your checker. An example can
>>             be found in MallocChecker where register$Checker also
>>             calls registerCStringCheckerBasic to register a checker it
>>             depends on.
>>             As you pointed, inter-checker communication can become a
>>             source of some problems. Most of them are discussed in
>>             this conversation:
>>             http://clang-developers.42468.n3.nabble.com/analyzer-RFC-Des
>> ign-idea-separate-modelling-from-checking-td4059122.html
>>             <http://clang-developers.42468.n3.nabble.com/analyzer-RFC-
>> Design-idea-separate-modelling-from-checking-td4059122.html>
>>
>>             2. I think there is nothing bad in sharing RegionState
>>             across checkers in the way shown in 1a.
>>
>>             3. Artem Dergachev has done some excellent work on
>>             improvement of operator 'new' processing in CSA engine.
>>             Regarding checkers, I can see some on
>>             https://clang-analyzer.llvm.org/potential_checkers.html
>>             <https://clang-analyzer.llvm.org/potential_checkers.html>:
>>             for example, undefbehavior.AutoptrsOwnSameObj. You can
>>             search this list to find more.
>>
>>
>>             15.03.2018 23:54, Alexey Knyshev via cfe-dev пишет:
>>
>>                 Hi there!
>>
>>                 While thinking about how it would be possible to
>>                 implement various smart ptr related checkers I tried
>>                 to review current state of MallocChecker and came up
>>                 with that it would be great to have RegionState info
>>                 available to other checkers. Could you please share
>>                 your points of view and comments on the following
>>                 statements / questions:
>>
>>                 1. Is there any right way for chaining checkers? How
>>                 they are expected to communicate between each other
>>                 (excluding generation of nodes / ProgramStates). I've
>>                 heard that there are couple of problems caused by
>>                 inlining functions, constructors / descructors.
>>                 2. What do you think about moving RegionState to the
>>                 Core of CSA or providing optional extended info in
>>                 MemRegion about the source of such region (new
>>                 opearator / array new, malloc, alloca, etc). So it
>>                 would be available to all checkers.
>>                 3. Is there any roadmap for CSA and especially for
>>                 dynamic memory management modeling & related checkers?
>>
>>                 Regards, Alexey K
>>
>>                 --                 linkedin.com/profile <
>> http://linkedin.com/profile>
>>                 <https://www.linkedin.com/prof
>> ile/view?id=AAMAABn6oKQBDhBteiQnWsYm-S9yxT7wQkfWhSw
>>                 <https://www.linkedin.com/prof
>> ile/view?id=AAMAABn6oKQBDhBteiQnWsYm-S9yxT7wQkfWhSw>>
>>
>>                 github.com/alexeyknyshev
>>                 <http://github.com/alexeyknyshev>
>>                 <http://github.com/alexeyknyshev
>>                 <http://github.com/alexeyknyshev>>
>>                 bitbucket.org/alexeyknyshev
>>                 <http://bitbucket.org/alexeyknyshev>
>>                 <https://bitbucket.org/alexeyknyshev/
>>                 <https://bitbucket.org/alexeyknyshev/>>
>>
>>
>>                 _______________________________________________
>>                 cfe-dev mailing list
>>                 cfe-dev at lists.llvm.org <mailto:cfe-dev at lists.llvm.org>
>>                 http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>                 <http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev>
>>
>>
>>
>>             --             Best regards,
>>             Aleksei Sidorin,
>>             SRR, Samsung Electronics
>>
>>
>>             _______________________________________________
>>             cfe-dev mailing list
>>             cfe-dev at lists.llvm.org <mailto:cfe-dev at lists.llvm.org>
>>             http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>>             <http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev>
>>
>>
>>
>>
>>
>>     --     linkedin.com/profile
>>     <https://www.linkedin.com/profile/view?id=AAMAABn6oKQBDhBtei
>> QnWsYm-S9yxT7wQkfWhSw>
>>
>>     github.com/alexeyknyshev <http://github.com/alexeyknyshev>
>>     bitbucket.org/alexeyknyshev <https://bitbucket.org/alexeyknyshev/>
>>
>>
>>
>>
>> --
>> linkedin.com/profile <https://www.linkedin.com/prof
>> ile/view?id=AAMAABn6oKQBDhBteiQnWsYm-S9yxT7wQkfWhSw>
>>
>> github.com/alexeyknyshev <http://github.com/alexeyknyshev>
>> bitbucket.org/alexeyknyshev <https://bitbucket.org/alexeyknyshev/>
>>
>
>

-- 
linkedin.com/profile
<https://www.linkedin.com/profile/view?id=AAMAABn6oKQBDhBteiQnWsYm-S9yxT7wQkfWhSw>

github.com/alexeyknyshev
bitbucket.org/alexeyknyshev
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20180317/649d25ca/attachment.html>