[cfe-commits] [patch] TransformIterator (motivating use case in Clang)

Douglas Gregor dgregor at apple.com
Tue May 15 14:11:37 PDT 2012


On May 15, 2012, at 11:57 AM, David Blaikie <dblaikie at gmail.com> wrote:

> [-llvm-commits since it's not too relevant to that list at this point]
> 
> On Tue, May 15, 2012 at 11:24 AM, Douglas Gregor <dgregor at apple.com> wrote:
>> 
>> On May 15, 2012, at 9:31 AM, David Blaikie <dblaikie at gmail.com> wrote:
>> 
>>> On Tue, May 15, 2012 at 9:19 AM, Douglas Gregor <dgregor at apple.com> wrote:
>>>> 
>>>> On May 6, 2012, at 7:02 PM, David Blaikie wrote:
>>>> 
>>>>> This patch adds an iterator much like boost's transform_iterator
>>>>> (without some extra bells & whistles) for use in some changes I'd like
>>>>> to make to Clang to correct some misimplemented iterators there.
>>>>> 
>>>>> A few gotchas that could be improved/changed depending on opinions:
>>>>> * I may be playing a little loose with the return type of the functor
>>>>> (see the example/motivating functor, to_pointer) - the return type
>>>>> actually must be a reference, though the result_type provides the
>>>>> underlying value type, not the reference type. If this is violated
>>>>> Clang will emit a warning, but I could make it more robust with a
>>>>> compile time assertion in the TransformIterator that the result_type
>>>>> is actually a reference type, and strip that off to provide the
>>>>> value_type of the TransformIterator.
>>>> 
>>>> It's much more correct for the value and reference types of the iterator type to be, e.g.,
>>>> 
>>>>  typedef typename Functor::result_type reference;
>>>>  typedef typename remove_reference<reference>::type value_type;
>>>> 
>>>>> * I realize adding pointer-yness back onto an iterator over references
>>>>> when the underlying data structure (in the Clang patch you can see
>>>>> this situation) is a sequence of pointers may be a bit overkill -
>>>>> though the alternatives are writing different algorithms in the two
>>>>> cases where I've used TransformIterator, or having the decl_iterator
>>>>> iterate over pointers (in which case I should probably change the
>>>>> other iterators I've modified to iterate over pointers for symmetry).
>>>> 
>>>> diff --git include/clang/AST/DeclBase.h include/clang/AST/DeclBase.h
>>>> index 6aef681..0a16ea5 100644
>>>> --- include/clang/AST/DeclBase.h
>>>> +++ include/clang/AST/DeclBase.h
>>>> @@ -1175,17 +1175,18 @@ public:
>>>>     Decl *Current;
>>>> 
>>>>   public:
>>>> -    typedef Decl*                     value_type;
>>>> -    typedef Decl*                     reference;
>>>> -    typedef Decl*                     pointer;
>>>> +    typedef Decl                     value_type;
>>>> +    typedef value_type&              reference;
>>>> +    typedef value_type*              pointer;
>>>> 
>>>> Since we tend to traffic in declaration pointers, not references, it's really beneficial to have value_type be Decl*.
>>> 
>>> Fair enough - I was hoping to reduce the amount of pointers handed
>>> around, but I realize in this case especially (where the underlying
>>> sequence is a sequence of pointers) it's really an awkward fit.
>> 
>> Why is reducing the number of pointers a specific goal?
> 
> Evidently not something everyone finds to be a useful goal - and on
> that basis perhaps I should just leave it alone.
> 
> But for myself I find functions taking references (& locals that are
> references) easier to read/reason about because it's clear that
> they're not null. Trying to decide which functions are accepting
> optional values and which ones aren't makes me pause a little - though
> perhaps more out of habit than real curiosity about whether something
> is null. If switching things to references causes pointer-habitual
> people to have the same kind of pause then that's clearly not an
> outright improvement.

Unless we're seeing lots of problems due to the pointer-centric style, I don't see a motivation to make this kind of change in the code base.

>>> This means killing off the op-> for these iterators though, right? Or
>>> do you want to keep that as an excusable convenience?
>> 
>> Honestly, I preferred the world back in its conveniently-inconsistent state where operator* returned a pointer and operator-> returned that same pointer, because being able to refer to Iter->getDeclName() rather than (*Iter)->getDeclName() is *really* convenient.
> 
> It is convenient, certainly. Though I find op* returning T& to be
> convenient/useful as well. In many cases we assign *i to a local
> pointer (I'm not sure why - perhaps we should just have a better name
> for the iterator and use that directly) & don't get the convenience of
> op-> (I assume this code was written before the op-> was added,
> actually) let alone the convenience of just using '.'.
> 
> But if that's what you want, I can do that.

"I.foo" vs. "I->foo" doesn't matter much to me, but "(*I)->foo" and "&*I" are really, really ugly.

>> The inclusion of operator-> in the iterator concepts was a mistake. Nothing in the standard library actually depends on it, no sane algorithms need it, and it's a major PITA to implement for iterator adaptors (as you've noticed <g>).
> 
> That's fine - though, since I want to replace
> specific/filtered_decl_iterators with some generic adapters, those
> adapters will need to provide the op-> convenience you want, whether
> or not it was specified in the concept/standard.

Yeah.

>>> In which case my
>>> change boils down to just changing the pointer typedefs to be Decl**
>>> in these various iterator types. (& might make my planned work,
>>> generalizing the filtering iterators, a bit weird - possibly providing
>>> an op-> to the generic filtering iterators whenever the value type is
>>> a pointer to match/support this oddity)
>> 
>> Decl** won't always work, though. You'll almost certainly need a proxy.
> 
> For the basic decl_iterators they do (because the underlying sequence
> is Decl*) but for the specific/filtered, where it'd be SpecificDecl**
> but we don't have a real SpecificDecl* to point to, it wouldn't and
> I'd need a proxy. Are there other cases/reasons for needing a proxy
> that you're referring to?

I don't know of other cases, no.

>>> Would you prefer the other iterators I changed
>>> (specific_decl_iterator, filtered_decl_iterator) be switched to
>>> pointer value_type too?
>> 
>> I did prefer the pointer value_type; perhaps others want to weigh in.
> 
> I'm fairly sure all 3 of these should be consistent, and that the
> specific/filtered should be refactored into a common adapter
> (typedef'd in place so clients of Decl wouldn't need to change, of
> course). Generally I prefer references, but I realize they're not a
> perfect fit.
> 
> Yeah - I'd like to be able to get more discussion on this & other
> stylistic (micro design?) aspects - I kind of feel like I'm just
> throwing stuff up & seeing what's stuck after a few weeks.

IRC is generally the best place for micro-design discussions.

	- Doug



More information about the cfe-commits mailing list