[cfe-dev] RFC: allow modules found via -fmodule-map-file to shadow implicitly discovered modules

Richard Smith via cfe-dev cfe-dev at lists.llvm.org
Tue Nov 10 17:47:55 PST 2015


On Tue, Nov 10, 2015 at 4:50 PM, Ben Langmuir via cfe-dev <
cfe-dev at lists.llvm.org> wrote:

>
> On Nov 10, 2015, at 4:39 PM, Richard Smith <richard at metafoo.co.uk> wrote:
>
> On Tue, Nov 10, 2015 at 3:39 PM, Ben Langmuir via cfe-dev <
> cfe-dev at lists.llvm.org> wrote:
>
>> Hey all,
>>
>> *Background*
>>
>> Suppose you’re developing a system library FancyLib that installs a
>> module map file into a default search path, and you’re using implicit
>> module maps.  When you’re developing locally, you will likely end up with
>> multiple versions of your module that are reachable in your search paths:
>> * The system version /usr/include/FancyLib/module.modulemap
>> * The local development version
>> ~/…/Build/Products/Debug/FancyLib/module.modulemap
>>
>> You generally want to use the local version, and ignore the one from the
>> system.  Unfortunately, this currently causes module-redefinition errors if
>> both module map files are ever parsed.  For framework modules, we have been
>> getting away with this because the compiler almost never looks into a
>> framework directory if it has previously looked into a framework directory
>> of the same name.  Sadly the same is not true of non-frameworks.
>>
>>
>> *Proposed change for users*
>>
>> Users developing non-framework system modules would add:
>>     -fmodule-map-file=path/to/local/copy/of/module.modulemap
>>
>> Typically, the “local copy” of the module map file would be to somewhere
>> inside their build products.
>>
>>
>> *Proposed clang changes*
>>
>> The flag -fmodule-map-file already exists in clang, and it causes clang
>> to explicitly parse the specified module map, making any modules defined in
>> that file available for @import (or via auto-import from a #import).
>>
>> I propose we extend -fmodule-map-file so that modules found by this
>> mechanism are allowed to shadow modules that we found implicitly in header
>> search paths.  When searching for a module by name, we will not consider
>> the shadowed module at all. It will be an error for header search to find a
>> header that is considered part of a shadowed module.
>>
>> If two modules with the same name are *both* found in the contents of
>> -fmodule-map-file arguments, then it is a redefinition error.  Since these
>> flags are ordered, we could theoretically allow shadowing between them, but
>> I cannot think of a good use-case for this and suspect it would usually be
>> unexpected to users.
>>
>> If there are two modules with the same name and *neither* of them comes
>> from -fmodule-map-file, then it is a redefinition error just like it is
>> today.
>>
>> *Why not use header search path order to do shadowing?*
>>
>> I originally thought we should just allow modules that come from earlier
>> search paths to shadow modules from later ones.  This would avoid having to
>> use an extra flag -fmodule-map-file, and let us infer the module
>> configuration “for free”, since users are already accustomed to setting up
>> their header search paths.  However, there are a number of problems with
>> this approach:
>>
>> * Module map files from “later” search paths are often parsed ahead of
>> module map files from “earlier” paths.  We would either need to eagerly
>> parse module map files from all search paths in order (slow), or we would
>> need to be able to retroactively make a module shadowed (brittle).
>> * Module map lookup does not know about search paths, so when searching
>> for a module map file by umbrella directory we don't know when we cross
>> search path boundaries
>> * We would need to lock down the HeaderSearch interface to prevent
>> modifying search path order in the middle of a build (although that might
>> be a generally good thing…)
>>
>
> The direction here makes complete sense to me.
>
>
>> My patch implementing this is attached for reference.
>>
>
> This seems to be somewhat more general than we need for the above proposal
> -- it supports a fully-general set of scopes, whereas it would seem
> sufficient to track a flag to indicate whether the module came from a
> -fmodule-map-file= file or from an implicit module map file. Do you have
> some future extension in mind that would need more than 2 different scopes?
>
>
> Nothing concrete, although it feels like if we might someday be able to
> get at least one other scope for the default system search paths.  The
> reason I chose to not just use a flag was - perhaps embarrassingly - that I
> haven’t been able to come up with a nice way to express what that “flag” is
> to the ModuleMap class without defining it in terms of our command-line
> interface, which feels like a conceptual layering violation.  I’m open to
> suggestions thought :-)
>

The flag seems to be whether the module map is explicit or implicit, but
that's more at the level of HeaderSearch's world view than ModuleMap's.

I also think it's probably unnecessary to build shadow modules as your
patch does (unless you expect those files to be reachable via
#include_next?). Instead, you could do the same thing we do when the
definition of a module was provided by a loaded module file: skip to the
close brace of the module definition and return. We particularly don't want
to stat all the files named in the shadow module, since the extra stat
syscalls can be moderately expensive. It seems like a cleaner semantic
model to say that the shadowed module is completely ignored than that it
exists and has the same name but is a somehow different module.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20151110/d1844fdd/attachment.html>


More information about the cfe-dev mailing list