<div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr">Belatedly jumping in here, as there is a potential alternative answer for this in the newer iteration of the ORC APIs.<div><br></div><div>The new APIs replace symbol resolvers with first class symbol tables, "JITDylib" instances, which provide a way to namespace code so that duplicate names do not clash. (JITDylibs are also faster to search, and internally provide synchronization support for concurrent compilation).</div><div><br></div><div><div>Modules (and any other program representations) are always added to JITDylibs in the new API, and you control symbol resolution by describing "links-against" style relationships between JITDylibs the same way you would when building a program/library on the command line. You can also attach symbol definition generators to JITDylibs to generate new definitions programmatically if desired. I have included an example below that shows how to build a simple IR JIT that uses both techniques.</div></div><div><br></div><div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">The JIT contains any number of LLVM modules, each of which defines a<br>function, plus a "main" module that calls those functions. Several<br>functions may have the same signature, so I need to find a way to<br>resolve them.<br></blockquote><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex">Originally, I just put each module's code in its own namespace when it<br>was compiled. But now we want to be able to compile them separately to<br>bitcode files and read them later. So at compilation time there is no<br>longer any way to assign a unique namespace to each.</blockquote></div><div><br></div><div>In this case, I believe that you could place each Module (or group of modules whose names are guaranteed not to clash) in its own JITDylib. You would use whatever disambiguation process you are using now to find the "correct" version of the function to find the "correct" JITDylib instead, and this would allow you to resolve correctly without modifying stored IR.</div><div><br></div><div>Cheers,</div><div>Lang.<br></div><div><br></div><div>Example code:</div><div><br></div><div><div><font color="#38761d"><span style="font-family:monospace,monospace">// Create a JITTargetMachineBuilder and DataLayout.</span><br></font></div><div><font face="monospace, monospace" color="#38761d">// We use a target machine builder rather than a single target machine as the new APIs are</font></div><div><font face="monospace, monospace" color="#38761d">// capable of compiling on multiple threads, though we do not do that in this example</font></div><div><font face="monospace, monospace">auto JTMB = ExitOnErr(JITTargetMachineBuilder::detectHost());</font></div><div><font face="monospace, monospace">auto DL = ExitOnErr(JTMB.getDefaultDataLayoutForTarget());</font></div><div><font face="monospace, monospace"><br></font></div><div><font face="monospace, monospace" color="#38761d">// Now we create an ExecutionSession (string pool, error reporting,</font></div><div><font face="monospace, monospace" color="#38761d">// session mutex), and object and IR compile layers.</font></div><div><font face="monospace, monospace">ExecutionSession ES;</font></div><div><font face="monospace, monospace">RTDyldObjectLinkingLayer ObjLayer(ES, [](){ return llvm::make_unique<SectionMemoryManager>(); });</font></div><div><font face="monospace, monospace">IRCompileLayer CompileLayer(ES, ObjLayer, ConcurrentIRCompiler(JTMB));</font></div><div><font face="monospace, monospace"><br></font></div><div><font face="monospace, monospace" color="#38761d">// Now we get to the interesting part: We declare two JITDylibs. One,</font></div><div><font face="monospace, monospace" color="#38761d">// ProcessSymbolsLib, will auto-generate definitions by calling dlsym</font></div><div><font face="monospace, monospace" color="#38761d">// on the current process, making this process's symbols available to</font></div><div><font face="monospace, monospace" color="#38761d">// JIT'd code.</font></div><div><font face="monospace, monospace" color="#38761d">// The second, Main, will contain our JIT'd code. We add a "links-against"</font></div><div><font face="monospace, monospace" color="#38761d">// relationship from Main to ProcessSymbolsLib by calling addToSearchOrder.</font></div><div><font face="monospace, monospace">auto &ProcessSymbolsLib = ES.createJITDylib("<process symbols>");</font></div><div><font face="monospace, monospace">ProcessSymbolsLib.setGenerator(ExitOnErr(DynamicLibrarySearchGenerator::GetForCurrentProcess(DL)));</font></div><div><font face="monospace, monospace">auto &Main = ES.createJITDylib("main");</font></div><div><font face="monospace, monospace">Main.addToSearchOrder(ProcessSymbolsLib);</font></div><div><font face="monospace, monospace"><br></font></div><div><font face="monospace, monospace" color="#38761d">// Now we can add code to the Main library, and perform a look up on it.</font></div><div><font face="monospace, monospace" color="#38761d">// ExecutionSession::lookup takes as its first argument a list of JITDylibs to search</font></div><div><font face="monospace, monospace" color="#38761d">// for the requested definition.</font></div><div><font face="monospace, monospace">ExitOnErr(CompileLayer.add(Main, ThreadSafeModule(std::move(Mod), std::move(Ctx))));</font></div><div><font face="monospace, monospace">auto FooSym = ExitOnErr(ES.lookup({&Main}, "_foo"));</font></div><div><font face="monospace, monospace">auto Foo = (FooTy)FooSym.getAddress();</font></div><div><font face="monospace, monospace">Foo();</font></div></div><div><font face="monospace, monospace"><br></font></div><div><br></div></div></div></div></div></div></div><br><div class="gmail_quote"><div dir="ltr">On Thu, Sep 13, 2018 at 8:23 AM Geoff Levner via llvm-dev <<a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Well, I've gotten this to work by playing with the symbol resolver as<br>
you suggest. Almost...<br>
<br>
In the main module, I declare the functions in (fictitious)<br>
namespaces. In the JIT, the symbol resolver recognizes those<br>
namespaces, which tell it in which modules to look for the<br>
corresponding unnamespaced functions. In a simple test case, that<br>
works. But in a more complex case, execution fails when I try to run<br>
the constructors for the main module. The error message says that the<br>
namespaced functions from the main module were not found, so<br>
apparently somebody somewhere is looking for those symbols and<br>
bypassing the JIT's symbol resolver... Perhaps the linking layer?<br>
<br>
I think I will go back to the UUID-based namespace idea, which would<br>
be less of a headache because it doesn't involve LLVM...<br>
On Thu, 13 Sep 2018 at 11:12, Geoff Levner <<a href="mailto:glevner@gmail.com" target="_blank">glevner@gmail.com</a>> wrote:<br>
><br>
> On Wed, 12 Sep 2018 at 21:48, Andres Freund <<a href="mailto:andres@anarazel.de" target="_blank">andres@anarazel.de</a>> wrote:<br>
> ><br>
> > Hi,<br>
> ><br>
> > On 2018-09-12 12:09:24 +0200, Geoff Levner via llvm-dev wrote:<br>
> > > Greetings, LLVM wizards!<br>
> ><br>
> > Not one of them...<br>
> ><br>
> ><br>
> > > We have an application that uses Clang and Orc JIT to compile and<br>
> > > execute C++ code on the fly.<br>
> > ><br>
> > > The JIT contains any number of LLVM modules, each of which defines a<br>
> > > function, plus a "main" module that calls those functions. Several<br>
> > > functions may have the same signature, so I need to find a way to<br>
> > > resolve them.<br>
> > ><br>
> > > Originally, I just put each module's code in its own namespace when it<br>
> > > was compiled. But now we want to be able to compile them separately to<br>
> > > bitcode files and read them later. So at compilation time there is no<br>
> > > longer any way to assign a unique namespace to each.<br>
> ><br>
> > Why not? If you assign a random uuid, or a sequential number of<br>
> > whatnot, that should work.<br>
><br>
> Yes, that is the solution I am looking into at the moment, actually:<br>
> using a UUID to generate a namespace when the module is compiled.<br>
> However, that means saving the UUID somewhere; the bitcode is no<br>
> longer self-sufficient. I suppose I could create a special global<br>
> variable in the module containing the UUID...<br>
><br>
> > > 2. Assign each module a unique namespace, but don't change the modules<br>
> > > themselves: just add the namespace when a function is called from the<br>
> > > main module, and modify the JIT's symbol resolver to strip the<br>
> > > namespace and look for the function only in the relevant module.<br>
> ><br>
> > That's kind of what I do for a similar-ish problem in the JIT engine in<br>
> > postgres (which uses orcjit). There multiple dynamically loaded<br>
> > extensions can register functions whose source code is available, and<br>
> > each of them can have conflicting symbols. The equivalent of your main<br>
> > module generates function names that encode information about which<br>
> > module to look for the actual definition of the function, and then does<br>
> > the symbol resolution outside of LLVMs code. I do that both when<br>
> > inlining these functions, and when generating funciton calls to the<br>
> > external function.<br>
><br>
> I did try something like that. The problem I ran into is that the<br>
> symbol resolver receives mangled function names. It is easy enough to<br>
> demangle them there, but hard to mangle names before compiling. Once<br>
> you have decoded your function name in the symbol resolver, how do you<br>
> generate a mangled name for the actual function you want to resolve<br>
> to?<br>
><br>
> > Not sure if that helps.<br>
> ><br>
> > Greetings,<br>
> ><br>
> > Andres Freund<br>
><br>
> Thanks, Andres.<br>
_______________________________________________<br>
LLVM Developers mailing list<br>
<a href="mailto:llvm-dev@lists.llvm.org" target="_blank">llvm-dev@lists.llvm.org</a><br>
<a href="http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev" rel="noreferrer" target="_blank">http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-dev</a><br>
</blockquote></div>