<div class="gmail_quote">On Mon, Mar 26, 2012 at 4:26 PM, Douglas Gregor <span dir="ltr"><<a href="mailto:dgregor@apple.com">dgregor@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div style="word-wrap:break-word"><br><div><div><div class="h5"><div>On Mar 26, 2012, at 7:21 AM, Manuel Klimek <<a href="mailto:klimek@google.com" target="_blank">klimek@google.com</a>> wrote:</div><br><blockquote type="cite">
<div class="gmail_quote">On Mon, Mar 26, 2012 at 3:56 PM, Douglas Gregor <span dir="ltr"><<a href="mailto:dgregor@apple.com" target="_blank">dgregor@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div><br>
On Mar 26, 2012, at 4:08 AM, Manuel Klimek <<a href="mailto:klimek@google.com" target="_blank">klimek@google.com</a>> wrote:<br>
<br>
> On Wed, Mar 21, 2012 at 10:44 PM, Marshall Clow <<a href="mailto:mclow.lists@gmail.com" target="_blank">mclow.lists@gmail.com</a>> wrote:<br>
>> On Mar 21, 2012, at 8:07 AM, Manuel Klimek wrote:<br>
>>> On Wed, Mar 21, 2012 at 3:09 PM, Marshall Clow <<a href="mailto:mclow.lists@gmail.com" target="_blank">mclow.lists@gmail.com</a>> wrote:<br>
>>>> On Mar 21, 2012, at 6:25 AM, Manuel Klimek wrote:<br>
</div><div>>> I think I understand now. The point that I was trying to make is that I think that for simple tools we (or at least I) would like to be able to forego building some kind of database (JSON, Makefile, Ninja), and just specify the options and source files on the command line.<br>
>><br>
>> $ ./oneOffTool -I blah -Dblix=4 -I other_dir -o output_dir -someCustomFlag *.cpp<br>
>><br>
>> As long as we're abstracting the idea of a "compilation database", I think that generating such a db from an argc/argv pair seems like a good idea (and not very hard).<br>
>><br>
>> Hopefully what I'm trying to convey is clearer now.<br>
><br>
> Well, the interesting thing is how to design this. From a design point<br>
> of view, the command line becomes very different depending on whether<br>
> you basically want to "forward all args to clang in the tool" or "find<br>
> the args for the specified files and run the clang tool over them".<br>
> The first one inherently means I can only specify the command line for<br>
> a single invocation, so it is also pretty limited. I agree it is an<br>
> important use case though.<br>
><br>
> A possible solution would be to allow for a command line flag that<br>
> switches to use the command line arguments directly as arguments for<br>
> the clang tools. My problem is that I don't really see a good way to<br>
> handle that with llvm-provided command line abstractions (perhaps I'm<br>
> missing something there?)<br>
><br>
> What I basically want would be:<br>
> <tool> -d <build directory> <file1 ... fileN><br>
> <tool> --use-args <clang command line><br>
><br>
> But I don't want to roll my own command line parsing if possible.<br>
<br>
</div>I could be something as trivial as<br>
<br>
<tool> <tool arguments> -- <clang command line><br>
<br>
where the lack of "--" means that the tool should expect a build directory in <tool arguments>. You'd need to look through argv to find the "--" and split at that point, but the rest of the command-line parsing would be unchanged.<br>
</blockquote><div><br></div><div>I like the idea. I'll play around to see whether I can get the llvm command line parsing to behave like I want to.</div><div>I can probably just feed the part of the command line before the "--" to the llvm command line parsing.</div>
</div></blockquote><div><br></div></div></div>Yeah, I think it's fine to just feed everything before "--" to the llvm command-line parsing.</div><div><div class="im"><br><blockquote type="cite"><div class="gmail_quote">
<div>I'd probably like to write something like (leaving figuring out how to pack that up to make writing simple tools simpler for later):</div><div><define tool command lines></div><div>int main(...) {</div>
<div> // Make argc and argv references so we can change them? Kind of yuck, alternative ideas welcome.</div><div> llvm::OwningPtr<CompilationDatabase> Database(createSimpleCompilationDatabase(argc, argv));</div><div>
<do llvm command line parsing on rest of argc, argv></div><div> if (!Database) {</div><div> Database.reset(loadCompilationDatabase(BuildDirectory));</div><div> }</div><div> ClangTool Tool(Database.get(), Filenames);</div>
<div>}</div><div><br></div><div>where SimpleCompilationDatabase then returns the same arguments independent of the file name...</div><div><br></div></div></blockquote><div><br></div></div><div>Agreed.</div><div class="im">
<br><blockquote type="cite"><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div><br>
>> Then again, when you say "CompilationDatabase", I think of something that I can extract a set { commands, arguments, source file } tuples to be executed.<br>
>> Maybe that's where we are are talking past each other.<br>
><br>
> When I say CompilationDatabase I think of a map<filename, { command,<br>
> arguments, path}>. Does that make sense?<br>
<br>
<br>
</div>Makes sense to me, although of course this could end up being a multimap.<br></blockquote><div><br></div><div>Which is also an interesting design question - at least for refactorings we'll probably want to run over all different command lines the file was compiled with (think: different configurations with different macros enabled). This would lead to a slightly different interface for CompilationDatabase though (returning a list of compile command lines for each target).</div>
</div></blockquote><br></div></div><div>I think it's important for us to support this case in the interface.</div></div></blockquote><div><br></div>Agreed. So something like:<br><br>/// \brief Specifies the working directory and command of a compilation.<br>
struct CompileCommand {<br> /// \brief The working directory the command was executed from.<br> std::string Directory;<br><br> /// \brief The command line that was executed.<br> std::vector<std::string> CommandLine;<br>
};</div><div class="gmail_quote"><br></div><div class="gmail_quote">class CompilationDatabase {</div><div class="gmail_quote">public:</div><div class="gmail_quote"> /// \brief Returns all compile commands in which the specified file was compiled.</div>
<div class="gmail_quote"> ///</div><div class="gmail_quote"> /// This includes compile commands that span multiple source files.</div><div class="gmail_quote"> /// For example, for a compile command line</div><div class="gmail_quote">
/// $ clang++ -o test a.cc b.cc t.cc</div><div class="gmail_quote"> /// $ clang++ -o production a.cc b.cc -DPRODUCTION</div><div class="gmail_quote"> /// A compilation database representing the project would return both command lines </div>
<div class="gmail_quote"> /// for a.cc and b.cc and only the first command line for t.cc.</div><div class="gmail_quote"> virtual vector<CompileCommand> getCompileCommands(StringRef FilePath) = 0;</div><div class="gmail_quote">
};</div><div class="gmail_quote"><br></div><div class="gmail_quote">Also, I'd like to pull CompilationDatabase into its own header file if that is fine with you.<br><div><br></div><div>Thoughts?</div><div>/Manuel</div>
</div>