<div dir="ltr"><br><br><div class="gmail_quote">On Wed, Apr 1, 2015 at 8:07 PM Duncan P. N. Exon Smith <<a href="mailto:dexonsmith@apple.com">dexonsmith@apple.com</a>> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
> On 2015 Apr 1, at 17:07, JF Bastien <<a href="mailto:jfb@google.com" target="_blank">jfb@google.com</a>> wrote:<br>
><br>
> > On Mon, Mar 30, 2015 at 9:52 AM Eric Christopher <<a href="mailto:echristo@gmail.com" target="_blank">echristo@gmail.com</a>> wrote:<br>
> > From PR18808 I said a few things and that I was going to redirect to the mailing list for further discussion. So here we are, go.<br>
> ><br>
> > 1) Whether or not to allow changing of target-cpu/target-feature/<u></u>triple at link time code generation.<br>
> ><br>
> > - Not convinced here of the facility to do so. Could just recompile the individual bitcode files to get what you want, but there are some users that are trying to ship bitcode (as crazy as that sounds).<br>
><br>
> IMO, it's cleanest of the target-cpu/target-feature/etc. are set at<br>
> compile time. That's where users are accustomed to specifying codegen<br>
> options already, and besides: the frontend needs to know the backend in<br>
> order to conform to the ABI, set macros, emit calls to target-specific<br>
> intrinsics, etc.<br>
><br>
> Cleanest yes, most familiar yes, but doesn't fit the usecase of PNaCl/Emscripten/Renderscript/<u></u>Halide/... as Eric was mentioning. These indeed need to figure out proper ABI, macros, intrinsics, but the existence of these is a pretty good proof that something can be done (I'm not saying it's clean or pretty!).<br>
<br>
But the typical clang user shouldn't suffer just because there are<br>
interesting use cases out there. Cleanest and familiar are important.<br>
<br>
I'd be happy enough with a command-line option to specify "don't encode<br>
the target" to support this kind of thing. Although Eric's idea from<br>
elsewhere in the thread seems better than adding a driver option:<br>
<br>
$ clang -target le64-unknown-unknown -emit-llvm foo.c -o foo.bc<br>
$ clang -target aarch64-linux-gnu foo.bc -O3 -o foo.aarch64<br>
$ clang -target x86_64-linux-gnu foo.bc -O3 -o foo.x86_64<br>
<br>
In this scenario, I figure the Frontend would recognize `le64` as a<br>
special architecture whose target shouldn't get encoded in the IR... or<br>
the backend would recognize that it should be overwritten.<br>
<br></blockquote><div><br></div><div>I think, in this sort of case, le64-unknown-unknown (or some such) is actually going to be a decent sort of triple for the generic case. The question is what to do at lto/codegen time as far as:</div><div><br></div><div>a) making them rewrite, or</div><div>b) allowing it to change from the outside as we were imagining for the llc type of use case or </div><div>c) at least programmatically, i.e. if you explicitly construct a target machine it'll override at least the triple</div><div><br></div><div>a seems the friendliest to us, b seems a little weird, but piggy backs on our existing plans, c seems kinda fun and opens up some decent apis, but I'm not sure about some of the longer term consequences.</div><div><br></div><div>Just some food for thought as we go through this.</div><div><br></div><div>-eric</div><div><br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
> > That said I was actually envisioning something like:<br>
> ><br>
> > clang -emit-llvm foo.c -o foo.bc<br>
> > ...<br>
> ><br>
> > clang -O3 -flto all.bc -arch x86_64h -o haswell_slice<br>
> > clang -O3 -flto all.bc -arch x86_64 -o x86_64_slice<br>
> ><br>
> > for the same set of bitcode files. But given the front end language restrictions on doing anything actually interesting there it's not too much of a constraint.<br>
><br>
> Many of the differences between architectures CPUs affect preprocesser<br>
> definitions, right? Link-time is too late for the frontend to emit<br>
> Haswell-specific intrinsics, for example.<br>
><br>
> That said, it would be cool if this worked.<br>
><br>
> Agreed.<br>
><br>
> > Another usage is the (admittedly one I don't think we want to support) halide one that I discovered this week:<br>
> ><br>
> > clang foo.c -emit-llvm foo.bc<br>
> > clang -target aarch64-linux-gnu foo.bc -O3 -o foo.aarch64<br>
> > clang -target x86_64-linux-gnu foo.bc -O3 -o foo.x86_64<br>
> > ...<br>
><br>
> Whereas this is just insane :0.<br>
><br>
> Disagreed: different usecase from above :-)<br>
><br>
> Whoever maintains "portable" things has to figure out how this works, and I think right now it's still the wild west (hence insane may not be too far of a qualification), but I don't think it's an invalid usecase.<br>
<br>
I didn't say invalid :).</blockquote></div></div>