<div dir="ltr">Ok last message. I bet it's because the patch writes std::string Name(reg_info->Name);<div><br></div><div>change this to StringRef and it should be fine. I'd be curious to see how many instructions that generates.</div></div><br><div class="gmail_quote"><div dir="ltr">On Tue, Sep 5, 2017 at 2:12 PM Zachary Turner <<a href="mailto:zturner@google.com">zturner@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">I noticed you said it generates new and delete. I find that strange, we should look into why that's happening because it's not supposed to be.</div><br><div class="gmail_quote"><div dir="ltr">On Tue, Sep 5, 2017 at 2:07 PM Zachary Turner <<a href="mailto:zturner@google.com" target="_blank">zturner@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">StringSwitch doesn't create any std::strings (doing so would allocate memory), but it does do the memcmp. Unless it's in a hot path I think optimizing for readability is the right choice.</div><br><div class="gmail_quote"><div dir="ltr">On Tue, Sep 5, 2017 at 2:02 PM Jason Molenda <<a href="mailto:jmolenda@apple.com" target="_blank">jmolenda@apple.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
<br>
> On Sep 5, 2017, at 1:34 PM, Davide Italiano <<a href="mailto:dccitaliano@gmail.com" target="_blank">dccitaliano@gmail.com</a>> wrote:<br>
><br>
> On Tue, Sep 5, 2017 at 1:23 PM, Jason Molenda <<a href="mailto:jmolenda@apple.com" target="_blank">jmolenda@apple.com</a>> wrote:<br>
>> Hi Davide, sorry I was offline for this discussion.<br>
>><br>
>> I was a little curious about llvm::StringSwitch, I hadn't seen it before. Is it creating std::string's for all of these strings, then memcmp'ing the contents? Greg originally wrote these RegisterIsCalleeSaved() methods in the ABI's hand optimizing the character comparisons for efficiency, sacrificing readability in the process big-time but we added the comments to make it easier to follow.<br>
>><br>
>> This version is much easier to read but loses the efficiency. Looking at the assembly generated by clang -Os, we're getting the same of the input string and then running memcmp() against all of the c-strings.<br>
>><br>
>> If we're going to ignore the efficiency that Greg was shooting for here, why not write it with an array of c-strings and strcmp, like<br>
>><br>
>> const char *preserved_registers[] = { "r12", "r13", "r14", "r15", "rbp", "ebp", "rbx", "ebx",<br>
>> "rip", "eip", "rsp", "esp", "sp", "fp", "pc", NULL };<br>
>><br>
>> for (int i = 0; preserved_registers[i] != NULL; i++)<br>
>> if (strcmp (reg, preserved_registers[i]) == 0)<br>
>> return true<br>
>> return false;<br>
>><br>
>> ?<br>
>><br>
>><br>
>> The original version, as hard to read as it was, compiles down to 60 instructions with no memory allocations or function calls with clang -Os. Using llvm::StringSwitch is 184, with new, delete, memcpy, memcmp function calls. The strcmp() one weighs in around 30-35 instructions with calls to strcmp.<br>
>><br>
>> I don't think this function is especially hot, I don't know if Greg's original focus on performance here was really the best choice. But if we're going to give up some performance, we could go the more generic strmp() route just as easily, couldn't we?<br>
>><br>
><br>
> Hi Jason,<br>
> I hoped to receive comments, so, thank you. I profiled lldb a bit<br>
> recently and I never saw this function showing up in the profile.<br>
> That said, I agree we shouldn't completely give up performances for<br>
> readability in this case. [In particular, I'm more worried about the<br>
> increase in code size].<br>
<br>
When I first looked at this function compiled -O0 it was 880 instructions long and I laughed out loud. :)<br>
<br>
I don't feel strongly about this, your change is fine, I was mostly curious if I was missing something.<br>
<br>
I wouldn't want to make extra work for equivalent readability/performance (IMO) unless you want to. I think many of the other ABI plugins have similar code in them; if I were changing any others, I would use the simpler loop & strcmp() method I think.<br>
<br>
<br>
><br>
> With that in mind, I'm under the impression your solution would work.<br>
> An alternative would be that of looking at clang codegen for<br>
> StringSwitch and see whether it generates this code.<br>
> FWIW, I expected that function to be lowered to a switch (in LLVM IR)<br>
> and in case it has too many cases, being transformed back into a loop.<br>
> I guess this actually doesn't work as LLVM doesn't really try to<br>
> modify libcalls lying around too much, and the optimizer can't reason<br>
> about this case.<br>
> I guess it will be an interesting thing to look regardless, but the<br>
> solution you propose seems better, at least in the sense that doesn't<br>
> rely on the compiler to generate particularly good code to be<br>
> efficient.<br>
><br>
> Do you want me to apply this patch & run the regression test suite?<br>
><br>
> Thanks,<br>
><br>
> --<br>
> Davide<br>
<br>
</blockquote></div></blockquote></div></blockquote></div>