<div dir="ltr"><br><br><div class="gmail_quote"><div dir="ltr">On Wed, Sep 21, 2016 at 3:50 AM Luke Drummond <<a href="mailto:luke.drummond@codeplay.com">luke.drummond@codeplay.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Hi Zachary<br class="gmail_msg">
<br class="gmail_msg">
Thanks for the review. My comments are inline (I've cut down some of the<br class="gmail_msg">
diff context for brevity)<br class="gmail_msg">
<br class="gmail_msg">
On 16/09/16 15:57, Zachary Turner wrote:<br class="gmail_msg">
[snip]<br class="gmail_msg">
> -#define MAXLINESTR_(x) "%" STRINGIFY(x) "s"<br class="gmail_msg">
> -#define MAXLINESTR MAXLINESTR_(MAXLINE)<br class="gmail_msg">
> +bool RSModuleDescriptor::ParsePragmaCount(llvm::StringRef *lines,<br class="gmail_msg">
> + size_t n_lines) {<br class="gmail_msg">
> + // Skip the pragma prototype line<br class="gmail_msg">
> + ++lines;<br class="gmail_msg">
> + for (; n_lines--; ++lines) {<br class="gmail_msg">
> + const auto kv_pair = lines->split(" - ");<br class="gmail_msg">
> + m_pragmas[kv_pair.first.trim().str()] =<br class="gmail_msg">
> kv_pair.second.trim().str();<br class="gmail_msg">
> + }<br class="gmail_msg">
> + return true;<br class="gmail_msg">
> +}<br class="gmail_msg">
> +<br class="gmail_msg">
> +bool RSModuleDescriptor::ParseExportReduceCount(llvm::StringRef *lines,<br class="gmail_msg">
> + size_t n_lines) {<br class="gmail_msg">
><br class="gmail_msg">
> This should take an ArrayRef<StringRef>. In general this is true any<br class="gmail_msg">
> time you're passing an array to a function via (T* items, size_t length).<br class="gmail_msg">
><br class="gmail_msg">
><br class="gmail_msg">
> + // The list of reduction kernels in the `.<a href="http://rs.info" rel="noreferrer" class="gmail_msg" target="_blank">rs.info</a><br class="gmail_msg">
> <<a href="http://rs.info" rel="noreferrer" class="gmail_msg" target="_blank">http://rs.info</a>>` symbol is of the form<br class="gmail_msg">
> + // "signature - accumulatordatasize - reduction_name -<br class="gmail_msg">
> initializer_name -<br class="gmail_msg">
> + // accumulator_name - combiner_name -<br class="gmail_msg">
> + // outconverter_name - halter_name"<br class="gmail_msg">
> + // Where a function is not explicitly named by the user, or is<br class="gmail_msg">
> not generated<br class="gmail_msg">
> + // by the compiler, it is named "." so the<br class="gmail_msg">
> + // dash separated list should always be 8 items long<br class="gmail_msg">
> + Log *log = GetLogIfAllCategoriesSet(LIBLLDB_LOG_LANGUAGE);<br class="gmail_msg">
> + // Skip the exportReduceCount line<br class="gmail_msg">
> + ++lines;<br class="gmail_msg">
> + for (; n_lines--; ++lines) {<br class="gmail_msg">
><br class="gmail_msg">
> With ArrayRef this would be written as:<br class="gmail_msg">
><br class="gmail_msg">
> for (StringRef line : lines.drop_front()) {<br class="gmail_msg">
> }<br class="gmail_msg">
><br class="gmail_msg">
I'm not sure what the technical argument against pointer and length vs<br class="gmail_msg">
ArrayRef is, but I agree these range-based for loops are pretty. I'll<br class="gmail_msg">
add them in a future commit (I've got some more RenderScript cleanups<br class="gmail_msg">
pending, and I'll test this locally first - but this looks like good<br class="gmail_msg">
cleanup).<br class="gmail_msg"></blockquote><div>A couple of advantages:</div><div><br></div><div>1) pointer-and-length is unsafe from a security standpoint. You run the risk of accessing the array out of bounds, whereas ArrayRef will assert if you try to do this, so you don't go stomp some memory and only find out about it later.</div><div><br></div><div>2) ArrayRef maintains its own length and can be trivially copied, sliced and manipulated so you can use it in a functional style. Not having to maintain the length yourself as you iterate over it in fancy ways removes one possible source of programmer error (plus it just makes the code look nicer).</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br class="gmail_msg">
[snip]<br class="gmail_msg">
> __attribute__((kernel))<br class="gmail_msg">
> + {"exportForEachCount", eExportForEach},<br class="gmail_msg">
> + // The number of generalreductions: This marked in the<br class="gmail_msg">
> script by `#pragma<br class="gmail_msg">
> + // reduce()`<br class="gmail_msg">
> + {"exportReduceCount", eExportReduce},<br class="gmail_msg">
> + // Total count of all RenderScript specific `#pragmas` used<br class="gmail_msg">
> in the script<br class="gmail_msg">
> + {"pragmaCount", ePragma},<br class="gmail_msg">
> + {"objectSlotCount", eObjectSlot}}};<br class="gmail_msg">
><br class="gmail_msg">
> // parse all text lines of .<a href="http://rs.info" rel="noreferrer" class="gmail_msg" target="_blank">rs.info</a> <<a href="http://rs.info" rel="noreferrer" class="gmail_msg" target="_blank">http://rs.info</a>><br class="gmail_msg">
> for (auto line = info_lines.begin(); line != info_lines.end();<br class="gmail_msg">
> ++line) {<br class="gmail_msg">
><br class="gmail_msg">
> How about (for auto line : info_lines) {<br class="gmail_msg">
<br class="gmail_msg">
We actually need to manually increment the line pointer to skip over the<br class="gmail_msg">
already-parsed lines, so the range based for loop doesn't work in this case.<br class="gmail_msg"></blockquote><div>Ahh, yea I see where you're doing that now. But after incrementing a line, it's not checked whether it's at end until it gets to the next loop iteration. This could cause a double-increment off the end right?</div><div><br></div><div>BTW, there's another style which you may or may not be fond of that would also handle this case. It would go like this:</div><div><br></div><div>auto lines_ref = llvm::makeArrayRef(lines);</div><div>while (!lines_ref.empty()) {</div><div> auto line = lines_ref.front();</div><div><br></div><div> // do stuff</div><div><br></div><div> // ++line;</div><div> lines_ref = lines_ref.drop_front(); // instead</div><div><br></div><div> // end of loop</div><div> lines_ref = lines_ref.drop_front();</div><div>}</div><div><br></div><div>It doesn't buy you much in this case, but it can be a nice pattern sometimes.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br class="gmail_msg">
><br class="gmail_msg">
> - uint32_t numDefns = 0;<br class="gmail_msg">
> - if (sscanf(line->c_str(), "exportVarCount: %" PRIu32 "",<br class="gmail_msg">
> &numDefns) == 1) {<br class="gmail_msg">
> - while (numDefns--)<br class="gmail_msg">
> - m_globals.push_back(RSGlobalDescriptor(this,<br class="gmail_msg">
> (++line)->c_str()));<br class="gmail_msg">
> - } else if (sscanf(line->c_str(), "exportForEachCount: %" PRIu32 "",<br class="gmail_msg">
> - &numDefns) == 1) {<br class="gmail_msg">
> - while (numDefns--) {<br class="gmail_msg">
> - uint32_t slot = 0;<br class="gmail_msg">
> - name[0] = '\0';<br class="gmail_msg">
> - static const char *fmt_s = "%" PRIu32 " - " MAXLINESTR;<br class="gmail_msg">
> - if (sscanf((++line)->c_str(), fmt_s, &slot, name.data()) ==<br class="gmail_msg">
> 2) {<br class="gmail_msg">
> - if (name[0] != '\0')<br class="gmail_msg">
> - m_kernels.push_back(RSKernelDescriptor(this,<br class="gmail_msg">
> name.data(), slot));<br class="gmail_msg">
> - }<br class="gmail_msg">
> - }<br class="gmail_msg">
> - } else if (sscanf(line->c_str(), "pragmaCount: %" PRIu32 "",<br class="gmail_msg">
> &numDefns) ==<br class="gmail_msg">
> - 1) {<br class="gmail_msg">
> - while (numDefns--) {<br class="gmail_msg">
> - name[0] = value[0] = '\0';<br class="gmail_msg">
> - static const char *fmt_s = MAXLINESTR " - " MAXLINESTR;<br class="gmail_msg">
> - if (sscanf((++line)->c_str(), fmt_s, name.data(),<br class="gmail_msg">
> value.data()) != 0) {<br class="gmail_msg">
> - if (name[0] != '\0')<br class="gmail_msg">
> - m_pragmas[std::string(name.data())] = value.data();<br class="gmail_msg">
> - }<br class="gmail_msg">
> - }<br class="gmail_msg">
> - } else {<br class="gmail_msg">
> - Log *log(GetLogIfAllCategoriesSet(LIBLLDB_LOG_LANGUAGE));<br class="gmail_msg">
> - if (log) {<br class="gmail_msg">
> + const auto kv_pair = line->split(": ");<br class="gmail_msg">
> + const auto key = kv_pair.first;<br class="gmail_msg">
> + const auto val = kv_pair.second.trim();<br class="gmail_msg">
> +<br class="gmail_msg">
> + const auto handler = rs_info_handlers.find(key);<br class="gmail_msg">
> + if (handler == rs_info_handlers.end())<br class="gmail_msg">
> + continue;<br class="gmail_msg">
><br class="gmail_msg">
> Instead of constructing this map, you could use llvm::StringSwitch, like<br class="gmail_msg">
> this:<br class="gmail_msg">
> int handler = llvm::StringSwitch<int>(key)<br class="gmail_msg">
> .Case("exportVarCount", eExportVar)<br class="gmail_msg">
> .Case("exportForEachCount", eExportForEach)<br class="gmail_msg">
> .Case("exportReduceCount", eExportReduce)<br class="gmail_msg">
> .Case("pragmaCount", ePragma)<br class="gmail_msg">
> .Case("objectSlotCount", eObjectSlot)<br class="gmail_msg">
> .Default(-1);<br class="gmail_msg">
><br class="gmail_msg">
<br class="gmail_msg">
I wasn't aware at all of `llvm::StringSwitch`. This seems like it is<br class="gmail_msg">
more fit for purpose than what I had. Will add.<br class="gmail_msg">
<br class="gmail_msg">
[snip]<br class="gmail_msg">
> @@ -110,6 +162,7 @@ public:<br class="gmail_msg">
> const lldb::ModuleSP m_module;<br class="gmail_msg">
> std::vector<RSKernelDescriptor> m_kernels;<br class="gmail_msg">
> std::vector<RSGlobalDescriptor> m_globals;<br class="gmail_msg">
> + std::vector<RSReductionDescriptor> m_reductions;<br class="gmail_msg">
> std::map<std::string, std::string> m_pragmas;<br class="gmail_msg">
><br class="gmail_msg">
> You should consider changing this to llvm::StringMap, which has better<br class="gmail_msg">
> performance characteristics than std::map<br class="gmail_msg">
<br class="gmail_msg">
For the case of a small number of keys (which is certainly true for the<br class="gmail_msg">
`m_pragmas` case), I've found `std::map` to have significantly better<br class="gmail_msg">
performance characteristics in artificial tests when compared to<br class="gmail_msg">
`llvm::StringMap` (x86_64 linux gcc-6.2 with libstdc++) so I'm, not too<br class="gmail_msg">
keen to replace all instances of `std::map<string, T>` at this time. In<br class="gmail_msg">
many cases though, I think the llvm containers (particularly StringRef)<br class="gmail_msg">
have a number of advantages over the standard library, and I'll<br class="gmail_msg">
definitely start moving over where it makes sense.<br class="gmail_msg">
<br class="gmail_msg">
I threw together a very simple testcase for these performance numbers,<br class="gmail_msg">
which is attached.<br class="gmail_msg"></blockquote><div><br></div><div>Interesting, I'll take a look. </div></div></div>