[PATCH] D33503: Make BinaryStreamReader::readCString a bit faster.

Zachary Turner via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed May 24 09:38:51 PDT 2017


zturner created this revision.

Previously it would do a character by character search for a null terminator, to account for the fact that an arbitrary stream need not store its data contiguously so you couldn't just do a `memchr`.  However, the stream API has a function which will return the longest contiguous chunk without doing a copy, and by using this function we can do a `memchr` on the individual chunks.  For certain types of streams like data from object files etc, this is guaranteed to find the null terminator with only a single `memchr`, but even with discontiguous streams such as `MappedBlockStream`, it's rare that any given string will cross a block boundary, so even those will almost always be satisfied with a single `memchr`.

This optimization is worth a 10-12% reduction in link time (4.2 seconds -> 3.75 seconds)


https://reviews.llvm.org/D33503

Files:
  llvm/lib/Support/BinaryStreamReader.cpp


Index: llvm/lib/Support/BinaryStreamReader.cpp
===================================================================
--- llvm/lib/Support/BinaryStreamReader.cpp
+++ llvm/lib/Support/BinaryStreamReader.cpp
@@ -42,29 +42,30 @@
 }
 
 Error BinaryStreamReader::readCString(StringRef &Dest) {
-  // TODO: This could be made more efficient by using readLongestContiguousChunk
-  // and searching for null terminators in the resulting buffer.
-
-  uint32_t Length = 0;
-  // First compute the length of the string by reading 1 byte at a time.
   uint32_t OriginalOffset = getOffset();
-  const char *C;
+  uint32_t FoundOffset = 0;
   while (true) {
-    if (auto EC = readObject(C))
+    uint32_t ThisOffset = getOffset();
+    ArrayRef<uint8_t> Buffer;
+    if (auto EC = readLongestContiguousChunk(Buffer))
       return EC;
-    if (*C == '\0')
+    StringRef S(reinterpret_cast<const char *>(Buffer.begin()), Buffer.size());
+    size_t Pos = S.find_first_of('\0');
+    if (LLVM_LIKELY(Pos != StringRef::npos)) {
+      FoundOffset = Pos + ThisOffset;
       break;
-    ++Length;
+    }
   }
-  // Now go back and request a reference for that many bytes.
-  uint32_t NewOffset = getOffset();
+  assert(FoundOffset >= OriginalOffset);
+
   setOffset(OriginalOffset);
+  size_t Length = FoundOffset - OriginalOffset;
 
   if (auto EC = readFixedString(Dest, Length))
     return EC;
 
-  // Now set the offset back to where it was after we calculated the length.
-  setOffset(NewOffset);
+  // Now set the offset back to after the null terminator.
+  setOffset(FoundOffset + 1);
   return Error::success();
 }
 


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D33503.100114.patch
Type: text/x-patch
Size: 1608 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170524/224863b6/attachment.bin>


More information about the llvm-commits mailing list