<div dir="ltr"><span style="font-size:12.8000001907349px">> That doesn’t sound very safe :).</span><div><span style="font-size:12.8000001907349px"><br></span></div><div><span style="font-size:12.8000001907349px">That's why I opted for noinline first. :) </span></div><div><span style="font-size:12.8000001907349px"><br></span></div><div>> <span style="font-size:12.8000001907349px">Let me run some benchmarks tomorrow and make sure there is no unexpected performance impact. I am leaning towards turning off FORTIFY or using memcpy</span></div><div><span style="font-size:12.8000001907349px"><br></span></div><div><span style="font-size:12.8000001907349px">SGTM. </span></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Mon, Sep 7, 2015 at 7:14 PM, Steven Wu <span dir="ltr"><<a href="mailto:stevenwu@apple.com" target="_blank">stevenwu@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word"><br><div><span class=""><blockquote type="cite"><div>On Sep 6, 2015, at 11:54 PM, George Burgess IV <<a href="mailto:george.burgess.iv@gmail.com" target="_blank">george.burgess.iv@gmail.com</a>> wrote:</div><br><div><div dir="ltr">Hm. We can also make _strcpy_nochk always_inline. It doesn't *guarantee* a failure of __builtin_object_size, but it does require clang to lower to @llvm.objectsize, which is what was happening when nothing was failing. This way, we shouldn't end up dropping any perf on the floor, and we have a guaranteed solution for all platforms until someone makes @llvm.objectsize substantially smarter.</div></div></blockquote></span>That doesn’t sound very safe :). 
And you can play with the inline threshold in LLVM and break the benchmark.<span class=""><br><blockquote type="cite"><div><div dir="ltr"><div><br></div><div>WRT disabling FORTIFY: Ubuntu's default libc and Android's Bionic both seem to use -D_FORTIFY_SOURCE=0 to mean "turn off FORTIFY", so it's probably okay to assume that your proposed change will work well on most (if not all) platforms as well.</div><div><br></div><div>I'm happy with either solution. Please note that FORTIFY may slow things down a bit, so if we track things over time, then we might see a bit of a perf increase from the latter change.</div></div></div></blockquote></span>Let me run some benchmarks tomorrow and make sure there is no unexpected performance impact. I am leaning towards turning off FORTIFY or using memcpy.<div><div class="h5"><br><blockquote type="cite"><div><div dir="ltr"><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Sep 6, 2015 at 11:18 PM, Steven Wu <span dir="ltr"><<a href="mailto:stevenwu@apple.com" target="_blank">stevenwu@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word">Turning off inlining will hurt performance in the cases where the compiler can optimize the strcpy calls, and we do track performance for test-suite.<div>On Apple platforms, the minimal change can just be turning off FORTIFY_SOURCE for this benchmark. 
Do you know how your change will affect other platforms?</div><div><br></div><div>--- a/MultiSource/Benchmarks/MiBench/consumer-typeset/Makefile<br>+++ b/MultiSource/Benchmarks/MiBench/consumer-typeset/Makefile<br>@@ -1,7 +1,7 @@<br> LEVEL = ../../../..<br> <br> PROG = consumer-typeset<br>-CPPFLAGS = -DOS_UNIX=1 -DOS_DOS=0 -DOS_MAC=0 -DDB_FIX=0 -DUSE_STAT=1 -DSAFE_DFT=0 -DCOLLATE=1 -DLIB_DIR=\"lout.lib\" -DFONT_DIR=\"font\" -DMAPS_DIR=\"maps\" -DINCL_DIR=\"include\" -DDATA_DIR=\"data\" -DHYPH_DIR=\"hyph\" -DLOCALE_DIR=\"locale\" -DCHAR_IN=1 -DCHAR_OUT=0 -DLOCALE_ON=1 -DASSERT_ON=1 -DDEBUG_ON=0 -DPDF_COMPRESSION=0<br>+CPPFLAGS = -DOS_UNIX=1 -DOS_DOS=0 -DOS_MAC=0 -DDB_FIX=0 -DUSE_STAT=1 -DSAFE_DFT=0 -DCOLLATE=1 -DLIB_DIR=\"lout.lib\" -DFONT_DIR=\"font\" -DMAPS_DIR=\"maps\" -DINCL_DIR=\"include\" -DDATA_DIR=\"data\" -DHYPH_DIR=\"hyph\" -DLOCALE_DIR=\"locale\" -DCHAR_IN=1 -DCHAR_OUT=0 -DLOCALE_ON=1 -DASSERT_ON=1 -DDEBUG_ON=0 -DPDF_COMPRESSION=0 -D_FORTIFY_SOURCE=0<br> LDFLAGS = -lm<br> RUN_OPTIONS = -x -I $(PROJ_SRC_DIR)/data/include -D $(PROJ_SRC_DIR)/data/data -F $(PROJ_SRC_DIR)/data/font -C $(PROJ_SRC_DIR)/data/maps -H $(PROJ_OBJ_DIR)/data/hyph $(PROJ_SRC_DIR)/large.lout<br><br></div><div><div><div><br><div><blockquote type="cite"><div>On Sep 6, 2015, at 10:40 PM, George Burgess IV <<a href="mailto:george.burgess.iv@gmail.com" target="_blank">george.burgess.iv@gmail.com</a>> wrote:</div><br><div><div dir="ltr">Well, we can lie by obfuscating things a bit. :)<div><br></div><div>__builtin_object_size relies heavily on inlining to work well across-functions. 
So, the following should make it always fail:<div><br></div><div>static void __attribute__((noinline)) _strcpy_nochk(char *dst, char *src) { strcpy(dst, src); }</div><div>#define StringCopy(a, b) _strcpy_nochk((char *)(a), (char *)(b))</div><div><br></div></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Sun, Sep 6, 2015 at 9:25 PM, Steven Wu <span dir="ltr"><<a href="mailto:stevenwu@apple.com" target="_blank">stevenwu@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div style="word-wrap:break-word"><br><div><span><blockquote type="cite"><div>On Sep 6, 2015, at 6:38 PM, George Burgess IV <<a href="mailto:george.burgess.iv@gmail.com" target="_blank">george.burgess.iv@gmail.com</a>> wrote:</div><br><div><div dir="ltr"><div>Thanks for catching this!</div><div><br></div>I'm not familiar at all with test-suite, so someone else may be better suited to review your patch. That being said:<div><br></div><div>id3tag.c -- LGTM</div><div><br></div><div>externs.h -- Did some investigating. This code is rather different than what I'm used to. :)</div></div></div></blockquote><blockquote type="cite"><div><div dir="ltr"><div><br></div><div>From the compiler's perspective: string(OBJECT) returns a pointer to a 4-char array inside of OBJECT. We try to copy a 119-byte string (file path) into this. memcpy() works here because Apple uses __builtin_object_size(p, 0) for memcpy, and __builtin_object_size(p, 1) (sometimes 0) for strcpy; the former has no clue what size OBJECT is, so it fails, while the latter says "hey, this field is clearly 4 bytes," causing FORTIFY to kill the program when we try to copy more than that. 
(relevant header: <a href="http://opensource.apple.com/source/Libc/Libc-1044.10.1/include/secure/_string.h" target="_blank">http://opensource.apple.com/source/Libc/Libc-1044.10.1/include/secure/_string.h</a> )</div><div><br></div><div>From the program's perspective at runtime, OBJECT points to a non-constant sized malloc. NewWord guarantees that *OBJECT has sufficient space to store the entire string. The types lie.</div><div><br></div><div>In light of this, if we could add a comment to the new StringCopy definition that reads something like: "Types lie at some points in this program, and FORTIFY implementations rely on types. Using memcpy instead of strcpy here makes some FORTIFY impls permissive enough that they won't crash us when we're doing tricky (but still correct) things," that may be best. :)</div></div></div></blockquote><div><br></div></span>I guess there is no good way to lie to the type system that you can legally copy the string into a 4-char array?<br><div>How about we just #define StringCopy to __builtin___strcpy_chk with __builtin_object_size(p, 0)? It might break -fno-builtin build though.</div><span><font color="#888888"><div><br></div><div>Steven</div></font></span><div><div><div><br></div><br><blockquote type="cite"><div><div class="gmail_extra"><br><div class="gmail_quote">On Fri, Sep 4, 2015 at 11:25 PM, Steven Wu <span dir="ltr"><<a href="mailto:stevenwu@apple.com" target="_blank">stevenwu@apple.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">r246877 exposes some undefined behavior in LNT that causes the tests to crash. Both consumer-lame and consumer-typeset have strcpy calls that overflow the buffer, one intentionally, one maybe by accident. Here is my proposed way to fix the tests.<br>
<br>
Thanks<br>
<span><font color="#888888"><br>
Steven<br>
<br>
</font></span><br><br>
<br>
From 7151bf31954c9d0680e4bb592dbc130d0234650b Mon Sep 17 00:00:00 2001<br>
From: Steven Wu <<a href="mailto:stevenwu@apple.com" target="_blank">stevenwu@apple.com</a>><br>
Date: Fri, 4 Sep 2015 23:15:21 -0700<br>
Subject: [PATCH] Fix the undefined behavior in MiBench<br>
<br>
The undefined behavior is causing runtime failure after r246877.<br>
Fix consumer-lame by replacing strcpy with direct assignment to avoid<br>
a buffer overflow. consumer-typeset intentionally overflows the buffer, so<br>
use memcpy instead of strcpy.<br>
---<br>
MultiSource/Benchmarks/MiBench/consumer-lame/id3tag.c | 2 +-<br>
MultiSource/Benchmarks/MiBench/consumer-typeset/externs.h | 3 ++-<br>
2 files changed, 3 insertions(+), 2 deletions(-)<br>
<br>
diff --git a/MultiSource/Benchmarks/MiBench/consumer-lame/id3tag.c b/MultiSource/Benchmarks/MiBench/consumer-lame/id3tag.c<br>
index e24a966..23f2b86 100644<br>
--- a/MultiSource/Benchmarks/MiBench/consumer-lame/id3tag.c<br>
+++ b/MultiSource/Benchmarks/MiBench/consumer-lame/id3tag.c<br>
@@ -34,7 +34,7 @@ void id3_inittag(ID3TAGDATA *tag) {<br>
strcpy( tag->album, "");<br>
strcpy( tag->year, "");<br>
strcpy( tag->comment, "");<br>
- strcpy( tag->genre, "ˇ"); /* unset genre */<br>
+ tag->genre[0] = 'ˇ'; /* unset genre */<br>
tag->track = 0;<br>
<br>
tag->valid = 0; /* not ready for writing*/<br>
diff --git a/MultiSource/Benchmarks/MiBench/consumer-typeset/externs.h b/MultiSource/Benchmarks/MiBench/consumer-typeset/externs.h<br>
index 7e050bd..a14e8ee 100644<br>
--- a/MultiSource/Benchmarks/MiBench/consumer-typeset/externs.h<br>
+++ b/MultiSource/Benchmarks/MiBench/consumer-typeset/externs.h<br>
@@ -3266,7 +3266,8 @@ extern int strcollcmp(char *a, char *b);<br>
( UseCollate ? strcollcmp((char *)(a),(char *)(b)) <= 0 \<br>
: strcmp((char *)(a),(char *)(b)) <= 0 )<br>
#define StringCat(a, b) strcat((char *)(a),(char *)(b))<br>
-#define StringCopy(a, b) strcpy((char *)(a),(char *)(b))<br>
+#define StringCopy(a, b) (char*)memcpy((void *)(a),(void *)(b), \<br>
+ strlen((char*)(b)) + 1)<br>
#define StringLength(a) strlen((char *)(a))<br>
#define StringFOpen(a, b) fopen( (char *) (a), (b) )<br>
#define StringFPuts(a, b) fputs( (char *) (a), (b) )<br>
--<br>
2.3.8 (Apple Git-58)<br>
<br>
<br>
<br>
<br></blockquote></div><br></div>
</div></blockquote></div></div></div><br></div></blockquote></div><br></div>
</div></blockquote></div><br></div></div></div></div></blockquote></div><br></div>
</div></blockquote></div></div></div><br></div></blockquote></div><br></div>
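The mode 0 versus mode 1 distinction discussed in the thread can be sketched roughly as follows. This is only an illustration, not the Apple header or the benchmark code: `struct word`, `checked_copy`, and `report_sizes` are hypothetical names, and `checked_copy` merely models what a FORTIFY-style check does with the size it is given. The values `__builtin_object_size` reports depend on the compiler (GCC or Clang is assumed), its version, and the optimization level, so the sketch prints them rather than assuming them.

```c
#include <assert.h>
#include <stddef.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical stand-in for the benchmark's OBJECT: the declared field is
   4 chars, but the allocation behind it may be larger. */
struct word {
    int tag;
    char text[4];
};

/* Hypothetical model of a FORTIFY-style strcpy check. dst_size is what the
   compiler believes the destination holds; (size_t)-1 means "unknown", in
   which case no check is possible and the copy goes ahead. */
static int checked_copy(char *dst, size_t dst_size, const char *src) {
    assert(dst != NULL && src != NULL);
    size_t need = strlen(src) + 1; /* bytes to copy, including the NUL */
    if (dst_size != (size_t)-1 && need > dst_size)
        return -1; /* a real FORTIFY implementation would abort here */
    memcpy(dst, src, need);
    return 0;
}

/* Prints what the compiler reports for the field in both modes.
   Mode 0 asks about the whole enclosing object, mode 1 about the
   closest enclosing subobject (the field itself). */
static void report_sizes(struct word *w) {
    size_t whole = __builtin_object_size(w->text, 0);
    size_t field = __builtin_object_size(w->text, 1);
    printf("mode 0: %zu, mode 1: %zu\n", whole, field);
}
```

Passing 4 as `dst_size` models a mode 1 answer for the field, so a long path is rejected; passing `(size_t)-1` models an unknown mode 0 answer, under which the same copy is allowed as long as the caller really did allocate enough room.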