[cfe-dev] Clang generates calls to llvm.memcpy with overlapping arguments, but LangRef requires the arguments to not overlap
Richard Smith via cfe-dev
cfe-dev at lists.llvm.org
Tue Aug 25 13:16:35 PDT 2020
On Tue, 25 Aug 2020, 10:53 Florian Hahn via cfe-dev, <cfe-dev at lists.llvm.org>
wrote:
> Hi,
>
> It appears that Clang generates calls to `llvm.memcpy` with potentially
> overlapping arguments in some cases.
>
> For the snippet below
>
> struct S
> {
> char s[25];
> };
>
> struct S *p;
>
> void test2() {
> ...
> foo (&b, 1);
> b = a;
> b = *p;
> ...
> }
>
>
> Clang uses `llvm.memcpy` to copy the struct:
>
> call void @foo(%struct.S* %2, i32 1)
> %7 = bitcast %struct.S* %2 to i8*
> %8 = bitcast %struct.S* %1 to i8*
> call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 1 %7, i8* align 1 %8, i64
> 25, i1 false)
> %9 = load %struct.S*, %struct.S** @p, align 8
> %10 = bitcast %struct.S* %2 to i8*
> %11 = bitcast %struct.S* %9 to i8*
> call void @llvm.memcpy.p0i8.p0i8.i64(i8* align 1 %10, i8* align 1 %11,
> i64 25, i1 false)
>
>
> In the C example, `foo` could set `p = &b` and then `b = *p` would just
> copy the contents from `b` into `b`. This means that the the arguments to
> the second llvm.memcpy call may overlap, which seems not allowed according
> to the current version of the LangRef (
> https://llvm.org/docs/LangRef.html#llvm-memcpy-intrinsic). This is
> problematic, because the fact is used in BasicAliasAnalysis for example (
> https://github.com/llvm/llvm-project/blob/master/llvm/lib/Analysis/BasicAliasAnalysis.cpp#L982
> ).
>
> The full, build-able example can be found here:
> https://godbolt.org/z/PY1vKq
>
> I might be missing something, but it appears that Clang should not create
> call to `llvm.memcpy` unless it can guarantee the arguments cannot overlap.
> I am not sure what the best alternative to `llvm.memcpy` would be in case
> the arguments overlap.
>
To be clear: the concern here is that the source and destination could be
equal, not that they might partially overlap (which I'm pretty sure the
relevant language rules disallow)?
This seems reminiscent of the issue that memcpy is formally undefined if
given a size of zero and a null pointer argument, for which we changed the
rules for @llvm.memcpy to define the behaviour in that case. Can we do the
same here (or at least add another LLVM intrinsic that permits exact
overlap, but not partial overlap)? Note that both GCC and Clang have pretty
much always generated a memcpy for cases such as:
struct A { char buff[999999]; };
void f(struct A *p, struct A *q) { *p = *q; }
in both C and C++ modes, and MSVC and ICC do roughly the same thing when
optimizing (though in ICC's case it invokes an intel-specific function, and
who knows if that explicitly defines the behavior for exact overlap), so
one would imagine that if there are any memcpy implementations for which
memcpy(p, p, n) doesn't work, we'd have seen program misbehavior by now. As
a consequence, as with the memcpy(0, 0, 0) case, LLVM can probably lower a
memcpy-with-exact-overlap-permitted to a memcpy libcall on ~all targets.
Cheers,
> Florian
> _______________________________________________
> cfe-dev mailing list
> cfe-dev at lists.llvm.org
> https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-dev
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.llvm.org/pipermail/cfe-dev/attachments/20200825/2edaabf6/attachment.html>
More information about the cfe-dev
mailing list