[llvm-dev] [Coroutines] How to handle arguments passed by value semantically?
Xun Li via llvm-dev
llvm-dev at lists.llvm.org
Wed May 12 08:07:39 PDT 2021
The description of the problem doesn't sound right to me.
Even when parameters are in the form of a byval pointer, their content
will still be copied into the local stack for coroutines correctly,
not just copying pointers. This is currently handled by the front-end.
For example: https://godbolt.org/z/EGoaMEje4
If you look at the IR of foo, there is a memcpy that copies the
parameter to local stack.
On Wed, May 12, 2021 at 12:43 AM chuanqi.xcq
<yedeng.yd at linux.alibaba.com> wrote:
>
> Hi all,
>
> Recently I am trying to work on coroutine bug here: https://reviews.llvm.org/D101980.
> This patch is trying to copy the arguments who had been marked with `byval` into the coroutine frame.
> There summary in the link tells some background.
> Simply,
> (1) By the language standard, the arguments of coroutine should be copied into the `coroutine state`.
> (2) The implementation now in LLVM would copy the arguments of coroutine into the coroutine frame if needed.
> (3) However, the arguments in IR are not equal to the arguments in the source. For example, the following code in C++
> ```
> void foo() {
> coro_func(A());
> }
> ```
> would be translated into something like:
> ```
> void foo() {
> %a = alloca A
> ; initializing for %a
> coro_func(%a);
> }
> ```
> So the actual argument type in IR becomes A* instead of A. And the current implementation in LLVM coroutine
> module would only copy the pointer to A into the coroutine frame instead of A, which is the problem I want to solve.
>
> I had thought we could handle these case by handling the arguments marked with `byval`. But as @rjmccall said, there are
> cases where arguments are passed by value semantically and passed by reference actually.
>
> So here is the decision we need to make. Should we mark all they arguments passed by value semantically in the frontend?
> Or should we just emit the copy in the callee site for coroutines in the frontend?
>
> It looks like the both solution would take many efforts. Then there are drawbacks for each of them.
> First, for the solution based on attribute marks. The drawbacks are:
> (1) What's the attribute should we use? The `byval` is deprecated actually. So we need to design a new attribute to mark the arguments
> of coroutine. I am not sure if it is a good idea to use a new attribute when we meet a new problem. It looks like there are too many
> attributes and metadata in Clang/LLVM now.
> (2) The extra copy. There would be an extra copy in the coroutine to copy the arguments to the frame. Maybe we can do some peephole
> optimization to mitigate the problem.
> (3) Multiple destruction problem. I am not sure if this the same problem @rjmccall mentioned about `move semantics`. My understanding is
> ```
> class A{
> void* data_ptr;
> A(A&&);
> ~A() { if(data_ptr) delete data_ptr; }
> };
> void foo() {
> A a;
> coro_func(std::move(a));
> }
> ```
> The problem here is that the data_ptr of A copied in the coroutine frame may be invalided by the destruction in the temporary produced
> in the caller site if we choose to copy A into the frame instead of move. But we can't get the semantics about move in the middle end.
> The final problem about destruction looks the hardest to handle for the attribute based solutions.
>
> And for the solution based on emitting the copy in the callee site for coroutines,
> (1) It would violate the principle of LLVM to handle the copy of arguments. LLVM would emit the copy in the caller site instead of callee site.
> I believe there must be some reasons that LLVM decides to emit the copy in the caller site. I hope someone could tell me if any one knows.
> And it would make the styles to handle arguments broken in Clang/LLVM.
> (2) It looks not easy to refactor this even only for coroutine. This is not a strong argument. I just say that we need a long time to do this refactoring.
> And there may be some bugs remained during this process.
> (3) We need to teach other passes. If we choose to emit the copy in the callee site, the IR for coroutine now would look like:
> ```
> void coro_func(A* %a) {
> %a.copy = alloca A
> copy %a to %a.copy
> ; use %a.copy instead of %a
> }
> ```
> This pattern should be eliminated by LLVM passes. So we need to teach LLVM passes to remain this structure, which means we need to insear guard codes
> to many other passes.
> BTW, if we compile C++ code in debug mode, we would see a similar pattern:
> ```
> void coro_func(A* %a) {
> %a.addr = alloca A*
> store %a, %a.addr
> %a.val = load %a.addr
> ; use %a.copy instead of %a.val
> }
> ```
> This pattern only copies the address of %a into the new allocated variables instead of copying the contents. I don't know the meaning of doing so now.
>
> So it is clearly that both solutions had some drawbacks and not easy to implement. Do you guys have any thoughts?
>
> Thanks,
> Chuanqi
--
Xun
More information about the llvm-dev
mailing list