<div dir="ltr">Fixing <a href="mailto:llvm-dev@llvm.org">llvm-dev@llvm.org</a> to <a href="mailto:llvm-dev@lists.llvm.org">llvm-dev@lists.llvm.org</a></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Nov 17, 2020 at 11:27 AM Thomas Lively <<a href="mailto:tlively@google.com">tlively@google.com</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">Thanks for your work on this, Paulo!<br><br>Here's some more detail about how function pointers work today in the<br>WebAssembly backend and how they differ from the `funcref` Paulo is working on.<br><br>Today in the WebAssembly backend, both function and data pointers are emitted as<br>WebAssembly `i32` values. Data pointers are indices into the WebAssembly linear<br>memory and function pointers are indices into a separate function table. The<br>linker is responsible for laying out function references in the table and<br>resolving function pointer relocations to the correct table indices.<br><br>WebAssembly `funcref` values are an alternative mechanism for referencing<br>functions that our LLVM backend does not yet support. Unlike the `i32` table<br>indices used to implement normal function pointers, `funcref` values are opaque<br>and non-integral. `funcref`s cannot be stored into memory and cannot be<br>inspected, but they can be called.<br><br>We would like to model both table indices and `funcref`s as function pointers in<br>LLVM IR since they are both callable in WebAssembly. Since table indices are<br>numbers that can be inspected, manipulated, and stored into memory, it makes<br>sense to model them as LLVM IR function pointers into a "normal" address space<br>like address space 0. `funcref` values should instead be modeled as LLVM IR<br>function pointers into a non-integral address space. Since we want to be able to<br>use both in a single program, we need the ability to call function pointers of<br>different address spaces in the same LLVM IR module. That means having multiple<br>program address spaces, as described in this RFC and Paulo's corresponding<br>patch.<br></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Nov 17, 2020 at 9:11 AM Paulo Matos <pmatos@linki.tools> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">Hello all,<br>

<br>

TL;DR; The current design for the implementation of reference types in<br>

the WebAssembly backend requires the use of multiple program address<br>

spaces. We propose an implementation of multiple address spaces in<br>

D91428 [1] - this is a backwards compatible change.<br>

<br>

# Problem<br>

<br>

Currently the default program address and the default data address space<br>

is the same, namely AS0. We can at the moment, change the program<br>

address space with P<n> in the data layout string. This allows harvard<br>

architectures to separate code and data into different address spaces.<br>

However, only one program address space is allowed.<br>

<br>

At Igalia [2], we are interested in implementing support for reference<br>

types [3] on the WebAssembly backend. After discussions with Thomas<br>

Lively and Andy Wingo, the design we have for reference types involves<br>

having funcrefs and externrefs living in a different address spaces<br>

(non-integral) from normal code/data. Since funcrefs are callable, we<br>

also need to be able to call them. However, as things stand if we use<br>

`P1` in the data layout, normal function calls will cease to work.<br>

<br>

# Design Summary<br>

<br>

The reference types implementation introduces two (reference) types:<br>

funcrefs and externrefs.<br>

Funcrefs are references to functions that can be called, while<br>

externrefs are opaque. Because of the way they interact with memory they<br>

need to live in a separate address space. However, to call funcrefs<br>

which live in a separate address space, this address space needs to be<br>

marked as a program address space. Since we wish that normal function<br>

calls keep working as well, we need AS0 to be a program address space<br>

too. This is what the solution to this RFC addresses. The Data Layout<br>

string for WebAssembly would therefore contain ni:1-P0-P1.<br>

<br>

# Solution<br>

<br>

As mentioned in the TL;DR; the proposed implementation is live in D91428<br>

[1]. The patch is small and backwards compatible, and there should be no<br>

visible changes, unless you use multiple program address spaces.<br>

<br>

With the patch, you should be able to use multiple Px-Py-... to mark<br>

your address spaces as code address spaces. The first one will be<br>

considered the default program address space. So, the order in which the<br>

Ps show up in the data layout string _matters_.<br>

<br>

The program address spaces are kept in a small vector without<br>

duplicates. Therefore P0-P1-P0 is the same as P0-P1, and P0 is the<br>

default address space.<br>

<br>

<br>

# Refs<br>

<br>

[1] <a href="https://reviews.llvm.org/D91428" rel="noreferrer" target="_blank">https://reviews.llvm.org/D91428</a><br>

[2] <a href="https://www.igalia.com" rel="noreferrer" target="_blank">https://www.igalia.com</a><br>

[3] <a href="https://webassembly.github.io/reference-types/core/" rel="noreferrer" target="_blank">https://webassembly.github.io/reference-types/core/</a><br>

<br>

Regards,<br>

-- <br>

Paulo Matos<br>

</blockquote></div>

</blockquote></div>