stevenwdv wrote: I later did a benchmark of that specific part in https://github.com/WebAssembly/binaryen/pull/8257#discussion_r2798873934, which yielded a difference of approximately 0.8s of useless reallocs & copies. This upstream version has more copies though. https://github.com/llvm/llvm-project/pull/179048