spupyrev wrote: Do you have some recent perf numbers for this kind of optimization? We've tried something similar internally in the past but didn't see any perf wins; i wonder if it makes sense for us to reconsider the optimization. https://github.com/llvm/llvm-project/pull/68860