[llvm] AMDGPU: Allow operand folding between loop body and its preheader (PR #137022)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Thu May 1 08:50:47 PDT 2025


arsenm wrote:

> I thought this was in the category of stuff we can't really do because we've conflated exec mask mutation with the LLVM CFG. When is this optimisation sound?

Yes, we don't have a proper representation and just have to hack around it. In this case, since the readfirstlanes are explicit in the original IR, it would be better served by #128687

https://github.com/llvm/llvm-project/pull/137022


More information about the llvm-commits mailing list