[llvm] [AMDGPU] Change default loop alignment (PR #155343)

via llvm-commits llvm-commits at lists.llvm.org
Tue Aug 26 13:07:10 PDT 2025


hjagasiaAMD wrote:

[triton-aiter.xlsx](https://github.com/user-attachments/files/21994782/triton-aiter.xlsx)
Triton Aiter results, especially for unbatched Llama, see a substantial performance hit

I see three options
- Warn the user when they have sub-optimal alignment and give a flag to user (though no easy for user to know whether the flag will help upfront - as alignment can have side effects and can be different impact on different architectures. Saw some cases where nops in unexecuted code paths changed alignment of subsequent executed branch targets hurting performance) (this would also mean we would introduce yet another backend flag)
- Abandon the PR

Would like reviewer suggestions

https://github.com/llvm/llvm-project/pull/155343


More information about the llvm-commits mailing list