[llvm] [AMDGPU] Change default loop alignment (PR #155343)
    via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Tue Aug 26 07:14:14 PDT 2025
    
    
  
hjagasiaAMD wrote:
> You also need to factor the idea that an inner loop may be nested 10 levels deep, which is not unusual for MIOpen. Then the inner one tends to be small.
Can you pls elaborate on this. All small loops up to 3 cache lines (192 bytes) are aligned, nested or not. Innermost Loops larger than 3 cache lines are also aligned.
https://github.com/llvm/llvm-project/pull/155343
    
    
More information about the llvm-commits
mailing list