[llvm] [AMDGPU][MISched] Allow memory ops of different base pointers to be clustered (PR #140674)

via llvm-commits llvm-commits at lists.llvm.org
Tue Jun 24 10:23:04 PDT 2025


choikwa wrote:

> > In practice it will result in performance changes but since we have both an experimental and theoretical basis for the change I think that we should move forward with the patch.
> 
> What's the theoretical basis? The RFC thread ends with a suggestion "to understand why and where that 10% ~ 15% came from first".

As suggested in RFC, I can provide profiling results for more empirical data. Currently working through some issues with rocprofv3 + ATT on another issue, but once that gets resolved, I think we'll be able to get a better look at the HW level.

https://github.com/llvm/llvm-project/pull/140674


More information about the llvm-commits mailing list