[llvm] [AMDGPU][MISched] Allow memory ops of different base pointers to be clustered (PR #140674)
via llvm-commits
llvm-commits at lists.llvm.org
Tue Jun 24 10:23:04 PDT 2025
choikwa wrote:
> > In practice it will result in performance changes but since we have both an experimental and theoretical basis for the change I think that we should move forward with the patch.
>
> What's the theoretical basis? The RFC thread ends with a suggestion "to understand why and where that 10% ~ 15% came from first".
As suggested in RFC, I can provide profiling results for more empirical data. Currently working through some issues with rocprofv3 + ATT on another issue, but once that gets resolved, I think we'll be able to get a better look at the HW level.
https://github.com/llvm/llvm-project/pull/140674
More information about the llvm-commits
mailing list