[llvm] [WIP][AMDGPU] combine uniform AMDGPU lane Intrinsics (PR #116953)
Pankaj Dwivedi via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 12 00:24:28 PST 2024
PankajDwivedi-25 wrote:
> Also, there must be a test that shows how this pass can be used to eliminate a trivial waterfall loop. That is a loop which depends on "wfall", and all threads in the wave finish their work in the first iteration itself.
I see this case is not getting optimized in the checks generated & the loop structure remains intact.
https://github.com/llvm/llvm-project/pull/116953
More information about the llvm-commits
mailing list