[clang] [llvm] [AMDGPU] Implement Waitcnt Expansion for Profiling (PR #169345)
Pankaj Dwivedi via llvm-commits
llvm-commits at lists.llvm.org
Mon Nov 24 08:14:52 PST 2025
PankajDwivedi-25 wrote:
> Why would you restrict this to "non-zero counter values"?
When a waitcnt already has a zero counter value, expanding it would just generate another waitcnt(0), which provides no additional profiling granularity.
If you believe there's a use case for expanding waitcnt(0), I'd be happy to discuss it.
> Why does this need a new subtarget feature?
On this, I am still not sure what the best approach would be. Do you have any suggestions?
https://github.com/llvm/llvm-project/pull/169345