[clang] [llvm] [AMDGPU] Implement Waitcnt Expansion for Profiling (PR #169345)

Pankaj Dwivedi via llvm-commits llvm-commits at lists.llvm.org
Mon Nov 24 08:14:52 PST 2025


PankajDwivedi-25 wrote:

> Why would you restrict this to "non-zero counter values"?

When a waitcnt already has a zero counter value expanding it would just generate another waitcnt(0), which provides no additional profiling granularity.
If you believe there's a use case for expanding waitcnt(0), I'd be happy to discuss it.



> Why does this need a new subtarget feature?

On this, I am still not fully sure what would be the best approach to handle. I there any suggestion from you?

https://github.com/llvm/llvm-project/pull/169345


More information about the llvm-commits mailing list