[clang] [llvm] [AMDGPU] Introduce asyncmark/wait intrinsics (PR #173259)

Nicolai Hähnle via cfe-commits cfe-commits at lists.llvm.org
Fri Jan 2 07:53:17 PST 2026


https://github.com/nhaehnle commented:

Mostly LGTM.

I suspect there's an infinite loop in how `mergeAsyncMarkers` works when the IR contains a loop that has an `asyncmark` intrinsic but no `wait.asyncmark`. What prevents the `AsyncMarkers` array from just growing indefinitely?
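To make the concern concrete, here is a toy model of the fixed-point iteration I have in mind. It is only a sketch and not the code in this PR: `Markers`, `transferLoopBody`, `mergeMarkers`, `reachesFixedPoint` and the `Cap` parameter are all hypothetical names, and I'm assuming the state is a list with one entry per outstanding `asyncmark`. With no bound on the list, the state at the loop header changes on every visit and the iteration never converges; with some bound it does.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

// Hypothetical per-block state: one counter per outstanding asyncmark,
// oldest first. This is NOT the data structure used in the PR.
using Markers = std::vector<unsigned>;

// Toy transfer function for a loop body that executes one asyncmark and
// never waits on it: the outgoing state has one more entry than the incoming.
Markers transferLoopBody(Markers In) {
  In.push_back(0);
  return In;
}

// Toy merge at a join point: element-wise max over the common prefix, keeping
// the longer list. Cap == 0 means "unbounded"; otherwise only the newest Cap
// entries are kept.
Markers mergeMarkers(const Markers &A, const Markers &B, std::size_t Cap) {
  const Markers &Long = A.size() >= B.size() ? A : B;
  const Markers &Short = A.size() >= B.size() ? B : A;
  Markers Out = Long;
  for (std::size_t I = 0; I < Short.size(); ++I)
    Out[I] = std::max(Out[I], Short[I]);
  if (Cap != 0 && Out.size() > Cap)
    Out.erase(Out.begin(), Out.end() - static_cast<std::ptrdiff_t>(Cap));
  return Out;
}

// Iterate the loop header state until it stops changing, giving up after
// MaxIters visits. Returns true if a fixed point was reached.
bool reachesFixedPoint(std::size_t Cap, unsigned MaxIters) {
  Markers HeaderIn; // state on entry to the loop header
  for (unsigned I = 0; I < MaxIters; ++I) {
    Markers BackEdge = transferLoopBody(HeaderIn);
    Markers Merged = mergeMarkers(HeaderIn, BackEdge, Cap);
    if (Merged == HeaderIn)
      return true; // converged
    HeaderIn = std::move(Merged);
  }
  return false; // still growing after MaxIters visits
}
```

In this model `reachesFixedPoint(0, 1000)` gives up without converging, while `reachesFixedPoint(4, 1000)` converges once the list is full. If the real merge already has an equivalent bound, a comment pointing at it would be enough.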

It would generally be nice to have some additional tests where the asyncmarkers in control flow aren't balanced. BTW, I think it'd be fine to have tests that are a bit more bare-bones, e.g. always using the same pointer arguments for the load.lds. That would result in less "noise" in the assembly.

https://github.com/llvm/llvm-project/pull/173259

