krzysz00 wrote: I think I agree with you. Would this definition make `gpu.barrier` correspond to HIP/CUDA's `__syncthreads()`? Since I have a suspicion that that was the original semantics of `gpu.barrier` https://github.com/llvm/llvm-project/pull/110527