[PATCH] D19191: AMDGPU/SI: add llvm.amdgcn.ps.live intrinsic
Nicolai Hähnle via llvm-commits
llvm-commits at lists.llvm.org
Sat Apr 16 14:32:20 PDT 2016
nhaehnle added a comment.
I don't think it should be convergent, because other threads in the wave do not affect the return value. It does have to be a source of divergence, though.
I didn't consider the possibility of reading from the EXEC mask, but it's tricky. The intrinsic must return the //initial// EXEC mask. In a pixel shader, the current EXEC mask can be a superset of the initial EXEC. Even if Mesa always emits the intrinsic call as the first thing in the shader, I don't think we can/should rely on the various scheduler steps in LLVM maintaining that ordering.
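To illustrate the superset point, here is a minimal Python sketch. It assumes the extra lanes come from whole quad mode, where every lane of a 2x2 pixel quad is enabled if any lane in that quad is live. The function names and the mask value are made up for illustration; this models the masks only and is not how the backend implements the intrinsic.

```python
QUAD = 0xF  # four lanes per 2x2 pixel quad


def whole_quad_mode(mask: int, wave_size: int = 64) -> int:
    """Enable all four lanes of every quad that has at least one active lane."""
    out = 0
    for shift in range(0, wave_size, 4):
        if (mask >> shift) & QUAD:
            out |= QUAD << shift
    return out


def ps_live(lane: int, initial_exec: int) -> bool:
    """Hypothetical model of the intrinsic: true only for lanes in the initial EXEC."""
    return bool((initial_exec >> lane) & 1)


# Made-up initial EXEC: the lanes covered by the primitive at shader entry.
initial_exec = 0b0000_0110_1111_0001

# Current EXEC after helper lanes have been enabled for derivative computation.
current_exec = whole_quad_mode(initial_exec)

# Helper lanes are active now but were not live at shader entry.
helpers = current_exec & ~initial_exec
```

In this model, reading `current_exec` at an arbitrary point would report the helper lanes as live, which is why the result has to come from the initial mask.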
http://reviews.llvm.org/D19191