[PATCH] D18162: AMDGPU: Add SIWholeQuadMode pass
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 21 10:56:57 PDT 2016
arsenm added a comment.
In http://reviews.llvm.org/D18162#378752, @nhaehnle wrote:
>
> The derivative taken by the llvm.SI.image.sample is undefined in GLSL if the control-flow is dynamically non-uniform, so it is perfectly legal to sink the llvm.amdgcn.image.load into the IF block (and the same applies to any other computation that leads to a derivative).
This is the same situation as with barriers, so I think this should still be convergent. The problem that convergent solves is LLVM introducing uses that are not reached under uniform control flow, for example by introducing a call on either side of an if/then block.
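
To illustrate the concern, here is a toy sketch of what goes wrong when a derivative-producing operation ends up under non-uniform control flow. It is not LLVM or hardware behavior: the four-lane quad, the `ddx` helper, the neighbor indexing, and the use of `None` for an undefined value are all assumptions made for the example.

```python
# Toy model only: one quad is four lanes, and the horizontal derivative
# is the difference between a lane and its horizontal neighbour.
def ddx(values, active):
    """Return a per-lane derivative, or None where it is undefined.

    values: operand computed by each lane (hypothetical data)
    active: execution mask; an inactive lane never computed its operand
    """
    regs = [v if a else None for v, a in zip(values, active)]
    out = []
    for lane in range(4):
        if not active[lane]:
            out.append(None)  # the lane did not execute the operation
            continue
        left, right = regs[lane & ~1], regs[lane | 1]
        # The result is undefined if either neighbour was inactive.
        out.append(None if left is None or right is None else right - left)
    return out


coords = [0.0, 1.0, 0.0, 1.0]  # hypothetical per-lane texture coordinate

# Uniform control flow: every lane of the quad executes the operation.
uniform = ddx(coords, [True, True, True, True])

# The operation was moved into a branch that lane 1 did not take.
divergent = ddx(coords, [True, False, True, True])
```

In this model lane 0 loses its derivative as soon as lane 1 is masked off, even though lane 0 itself still executes the operation.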
http://reviews.llvm.org/D18162