[PATCH] D52944: AMDGPU: Add llvm.amdgcn.ds.ordered.add & swap

Marek Olšák via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Nov 12 19:43:10 PST 2018


mareko added a comment.

In https://reviews.llvm.org/D52944#1260081, @nhaehnle wrote:

> In https://reviews.llvm.org/D52944#1259919, @arsenm wrote:
>
> > Does this need to be marked as isSourceOfDivergence?
>
>
> Now that you mention it, yes, even though it's for stupid reasons: I believe the ds_ordered_count instruction executes only in a single lane, so it's intuitively a uniform operation; however, it returns its result only in lane 0, so it's formally non-uniform.


ds_ordered_count hangs if more than 1 lane is active.

Would everybody tolerate "i32 m0" in the intrinsic?


Repository:
  rL LLVM

https://reviews.llvm.org/D52944





More information about the llvm-commits mailing list