[PATCH] D57748: AMDGPU: Add inverse ballot intrinsic
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 7 09:48:51 PST 2019
arsenm added a comment.
In D57748#1389109 <https://reviews.llvm.org/D57748#1389109>, @arsenm wrote:
> In D57748#1389105 <https://reviews.llvm.org/D57748#1389105>, @nhaehnle wrote:
>
> > Why can't we recognize this as a pattern? Basically, it's just `(src & (1 << thread_idx))`, and `thread_idx` can be matched as a sequence of `mbcnt` intrinsics.
> >
> > Hmm, except the SelectionDAG is only per-basic block. Ugh.
>
>
> We could always make CodeGenPrepare always sink these
I think this would be cleaner since then we wouldn't have to worry about the conceptually disturbing readfirstlane case
Repository:
rL LLVM
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D57748/new/
https://reviews.llvm.org/D57748
More information about the llvm-commits
mailing list