[PATCH] D109900: [AMDGPU] Filtering out the inactive lanes bits when lowering copy to SCC
Ruiling, Song via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 16 15:31:55 PDT 2021
ruiling added a reviewer: arsenm.
ruiling added a comment.
> When we pass the scalar i1 values to uniform VALU instructions
> we use the fllowing representation: false - 0, true - 0xffffffffffffffff.
> VALU instructions only process the olanes that are active, that is controlled by the EXEC mask.
> We need to filter out odd bits when copy the computation result back to SCC.
Could you add some more description what's going on in failure cases? So that is easy for others to understand the problem. And we also need a test to show the problem.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D109900/new/
https://reviews.llvm.org/D109900
More information about the llvm-commits
mailing list