[PATCH] D109900: [AMDGPU] Filtering out the inactive lanes bits when lowering copy to SCC

Ruiling, Song via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Sep 16 15:31:55 PDT 2021


ruiling added a reviewer: arsenm.
ruiling added a comment.

> When we pass the scalar i1 values to uniform VALU instructions 
> we use the fllowing representation: false - 0, true - 0xffffffffffffffff. 
> VALU instructions only process the olanes that are active, that is controlled by the EXEC mask.
> We need to filter out odd bits when copy the computation result back to SCC.

Could you add some more description what's going on in failure cases? So that is easy for others to understand the problem. And we also need a test to show the problem.


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D109900/new/

https://reviews.llvm.org/D109900



More information about the llvm-commits mailing list