[PATCH] D109900: [AMDGPU] Filtering out the inactive lanes bits when lowering copy to SCC

Alexander via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Sep 16 10:33:56 PDT 2021


alex-t created this revision.
alex-t added reviewers: rampitec, foad.
Herald added subscribers: kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, jvesely, kzhuravl, arsenm.
alex-t requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.

SALU i1 representation is just a one bit but VALU is 64bits.

  When we pass the scalar i1 values to uniform VALU instructions 
  we use the fllowing representation: false - 0, true - 0xffffffffffffffff. 
  VALU instructions only process the olanes that are active, that is controlled by the EXEC mask.
  We need to filter out odd bits when copy the computation result back to SCC.


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D109900

Files:
  llvm/lib/Target/AMDGPU/SIFixSGPRCopies.cpp
  llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-idiv.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D109900.372988.patch
Type: text/x-patch
Size: 19879 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210916/c0182bc4/attachment.bin>


More information about the llvm-commits mailing list