[PATCH] D109900: [AMDGPU] Filtering out the inactive lanes bits when lowering copy to SCC
Alexander via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Sep 16 10:33:56 PDT 2021
alex-t created this revision.
alex-t added reviewers: rampitec, foad.
Herald added subscribers: kerbowa, hiraditya, t-tye, tpr, dstuttard, yaxunl, nhaehnle, jvesely, kzhuravl, arsenm.
alex-t requested review of this revision.
Herald added a subscriber: wdng.
Herald added a project: LLVM.
SALU i1 representation is just a one bit but VALU is 64bits.
When we pass the scalar i1 values to uniform VALU instructions
we use the fllowing representation: false - 0, true - 0xffffffffffffffff.
VALU instructions only process the olanes that are active, that is controlled by the EXEC mask.
We need to filter out odd bits when copy the computation result back to SCC.
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D109900
Files:
llvm/lib/Target/AMDGPU/SIFixSGPRCopies.cpp
llvm/test/CodeGen/AMDGPU/amdgpu-codegenprepare-idiv.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D109900.372988.patch
Type: text/x-patch
Size: 19879 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20210916/c0182bc4/attachment.bin>
More information about the llvm-commits
mailing list