[PATCH] D136663: Handling ADD|SUB U64 decomposed Pseudos not getting lowered to SDWA form
Yashwant Singh via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Oct 27 04:35:46 PDT 2022
yassingh updated this revision to Diff 471118.
yassingh added a comment.
Only shrink V_ADD_CO_U32_e32 instruction if V_ADDC_CO_U32_e32 is not shrinkable. Hence avoiding introducing redundant copy instruction as suggested by @arsenm and @foad
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D136663/new/
https://reviews.llvm.org/D136663
Files:
llvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp
llvm/test/CodeGen/AMDGPU/v_add_u64_pseudo_sdwa.ll
llvm/test/CodeGen/AMDGPU/v_sub_u64_pseudo_sdwa.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D136663.471118.patch
Type: text/x-patch
Size: 4936 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20221027/b2caf11e/attachment.bin>
More information about the llvm-commits
mailing list