[PATCH] D136663: Handling ADD|SUB U64 decomposed Pseudos not getting lowered to SDWA form

Thu Oct 27 04:35:46 PDT 2022

yassingh updated this revision to Diff 471118.
yassingh added a comment.

Only shrink V_ADD_CO_U32_e32 instruction if V_ADDC_CO_U32_e32 is not shrinkable. Hence avoiding introducing redundant copy instruction as suggested by @arsenm and @foad

Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D136663/new/

https://reviews.llvm.org/D136663

Files:
  llvm/lib/Target/AMDGPU/SIPeepholeSDWA.cpp
  llvm/test/CodeGen/AMDGPU/v_add_u64_pseudo_sdwa.ll
  llvm/test/CodeGen/AMDGPU/v_sub_u64_pseudo_sdwa.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D136663.471118.patch
Type: text/x-patch
Size: 4936 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20221027/b2caf11e/attachment.bin>