[PATCH] D136663: Handling ADD|SUB U64 decomposed Pseudos not getting lowered to SDWA form
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Oct 26 08:00:42 PDT 2022
foad added a comment.
Can you precommit the tests and rebase so that we can see the diffs?
================
Comment at: llvm/test/CodeGen/AMDGPU/v_add_u64_pseudo_sdwa.ll:9-10
+; GFX9-NEXT: v_add_co_u32_sdwa v0, vcc, v1, v0 dst_sel:DWORD dst_unused:UNUSED_PAD src0_sel:DWORD src1_sel:BYTE_0
+; GFX9-NEXT: v_mov_b32_e32 v1, 0
+; GFX9-NEXT: v_addc_co_u32_e32 v1, vcc, 0, v1, vcc
+; GFX9-NEXT: global_store_dwordx2 v[0:1], v[0:1], off
----------------
This is silly. You have added a mov instruction to shrink the addc instruction for no reason, because it is not actually converted to sdwa form.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D136663/new/
https://reviews.llvm.org/D136663
More information about the llvm-commits
mailing list