[PATCH] D114232: [AMDGPU] Fold more inline constant operands by commuting instructions
Jay Foad via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Nov 19 07:29:55 PST 2021
foad added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/fold-multiple-commute.mir:18
+ ; CHECK-NEXT: [[V_ADDC_U32_e32_:%[0-9]+]]:vgpr_32 = V_ADDC_U32_e32 0, [[COPY]], implicit-def $vcc, implicit $vcc, implicit $exec
+ ; CHECK-NEXT: [[V_ADDC_U32_e32_1:%[0-9]+]]:vgpr_32 = V_ADDC_U32_e32 0, [[COPY1]], implicit-def $vcc, implicit $vcc, implicit $exec
+ ; CHECK-NEXT: S_NOP 0, implicit [[V_ADDC_U32_e32_]], implicit [[V_ADDC_U32_e32_1]]
----------------
It occurs to me now that SIFoldOperands would probably have succeeded here if these were using the e64 form of the instruction. But there are obviously real world cases where it sees the e32 form, otherwise none of the .ll tests would have been improved by this patch.
Is it worth abandoning this patch and pursuing why we are selecting e32 instructions in the first place?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D114232/new/
https://reviews.llvm.org/D114232
More information about the llvm-commits
mailing list