[PATCH] D55570: [AMDGPU] Improve SDWA generation for V_OR_B32_E32.

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri Jun 28 17:09:21 PDT 2019


arsenm added inline comments.


================
Comment at: test/CodeGen/AMDGPU/sdwa-ors.mir:4-11
+# GCN-LABEL: {{^}}name: sdwa_test
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 2, 6, implicit $exec
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 1, 6, implicit $exec
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 0, 6, implicit $exec
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 3, 6, implicit $exec
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 5, 6, implicit $exec
+# GFX9: V_OR_B32_sdwa 0, %{{[0-9]+}}, 0, killed %{{[0-9]+}}, 0, 6, 0, 4, 6, implicit $exec
----------------
Can probably use update_mir_test_checks for this one


================
Comment at: test/CodeGen/AMDGPU/sdwa-ors.mir:35
+    %16:vgpr_32 = V_OR_B32_e64 killed %15, killed %13, implicit $exec
+    GLOBAL_STORE_DWORD %2, %16, 0, 0, 0, implicit $exec :: (volatile store 4, addrspace 1)
+
----------------
Can you condense the register values for this? Running -run-pass=none with -simplify-mir should work once you delete the registers section


CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D55570/new/

https://reviews.llvm.org/D55570





More information about the llvm-commits mailing list