[PATCH] D33455: [AMDGPU] Combine and (srl) into shl (bfe)

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Tue May 23 11:39:32 PDT 2017


arsenm added inline comments.


================
Comment at: lib/Target/AMDGPU/AMDGPUISelLowering.cpp:3482
+    if ((OffsetVal + WidthVal) >= 32 &&
+        !(Subtarget->getGeneration() >= AMDGPUSubtarget::VOLCANIC_ISLANDS &&
+          OffsetVal == 16 && WidthVal == 16)) {
----------------
There is an SDWA subtarget feature


================
Comment at: lib/Target/AMDGPU/SIISelLowering.cpp:4242
+    // nb = number of trailing zeroes in mask
+    // It can be optimized out using SDWA for GFX8+ in the finalizer,
+    // given that we are selecting 8 or 16 bit fields starting at byte boundary.
----------------
s/finalizer/SDWA pass/


================
Comment at: test/CodeGen/AMDGPU/bfe-combine.ll:3
+
+; CHECK-LABEL: {{^}}bfe_combine8:
+; CHECK: v_bfe_u32 v[[BFE:[0-9]+]], v{{[0-9]+}}, 8, 8
----------------
s/CHECK/GCN


Repository:
  rL LLVM

https://reviews.llvm.org/D33455





More information about the llvm-commits mailing list