[PATCH] D33576: [AMDGPU] Require waitcnt before barrier for all targets; adjust tests.

Mark Searles via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Fri May 26 15:18:30 PDT 2017


msearles updated this revision to Diff 100481.
msearles added a comment.

Ug. Fix revision; added wrong set of diffs; correcting.


https://reviews.llvm.org/D33576

Files:
  lib/Target/AMDGPU/AMDGPUSubtarget.h
  test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.ll


Index: test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.ll
===================================================================
--- test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.ll
+++ test/CodeGen/AMDGPU/llvm.amdgcn.s.barrier.ll
@@ -3,9 +3,8 @@

 ; GCN-LABEL: {{^}}test_barrier:
 ; GFX8: buffer_store_dword
-; GFX8: s_waitcnt
 ; GFX9: flat_store_dword
-; GFX9-NOT: s_waitcnt
+; GCN: s_waitcnt
 ; GCN: s_barrier
 define amdgpu_kernel void @test_barrier(i32 addrspace(1)* %out, i32 %size) #0 {
 entry:
Index: lib/Target/AMDGPU/AMDGPUSubtarget.h
===================================================================
--- lib/Target/AMDGPU/AMDGPUSubtarget.h
+++ lib/Target/AMDGPU/AMDGPUSubtarget.h
@@ -730,7 +730,7 @@
   /// \returns True if waitcnt instruction is needed before barrier instruction,
   /// false otherwise.
   bool needWaitcntBeforeBarrier() const {
-    return getGeneration() < GFX9;
+    return true;
   }

   /// \returns true if the flat_scratch register should be initialized with the


-------------- next part --------------
A non-text attachment was scrubbed...
Name: D33576.100481.patch
Type: text/x-patch
Size: 990 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170526/51fac489/attachment.bin>


More information about the llvm-commits mailing list