[PATCH] D42647: AMDGPU: Track physreg uses in SILoadStoreOptimizer

Marek Olšák via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Feb 21 09:48:52 PST 2018


mareko added inline comments.


================
Comment at: test/CodeGen/AMDGPU/smrd.ll:258
 ;
-; GCN: s_buffer_load_dword
-; GCN: s_buffer_load_dword
-define amdgpu_ps float @smrd_imm_nomerge_m0(<4 x i32> inreg %desc, i32 inreg %prim, float %u, float %v) #0 {
+; Merging is still thwarted on GFX9 due to s_set_gpr_idx
+;
----------------
mareko wrote:
> Merging of these copcodes is disabled on GFX9 because of the cache line straddling bug.
Note that merging buffer loads is fully enabled on GFX9 now. See git commit ea06ecf3436ea455a7f304095ebf7f4f4ec989f3 .


Repository:
  rL LLVM

https://reviews.llvm.org/D42647





More information about the llvm-commits mailing list