[llvm-branch-commits] [llvm] [AMDGPU][SILoadStoreOptimizer] Merge constrained sloads (PR #96162)

Jay Foad via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Wed Jul 10 06:25:34 PDT 2024


================
@@ -658,17 +658,17 @@ define amdgpu_kernel void @image_bvh_intersect_ray_nsa_reassign(ptr %p_node_ptr,
 ;
 ; GFX1013-LABEL: image_bvh_intersect_ray_nsa_reassign:
 ; GFX1013:       ; %bb.0:
-; GFX1013-NEXT:    s_load_dwordx8 s[0:7], s[0:1], 0x24
+; GFX1013-NEXT:    s_load_dwordx8 s[4:11], s[0:1], 0x24
----------------
jayfoad wrote:

I guess this code changes because xnack is enabled by default for GFX10.1? Is there anything we could do to add known alignment info here, to avoid the code pessimization?

https://github.com/llvm/llvm-project/pull/96162


More information about the llvm-branch-commits mailing list