[llvm-branch-commits] [llvm] [AMDGPU][SILoadStoreOptimizer] Merge constrained sloads (PR #96162)

Christudasan Devadasan via llvm-branch-commits llvm-branch-commits at lists.llvm.org
Wed Jul 10 06:33:59 PDT 2024


================
@@ -658,17 +658,17 @@ define amdgpu_kernel void @image_bvh_intersect_ray_nsa_reassign(ptr %p_node_ptr,
 ;
 ; GFX1013-LABEL: image_bvh_intersect_ray_nsa_reassign:
 ; GFX1013:       ; %bb.0:
-; GFX1013-NEXT:    s_load_dwordx8 s[0:7], s[0:1], 0x24
+; GFX1013-NEXT:    s_load_dwordx8 s[4:11], s[0:1], 0x24
----------------
cdevadas wrote:

> I guess this code changes because xnack is enabled by default for GFX10.1?

Yes.

> Is there anything we could do to add known alignment info here, to avoid the code pessimization?

I'm not sure what can be done for it.



https://github.com/llvm/llvm-project/pull/96162

