[llvm-branch-commits] [llvm] [AMDGPU][SILoadStoreOptimizer] Merge constrained sloads (PR #96162)
Christudasan Devadasan via llvm-branch-commits
llvm-branch-commits at lists.llvm.org
Wed Jul 10 06:33:59 PDT 2024
================
@@ -658,17 +658,17 @@ define amdgpu_kernel void @image_bvh_intersect_ray_nsa_reassign(ptr %p_node_ptr,
;
; GFX1013-LABEL: image_bvh_intersect_ray_nsa_reassign:
; GFX1013: ; %bb.0:
-; GFX1013-NEXT: s_load_dwordx8 s[0:7], s[0:1], 0x24
+; GFX1013-NEXT: s_load_dwordx8 s[4:11], s[0:1], 0x24
----------------
cdevadas wrote:
> I guess this code changes because xnack is enabled by default for GFX10.1?
Yes.
> Is there anything we could do to add known alignment info here, to avoid the code pessimization?
I'm not sure what can be done for it.
https://github.com/llvm/llvm-project/pull/96162
More information about the llvm-branch-commits
mailing list