[clang] [lld] [flang] [llvm] [compiler-rt] [openmp] [lldb] [clang-tools-extra] [libcxx] [libc] [mlir] AMDGPU: Do not generate non-temporal hint when Load_Tr intrinsic did not specify it (PR #79104)
Piotr Sobczak via cfe-commits
cfe-commits at lists.llvm.org
Tue Jan 23 00:30:04 PST 2024
================
@@ -13,9 +13,8 @@ define amdgpu_kernel void @global_load_tr_b64(ptr addrspace(1) %addr, ptr addrsp
; GFX12-SDAG-W32-NEXT: s_load_b128 s[0:3], s[0:1], 0x24
; GFX12-SDAG-W32-NEXT: v_mov_b32_e32 v2, 0
; GFX12-SDAG-W32-NEXT: s_wait_kmcnt 0x0
-; GFX12-SDAG-W32-NEXT: global_load_tr_b64 v[0:1], v2, s[0:1] offset:32 th:TH_LOAD_NT
+; GFX12-SDAG-W32-NEXT: global_load_tr_b64 v[0:1], v2, s[0:1] offset:32
----------------
piotrAMD wrote:
Thanks!
https://github.com/llvm/llvm-project/pull/79104
More information about the cfe-commits
mailing list