[clang] [lld] [flang] [llvm] [compiler-rt] [openmp] [lldb] [clang-tools-extra] [libcxx] [libc] [mlir] AMDGPU: Do not generate non-temporal hint when Load_Tr intrinsic did not specify it (PR #79104)

Piotr Sobczak via llvm-commits llvm-commits at lists.llvm.org
Tue Jan 23 00:30:04 PST 2024


================
@@ -13,9 +13,8 @@ define amdgpu_kernel void @global_load_tr_b64(ptr addrspace(1) %addr, ptr addrsp
 ; GFX12-SDAG-W32-NEXT:    s_load_b128 s[0:3], s[0:1], 0x24
 ; GFX12-SDAG-W32-NEXT:    v_mov_b32_e32 v2, 0
 ; GFX12-SDAG-W32-NEXT:    s_wait_kmcnt 0x0
-; GFX12-SDAG-W32-NEXT:    global_load_tr_b64 v[0:1], v2, s[0:1] offset:32 th:TH_LOAD_NT
+; GFX12-SDAG-W32-NEXT:    global_load_tr_b64 v[0:1], v2, s[0:1] offset:32
----------------
piotrAMD wrote:

Thanks!

https://github.com/llvm/llvm-project/pull/79104


More information about the llvm-commits mailing list