[llvm] [AMDGPU][True16] Fix the VGPR register class for 16-bit values. (PR #76170)
Jay Foad via llvm-commits
llvm-commits at lists.llvm.org
Fri Dec 22 02:16:36 PST 2023
================
@@ -100,7 +100,9 @@ define amdgpu_kernel void @fadd_f16(
; GFX11-GISEL-NEXT: s_waitcnt vmcnt(0)
; GFX11-GISEL-NEXT: buffer_load_u16 v1, off, s[0:3], 0 glc dlc
; GFX11-GISEL-NEXT: s_waitcnt vmcnt(0)
-; GFX11-GISEL-NEXT: v_add_f16_e32 v0.l, v0.l, v1.l
+; GFX11-GISEL-NEXT: v_mov_b16_e32 v0.h, v1.l
----------------
jayfoad wrote:
Any idea why this regressed? It looks like a failure of copy coalescing.
https://github.com/llvm/llvm-project/pull/76170
More information about the llvm-commits
mailing list