[llvm] [AMDGPU][True16] Fix the VGPR register class for 16-bit values. (PR #76170)

Jay Foad via llvm-commits llvm-commits at lists.llvm.org
Fri Dec 22 02:16:36 PST 2023


================
@@ -100,7 +100,9 @@ define amdgpu_kernel void @fadd_f16(
 ; GFX11-GISEL-NEXT:    s_waitcnt vmcnt(0)
 ; GFX11-GISEL-NEXT:    buffer_load_u16 v1, off, s[0:3], 0 glc dlc
 ; GFX11-GISEL-NEXT:    s_waitcnt vmcnt(0)
-; GFX11-GISEL-NEXT:    v_add_f16_e32 v0.l, v0.l, v1.l
+; GFX11-GISEL-NEXT:    v_mov_b16_e32 v0.h, v1.l
----------------
jayfoad wrote:

Any idea why this regressed? It looks like a failure of copy coalescing.

https://github.com/llvm/llvm-project/pull/76170


More information about the llvm-commits mailing list