[llvm] [AMDGPU][True16][Codegen] remove packed build_vector pattern from true16 (PR #148715)
Brox Chen via llvm-commits
llvm-commits at lists.llvm.org
Thu Jul 17 08:37:14 PDT 2025
================
@@ -4007,6 +4007,8 @@ define amdgpu_ps <2 x half> @global_load_saddr_i16_d16lo_zero_hi(ptr addrspace(1
; GFX12-GISEL-TRUE16-NEXT: global_load_d16_b16 v0, v0, s[2:3]
; GFX12-GISEL-TRUE16-NEXT: s_wait_loadcnt 0x0
; GFX12-GISEL-TRUE16-NEXT: v_and_b32_e32 v0, 0xffff, v0
+; GFX12-GISEL-TRUE16-NEXT: s_delay_alu instid0(VALU_DEP_1)
+; GFX12-GISEL-TRUE16-NEXT: v_lshl_or_b32 v0, 0, 16, v0
; GFX12-GISEL-TRUE16-NEXT: ; return to shader part epilog
;
; GFX12-GISEL-FAKE16-LABEL: global_load_saddr_i16_d16lo_zero_hi:
----------------
broxigarchen wrote:
Regarding using non-d16 forms when load and zext, I remember there should be patterns. Let me take a look
https://github.com/llvm/llvm-project/pull/148715
More information about the llvm-commits
mailing list