[PATCH] D142782: [AMDGPU] Add basic support for extended i8 perm matching
Matt Arsenault via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Thu Feb 23 06:53:25 PST 2023
arsenm added inline comments.
================
Comment at: llvm/test/CodeGen/AMDGPU/fast-unaligned-load-store.global.ll:64
; GFX11-NEXT: s_waitcnt vmcnt(0)
-; GFX11-NEXT: v_lshl_or_b32 v0, v0, 16, v2
; GFX11-NEXT: s_setpc_b64 s[30:31]
----------------
Yes, this is worse. The new perm matching should avoid cases that can already be selected as v_lshl_or_b32.
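For reference, a minimal sketch of what the removed instruction computes, assuming the usual reading of v_lshl_or_b32 as (src0 << src1[4:0]) | src2 truncated to 32 bits. The Python helper name and the sample values are illustrative only and are not taken from the patch or the test.

```python
MASK32 = 0xFFFFFFFF

def lshl_or_b32(src0: int, src1: int, src2: int) -> int:
    """Illustrative model of v_lshl_or_b32 (assumed semantics):
    shift src0 left by the low 5 bits of src1, OR in src2, keep 32 bits."""
    shift = src1 & 0x1F
    return ((src0 << shift) | src2) & MASK32

# Mirrors `v_lshl_or_b32 v0, v0, 16, v2` from the quoted test line:
# two zero-extended 16-bit halves are combined into one dword.
hi_half = 0x1234  # hypothetical value held in v0
lo_half = 0x5678  # hypothetical value held in v2
print(hex(lshl_or_b32(hi_half, 16, lo_half)))  # 0x12345678
```

A single shift-and-or already expresses this two-halves merge, which is why replacing it with a byte permute is a regression here.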
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D142782/new/
https://reviews.llvm.org/D142782