[llvm] [AMDGPU] fix SIPeepholeSDWA optimization for fp16 (PR #109395)
Matt Arsenault via llvm-commits
llvm-commits at lists.llvm.org
Mon Dec 16 23:17:44 PST 2024
arsenm wrote:
> Is it expected to be the same? Also, in the first case, I can see a combination of two encodings: 64 in v_cmp_ge_i32_e64 and 32 in v_cmp_lt_i32_e32. Does it work correctly? If so, do you have any idea what could go wrong here?
Not sure what the question is. Yes, VOPC instructions are available in a VOP3 form as well.
https://github.com/llvm/llvm-project/pull/109395