[llvm] [AMDGPU] fix SIPeepholeSDWA optimization for fp16 (PR #109395)

Matt Arsenault via llvm-commits llvm-commits at lists.llvm.org
Mon Dec 16 23:17:44 PST 2024


arsenm wrote:

> Is it expected to be the same? Also, in the first case, I can see a combination of two encodings: 64 in v_cmp_ge_i32_e64 and 32 in v_cmp_lt_i32_e32. Does it work correctly? If so, do you have any idea what could go wrong here?

Not sure what the question is. Yes, VOPC instructions are available in a VOP3 form as well.

https://github.com/llvm/llvm-project/pull/109395

