[llvm] [AMDGPU] fix SIPeepholeSDWA optimization for fp16 (PR #109395)

Pankaj Dwivedi via llvm-commits llvm-commits at lists.llvm.org
Mon Dec 16 23:28:02 PST 2024


PankajDwivedi-25 wrote:

> Yes, VOPC instructions are available in a VOP3 form as well.

Thank you for your feedback!

It's a weird case here then. I have no clue what is wrong with int8 sorting. Why some of the output values are becoming -1's?
all I can see in the final ASM diff is the folding of bfe into sdwa.

https://github.com/llvm/llvm-project/pull/109395


More information about the llvm-commits mailing list