[llvm] [AMDGPU] fix SIPeepholeSDWA optimization for fp16 (PR #109395)
Pankaj Dwivedi via llvm-commits
llvm-commits at lists.llvm.org
Mon Dec 16 23:28:02 PST 2024
PankajDwivedi-25 wrote:
> Yes, VOPC instructions are available in a VOP3 form as well.
Thank you for your feedback!
It's a weird case here then. I have no clue what is wrong with int8 sorting. Why some of the output values are becoming -1's?
all I can see in the final ASM diff is the folding of bfe into sdwa.
https://github.com/llvm/llvm-project/pull/109395
More information about the llvm-commits
mailing list