[llvm] [AMDGPU] Eliminate unnecessary packing in wider f16 vectors for sdwa/opsel-able instruction (PR #137137)

Krzysztof Drewniak via llvm-commits llvm-commits at lists.llvm.org
Thu Dec 4 23:47:14 PST 2025


================
@@ -207,6 +227,52 @@ class SDWADstPreserveOperand : public SDWADstOperand {
 #endif
 };
 
+class SDWAFP16ChainOperand : public SDWAOperand {
+private:
+  SIPeepholeSDWA &Parent;
+  FP16PackCandidate Candidate;
----------------
krzysz00 wrote:

Initial comment: Why does this say "FP16" when this issue isn't limited to f16 instructions?

https://github.com/llvm/llvm-project/pull/137137


More information about the llvm-commits mailing list