[llvm] [AMDGPU] Eliminate unnecessary packing in wider f16 vectors for sdwa/opsel-able instruction (PR #137137)

Pierre van Houtryve via llvm-commits llvm-commits at lists.llvm.org
Fri Apr 25 04:21:50 PDT 2025


================
@@ -1361,6 +1379,499 @@ bool SIPeepholeSDWALegacy::runOnMachineFunction(MachineFunction &MF) {
   return SIPeepholeSDWA().run(MF);
 }
 
+static bool isSrcDestFP16Bits(MachineInstr *MI, const SIInstrInfo *TII) {
----------------
Pierre-vh wrote:

@vg0204 Please use replies instead of editing the original comment. I got very confused while reading this.



https://github.com/llvm/llvm-project/pull/137137


More information about the llvm-commits mailing list