[llvm] [AMDGPU] [PeepholeOpt] Eliminate unnecessary packing in wider f16 vectors for sdwa/opsel-able instruction (PR #137137)

Krzysztof Drewniak via llvm-commits llvm-commits at lists.llvm.org
Thu Apr 24 08:27:43 PDT 2025


================
@@ -1361,6 +1379,499 @@ bool SIPeepholeSDWALegacy::runOnMachineFunction(MachineFunction &MF) {
   return SIPeepholeSDWA().run(MF);
 }
 
+static bool isSrcDestFP16Bits(MachineInstr *MI, const SIInstrInfo *TII) {
----------------
krzysz00 wrote:

Is there a way to tablegen this switch-case?

(Also, are there relevant things that use the OPSEL scheme and not the SDWA one?)

https://github.com/llvm/llvm-project/pull/137137


More information about the llvm-commits mailing list