[llvm] [AMDGPU] Eliminate unnecessary packing in wider f16 vectors for sdwa/opsel-able instruction (PR #137137)
Pierre van Houtryve via llvm-commits
llvm-commits at lists.llvm.org
Thu Dec 11 00:42:23 PST 2025
================
@@ -89,6 +100,16 @@ class SIPeepholeSDWALegacy : public MachineFunctionPass {
using namespace AMDGPU::SDWA;
+struct FP16PackCandidate {
----------------
Pierre-vh wrote:
Please add comments to tell what each field does
https://github.com/llvm/llvm-project/pull/137137
More information about the llvm-commits
mailing list