[PATCH] D153034: [PowerPC][Future] Enable __builtin_mma_xxm[t|f]acc
    Lei Huang via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Fri Jun 23 07:16:15 PDT 2023
    
    
  
lei added inline comments.
================
Comment at: llvm/lib/Target/PowerPC/PPCISelLowering.cpp:10716-10726
+    // Future cpu instructions dmxxinstdmr512 and dmxxextfdmr512 insert and
+    // extract quad vectors(v512i1) from the new wide accumulator(wacc)
+    // register class. The introduction of these new instructions renders the
+    // p10 instructions xxmtacc and xxmfacc obsolete since the new wacc
+    // register class is a better choice for handling quad vector operations.
+    // The intrinsics for xxmtacc and xxmfacc take one argument of
+    // type v512i1, a corresponding dmxx[inst|extf]dmr512 instruction
----------------
I find it confusing to start talking about new instructions when the block starts with the p10 intrinsics. I would add this comment to after handling of non-future ISA code.  So after the if stmt and simplify it a bit as well.
================
Comment at: llvm/test/CodeGen/PowerPC/mmaplus-intrinsics.ll:152
+; CHECK-O0-NEXT:    lxv vs0, 32(r3)
+; CHECK-O0-NEXT:    # implicit-def: $vsrp17
+; CHECK-O0-NEXT:    xxlor v3, vs0, vs0
----------------
I realized these -O0 run line was not added as part of this patch, but I am wondering why we need to be explicitly running `-O0` for these intrinsic tests.  Are they generated differently for `-O0` vs opt?
Repository:
  rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D153034/new/
https://reviews.llvm.org/D153034
    
    
More information about the llvm-commits
mailing list