[PATCH] D139227: [AMDGPU] Update MFMASmallGemmOpt with better performing stategy

Jeffrey Byrnes via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Wed Dec 7 17:32:18 PST 2022


jrbyrnes added a comment.

Hi Austin,

Changes look fine -- and if experiments show it has better performance then I suppose it is better. But the pipeline seems rather arbitrary -- in fact, in the test the previous pipeline fits the requirements of the new one. Maybe since the DAG is less constrained the scheduler has a better ability to produce improved schedule?

Also, having pipeline with 3x as many MFMA SchedGroups as there are MFMAs is an impossible pipeline. I assume you also tried ` I < MFMACount ` ?


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D139227/new/

https://reviews.llvm.org/D139227



More information about the llvm-commits mailing list