[PATCH] D139227: [AMDGPU] Update MFMASmallGemmOpt with better performing stategy
Jeffrey Byrnes via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Dec 7 17:32:18 PST 2022
jrbyrnes added a comment.
Hi Austin,
Changes look fine -- and if experiments show it has better performance then I suppose it is better. But the pipeline seems rather arbitrary -- in fact, in the test the previous pipeline fits the requirements of the new one. Maybe since the DAG is less constrained the scheduler has a better ability to produce improved schedule?
Also, having pipeline with 3x as many MFMA SchedGroups as there are MFMAs is an impossible pipeline. I assume you also tried ` I < MFMACount ` ?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D139227/new/
https://reviews.llvm.org/D139227
More information about the llvm-commits
mailing list