[PATCH] D65423: [AMDGPU] Fix high occupancy calculation and print it
Stanislav Mekhanoshin via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Jul 29 17:24:11 PDT 2019
rampitec marked 2 inline comments as done.
rampitec added inline comments.
================
Comment at: lib/Target/AMDGPU/AMDGPUSubtarget.h:1322
+
+ unsigned getMaxWavesPerEU() const override { return 10; }
};
----------------
rampitec wrote:
> arsenm wrote:
> > Should just set a field in the constructor rather than introducing a virtual
> That is too early; the constructor does not know it yet.
> Also look at the other functions of the same family; they are all virtual: getMaxWorkGroupsPerCU, getMinFlatWorkGroupSize, getMaxFlatWorkGroupSize, getMinWavesPerEU, getMaxWavesPerEU(unsigned FlatWorkGroupSize).
Well, OK, I can still do it: just overwrite the value in the GCNSubtarget() constructor. It looks a little weird because I still have to duplicate this function in GCNSubtarget, otherwise C++ has problems with name resolution. Either this, or we will always have to specify the AMDGPUSubtarget:: prefix.
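To illustrate the name resolution problem, here is a minimal sketch. It assumes the cause is ordinary C++ name hiding: a derived class that declares any member called getMaxWavesPerEU hides every base-class overload of that name. BaseSubtarget, DerivedSubtarget, the default of 8, and the halving formula are hypothetical stand-ins, not the real AMDGPU code; only the value 10 comes from the patch.

```cpp
#include <cassert>

// Hypothetical stand-in for the base subtarget class.
class BaseSubtarget {
protected:
  unsigned MaxWavesPerEU = 8; // illustrative default, not a real hardware value

public:
  // Plain (non-virtual) getter that reads the field.
  unsigned getMaxWavesPerEU() const { return MaxWavesPerEU; }
};

// Hypothetical stand-in for the derived subtarget class.
class DerivedSubtarget : public BaseSubtarget {
public:
  // Overwrite the field in the constructor instead of adding a virtual.
  DerivedSubtarget() { MaxWavesPerEU = 10; }

  // Declaring this overload hides BaseSubtarget::getMaxWavesPerEU() for
  // unqualified lookup on a DerivedSubtarget. The formula is made up.
  unsigned getMaxWavesPerEU(unsigned FlatWorkGroupSize) const {
    return FlatWorkGroupSize > 256 ? MaxWavesPerEU / 2 : MaxWavesPerEU;
  }

  // The "duplicate": without this forwarding function, a call such as
  // ST.getMaxWavesPerEU() on a DerivedSubtarget would not compile.
  unsigned getMaxWavesPerEU() const {
    return BaseSubtarget::getMaxWavesPerEU();
  }
};
```

With the forwarding function in place, ST.getMaxWavesPerEU() works on a DerivedSubtarget. Without it, every caller would have to write the qualified form ST.BaseSubtarget::getMaxWavesPerEU(). A using-declaration (using BaseSubtarget::getMaxWavesPerEU;) in the derived class is another standard way to re-expose hidden base overloads, though it is not what the comment above proposes.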
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D65423/new/
https://reviews.llvm.org/D65423