[llvm] [X86] Reduce znver3/4 LoopMicroOpBufferSize to practical loop unrolling values (PR #91340)
via llvm-commits
llvm-commits at lists.llvm.org
Wed May 8 02:09:23 PDT 2024
ganeshgit wrote:
> isn't it limited by the 6 wide renamer in practice
I am not sure why you mention the renamer. It plays a role in throughput, but throughput is not strictly limited to the renamer's capacity, right? What the renamer can sustain varies case by case, but the theoretical limit is 9 uops. The renamer can also be split across integer/FP (vector). So I don't think we should restrict this to the renamer's capability.
@RKSimon I think we should add a tuning flag that says whether a subtarget is willing to use LoopMicroOpBufferSize for unrolling decisions. I agree that the metric you are proposing serves the purpose, but the term LoopMicroOpBufferSize is itself misleading and not representative.
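As a rough illustration only (not code from the PR), the tuning-flag idea could look something like the sketch below. The struct `SubtargetTuning`, the flag `UseLoopBufferForUnrolling`, and the helper `partialUnrollThreshold` are all hypothetical names made up for this example; they are not existing LLVM APIs, and the real change would presumably live in the X86 subtarget features and the target's unrolling preferences.

```cpp
// Hypothetical stand-in for per-subtarget tuning data. This is NOT an LLVM
// class; it only models the two pieces of information discussed above.
struct SubtargetTuning {
  unsigned LoopMicroOpBufferSize;   // value taken from the scheduling model
  bool UseLoopBufferForUnrolling;   // hypothetical opt-in tuning flag
};

// Returns the uop budget to use for partial unrolling, or 0 when the
// subtarget has not opted in to deriving it from LoopMicroOpBufferSize.
unsigned partialUnrollThreshold(const SubtargetTuning &T) {
  if (!T.UseLoopBufferForUnrolling)
    return 0; // leave the unrolling decision to other heuristics
  return T.LoopMicroOpBufferSize;
}
```

With a flag like this, the scheduling-model value could keep describing the hardware, and only subtargets that set the flag would let it influence unrolling.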
https://github.com/llvm/llvm-project/pull/91340