[llvm] [X86] Reduce znver3/4 LoopMicroOpBufferSize to practical loop unrolling values (PR #91340)

Simon Pilgrim via llvm-commits llvm-commits at lists.llvm.org
Wed May 15 04:08:42 PDT 2024


RKSimon wrote:

Would people prefer we just drop the LoopMicroOpBufferSize entry from the znver3/4 models (same as znver1/2)? This prevents most loop unrolling and we then rely on the cpu's op cache higher decode rate to get higher performance (but we end up testing every loop).

https://github.com/llvm/llvm-project/pull/91340


More information about the llvm-commits mailing list