[PATCH] D144388: [X86] Revise Alderlake P-Core schedule model
Noah Goldstein via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Sun Mar 5 23:00:17 PST 2023
goldstein.w.n added a comment.
In D144388#4170437 <https://reviews.llvm.org/D144388#4170437>, @HaohaiWen wrote:
> According to optimization manual, back to back ADD latency is 2 cycles. What uops.info tested was not back to back ADD latency.
>
>> Back-to-back ADD/SUB operations that are both executed on the Fast Adder unit perform the operations
>> in two cycles.
>> •
>> In 128/256-bit, back-to-back ADD/SUB operations executed on the Fast Adder unit perform the
>> operations in two cycles.
>> •
>> In 512-bit, back-to-back ADD/SUB operations are executed in two cycles if both operations use the
>> Fast Adder unit on port 5.
>> The following instructions are executed by the Fast Adder unit:
>> •
>> (V)ADDSUBSS/SD/PS/PD
>> •
>> (V)ADDSS/SD/PS/PD
>> •
>> (V)SUBSS/SD/PS/PD
2c latency in just a special case isn't really 2c latency. That's like saying HSW has 4c L1D latency b.c of the weird pointer chasing case.
I only tested back-to-back so can't say, but if it's 3c in other cases (as measured by uops.info) maybe should change back?
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D144388/new/
https://reviews.llvm.org/D144388
More information about the llvm-commits
mailing list