[PATCH] D38611: Fix X86 regression on linpack
Evgeny Stupachenko via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Fri Jun 1 11:59:23 PDT 2018
evstupac added inline comments.
================
Comment at: lib/Target/X86/X86TargetTransformInfo.cpp:2538
+ // stores with folded address, treating each 2 such stores as 1
+ // additional instruction.
+ unsigned C1Insns = ((C1.FoldedStoreAddresses >> 1) + C1.Insns);
----------------
RKSimon wrote:
> This seems incredibly specific to a particular Intel architecture - you said you've tried to make a best average - based on what?
I have not seen significant difference on Atom CPUs and have seen gains on Cores. It is hard to cover all CPUs. We can adjust this (by limiting to specific CPU(s)) if get a regression.
================
Comment at: test/Transforms/LoopStrengthReduce/X86/folded_addresses.ll:8
+; Function Attrs: norecurse nounwind uwtable
+define void @foo(i32, float, float* noalias nocapture readonly, float* noalias nocapture) local_unnamed_addr #0 {
+; CHECK-LABEL: @foo(
----------------
RKSimon wrote:
> qcolombet wrote:
> > Could you use `opt -instnamer` on the input test?
> Is there any way that this can be reduced further?
Maybe we could get rid of vectorization, however now this is the test that got ~30% regression. It looks reasonable to track it.
Repository:
rL LLVM
https://reviews.llvm.org/D38611
More information about the llvm-commits
mailing list