[PATCH] D32002: [X86] Improve large struct pass by value performance
Zvi Rackover via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Wed Apr 19 03:07:33 PDT 2017
zvi added a comment.
This LGTM, thanks!
Maybe better wait for other reviewers to give the final ok.
================
Comment at: lib/Target/X86/X86.td:508
FeatureBMI2,
+ FeatureFastString,
FeatureFMA,
----------------
courbet wrote:
> zvi wrote:
> > courbet wrote:
> > > RKSimon wrote:
> > > > Is this a Haswell feature in particular or the only target that has been tested?
> > > I've tested it on Haswell and Skylake. The Skylake model below actually uses HSWFeatures too, so I have not added it there again.
> > The Optimization Guide section @craig.topper quoted above states that this feature is available starting from Ivy Bridge.
> Unfortunately I don't have an IvyBridge to measure it. Do we want to blindly trust the manual ? :)
I have no objection for limiting to Haswell and later.
https://reviews.llvm.org/D32002
More information about the llvm-commits
mailing list