[llvm-commits] [PATCH] Make '-mtune=x86_64' assume fast unaligned memory accesses.
Dan Gohman
dan433584 at gmail.com
Tue Dec 11 10:05:46 PST 2012
Makes sense to me.
Dan
On Mon, Dec 10, 2012 at 1:27 AM, Chandler Carruth <chandlerc at gmail.com> wrote:
> Not all chips targeted by x86_64 have this feature, but a dramatically
> increasing number do. Specifying a chip-specific tuning parameter will
> continue to turn the feature on or off as appropriate for that
> particular chip, but the generic flag should try to achieve the best
> performance on the most widely available hardware. Today, the number of
> chips with fast UA access dwarfs those without in the x86-64 space.
>
> Note that this also brings LLVM's code generation for this '-march' flag
> more in line with that of modern GCCs.
>
> http://llvm-reviews.chandlerc.com/D195
>
> Files:
> lib/Target/X86/X86.td
>
> Index: lib/Target/X86/X86.td
> ===================================================================
> --- lib/Target/X86/X86.td
> +++ lib/Target/X86/X86.td
> @@ -155,7 +155,8 @@
> def : Proc<"pentium-m", [FeatureSSE2, FeatureSlowBTMem]>;
> def : Proc<"pentium4", [FeatureSSE2]>;
> def : Proc<"pentium4m", [FeatureSSE2, FeatureSlowBTMem]>;
> -def : Proc<"x86-64", [FeatureSSE2, Feature64Bit, FeatureSlowBTMem]>;
> +def : Proc<"x86-64", [FeatureSSE2, Feature64Bit, FeatureSlowBTMem,
> + FeatureFastUAMem]>;
> def : Proc<"yonah", [FeatureSSE3, FeatureSlowBTMem]>;
> def : Proc<"prescott", [FeatureSSE3, FeatureSlowBTMem]>;
> def : Proc<"nocona", [FeatureSSE3, FeatureCMPXCHG16B,
>
> _______________________________________________
> llvm-commits mailing list
> llvm-commits at cs.uiuc.edu
> http://lists.cs.uiuc.edu/mailman/listinfo/llvm-commits
>
More information about the llvm-commits
mailing list