[PATCH] [x86] Add a feature flag for slow 32-byte unaligned memory accesses.

Sanjay Patel spatel at rotateright.com
Fri Nov 21 07:54:02 PST 2014


Hi qcolombet, hfinkel, andreadb, nadav,

This patch adds a feature flag to avoid unaligned 32-byte load/store AVX codegen for Sandy Bridge and Ivy Bridge. There is no functionality change intended for those chips. Previously, the absence of AVX2 was being used as a proxy to detect this feature. But that hindered codegen for AVX-enabled AMD chips such as btver2 that do not have the 32-byte unaligned access slowdown.

More detailed performance measurements are included in PR21541 ( http://llvm.org/bugs/show_bug.cgi?id=21541 ).

http://reviews.llvm.org/D6355

Files:
  lib/Target/X86/X86.td
  lib/Target/X86/X86ISelLowering.cpp
  lib/Target/X86/X86Subtarget.cpp
  lib/Target/X86/X86Subtarget.h
  test/CodeGen/X86/2012-05-19-avx2-store.ll
  test/CodeGen/X86/unaligned-32-byte-memops.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D6355.16492.patch
Type: text/x-patch
Size: 6981 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20141121/e692b6f5/attachment.bin>


More information about the llvm-commits mailing list