[PATCH] D32416: [x86, SSE] AVX1 PR28129

Dinar Temirbulatov via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Sun Apr 23 23:33:20 PDT 2017


dtemirbulatov created this revision.

Further perf tests on Jaguar indicate that:

  vxorps  %ymm0, %ymm0, %ymm0
  vcmpps  $15, %ymm0, %ymm0, %ymm0

is consistently faster (by about 9%) than:

  vpcmpeqd  %xmm0, %xmm0, %xmm0
  vinsertf128  $1, %xmm0, %ymm0, %ymm0

Testing equivalent code on a SandyBridge (E5-2640) shows the first sequence is slightly (~3%) faster there as well.
This change only affects AVX1 codegen. OK to commit?
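
For readers comparing the two sequences: both materialize a 256-bit all-ones vector. Below is a minimal scalar sketch in C of the lane semantics. It is not code from the patch. The `ymm_model` type and the function names are hypothetical, introduced only for illustration. It relies on predicate 15 of vcmpps being the always-true comparison (TRUE_UQ), and on VEX-encoded 128-bit instructions zeroing the upper half of the destination ymm register. The leading vxorps is presumably there to break the dependency on the old value of ymm0.

```c
#include <assert.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical model of a 256-bit YMM register as eight 32-bit lanes. */
typedef struct { uint32_t lane[8]; } ymm_model;

/* Models: vxorps %ymm0, %ymm0, %ymm0
 *         vcmpps $15, %ymm0, %ymm0, %ymm0
 * Predicate 15 (TRUE_UQ) yields all ones in every lane, whatever the inputs. */
ymm_model all_ones_via_vcmpps(void) {
    ymm_model r;
    memset(&r, 0, sizeof r);              /* vxorps: zero the register */
    for (int i = 0; i < 8; ++i)
        r.lane[i] = UINT32_MAX;           /* vcmpps $15: always-true compare */
    return r;
}

/* Models: vpcmpeqd %xmm0, %xmm0, %xmm0
 *         vinsertf128 $1, %xmm0, %ymm0, %ymm0
 * The argument stands for whatever ymm0 held before the sequence. */
ymm_model all_ones_via_vpcmpeqd(ymm_model r) {
    for (int i = 0; i < 4; ++i)           /* vpcmpeqd: x == x is always true */
        r.lane[i] = (r.lane[i] == r.lane[i]) ? UINT32_MAX : 0;
    memset(&r.lane[4], 0, 4 * sizeof(uint32_t));  /* VEX.128 zeroes the upper half */
    /* vinsertf128 $1: copy the low 128 bits into the upper 128 bits */
    memcpy(&r.lane[4], &r.lane[0], 4 * sizeof(uint32_t));
    return r;
}

/* Returns 1 when both modelled sequences produce the same 256-bit value. */
int sequences_agree(void) {
    ymm_model previous = {{1, 2, 3, 4, 5, 6, 7, 8}};  /* arbitrary prior contents */
    ymm_model a = all_ones_via_vcmpps();
    ymm_model b = all_ones_via_vpcmpeqd(previous);
    assert(a.lane[0] == UINT32_MAX);      /* sanity check on the model itself */
    return memcmp(&a, &b, sizeof a) == 0;
}
```

The model only captures the resulting value; it says nothing about the timing difference measured above.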


https://reviews.llvm.org/D32416

Files:
  lib/Target/X86/X86InstrSSE.td
  test/CodeGen/X86/all-ones-vector.ll
  test/CodeGen/X86/avx-basic.ll
  test/CodeGen/X86/avx-cvt-3.ll
  test/CodeGen/X86/avx-intrinsics-fast-isel.ll
  test/CodeGen/X86/pr28129.ll
  test/CodeGen/X86/vector-pcmp.ll
  test/CodeGen/X86/xop-intrinsics-fast-isel.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D32416.96347.patch
Type: text/x-patch
Size: 18313 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20170424/47d89bd3/attachment.bin>

