[PATCH] D42258: [X86][SSE] Aggressively use PMADDWD for v4i32 multiplies with 17 or more leading zeros
    Simon Pilgrim via Phabricator via llvm-commits 
    llvm-commits at lists.llvm.org
       
    Thu Jan 18 11:44:25 PST 2018
    
    
  
RKSimon created this revision.
RKSimon added reviewers: craig.topper, pcordes, zvi, andreadb, spatel.
As discussed in https://reviews.llvm.org/D41484, PMADDWD for 'zero extended' vXi32 is nearly always a better option than PMULLD:
On SNB it will result in code that isn't any faster, but not any slower so we may as well keep it.
On KNL it only has half the throughput, so I've disabled it on there - ideally there'd be a better way than this.
Repository:
  rL LLVM
https://reviews.llvm.org/D42258
Files:
  lib/Target/X86/X86ISelLowering.cpp
  test/CodeGen/X86/promote.ll
  test/CodeGen/X86/shrink_vmul.ll
  test/CodeGen/X86/slow-pmulld.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D42258.130468.patch
Type: text/x-patch
Size: 42745 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180118/173ef298/attachment-0001.bin>
    
    
More information about the llvm-commits
mailing list