[PATCH] D49255: AMDGPU: Split wide vectors of i16/f16 into 32-bit regs on calls

Matt Arsenault via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Thu Jul 12 10:37:27 PDT 2018


arsenm created this revision.
arsenm added a reviewer: rampitec.
Herald added subscribers: t-tye, tpr, dstuttard, yaxunl, nhaehnle, wdng, kzhuravl.

This improves code for the same reasons as scalarizing 32-bit
element vectors.


https://reviews.llvm.org/D49255

Files:
  lib/Target/AMDGPU/SIISelLowering.cpp
  test/CodeGen/AMDGPU/call-argument-types.ll
  test/CodeGen/AMDGPU/mad-mix-lo.ll
  test/CodeGen/AMDGPU/mul.i16.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D49255.155215.patch
Type: text/x-patch
Size: 8394 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20180712/0abbb291/attachment.bin>


More information about the llvm-commits mailing list