[PATCH] D21160: [NVPTX] Add intrinsics for shfl instructions.

Justin Lebar via llvm-commits llvm-commits at lists.llvm.org
Wed Jun 8 17:22:00 PDT 2016


jlebar created this revision.
jlebar added a reviewer: tra.
jlebar added subscribers: jholewinski, llvm-commits.

Currently clang emits the shfl instructions via inline (volatile) asm in
the CUDA headers.  Switching to intrinsics will let the optimizer reason
across these calls.
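
For readers unfamiliar with the instruction being wrapped, here is a rough
host-side sketch of what shfl.down.b32 computes for one 32-lane warp, based on
my reading of the PTX ISA description.  It is not code from this patch or from
the CUDA headers: the function name shfl_down_model and the array-based warp
representation are made up for illustration.  The operand c is assumed to pack
the clamp value in bits 4:0 and the segment mask in bits 12:8.

```cpp
#include <array>
#include <cstdint>

// Hypothetical host-side model of PTX `shfl.down.b32` for a single warp.
// src[i] is the value lane i contributes; the result holds what each lane
// reads.  Assumption: `c` packs clamp (bits 4:0) and segment mask (bits 12:8).
std::array<std::uint32_t, 32> shfl_down_model(
    const std::array<std::uint32_t, 32>& src, std::uint32_t b,
    std::uint32_t c) {
  std::array<std::uint32_t, 32> dst{};
  const std::uint32_t bval = b & 0x1f;         // lane offset
  const std::uint32_t cval = c & 0x1f;         // clamp value
  const std::uint32_t mask = (c >> 8) & 0x1f;  // segment mask
  for (std::uint32_t lane = 0; lane < 32; ++lane) {
    // Highest lane this lane may read from within its segment.
    const std::uint32_t max_lane = (lane & mask) | (cval & ~mask & 0x1f);
    std::uint32_t j = lane + bval;
    if (j > max_lane) j = lane;  // out of range: the lane keeps its own value
    dst[lane] = src[j];
  }
  return dst;
}
```

With c = 0x1f the whole warp is one segment; with c = ((32 - 16) << 8) | 0x1f
the warp is split into two independent 16-lane segments, so lane 15 does not
read from lane 16.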

http://reviews.llvm.org/D21160

Files:
  include/llvm/IR/IntrinsicsNVVM.td
  lib/Target/NVPTX/NVPTXIntrinsics.td
  test/CodeGen/NVPTX/shfl.ll

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D21160.60123.patch
Type: text/x-patch
Size: 8783 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160609/63b23d00/attachment.bin>

