[PATCH] D21160: [NVPTX] Add intrinsics for shfl instructions.
Justin Lebar via llvm-commits
llvm-commits at lists.llvm.org
Wed Jun 8 17:22:00 PDT 2016
jlebar created this revision.
jlebar added a reviewer: tra.
jlebar added subscribers: jholewinski, llvm-commits.
Currently clang emits these instructions via inline (volatile) asm in
the CUDA headers. Switching to intrinsics will let the optimizer reason
across calls to these intrinsics.
http://reviews.llvm.org/D21160
Files:
include/llvm/IR/IntrinsicsNVVM.td
lib/Target/NVPTX/NVPTXIntrinsics.td
test/CodeGen/NVPTX/shfl.ll
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D21160.60123.patch
Type: text/x-patch
Size: 8783 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/llvm-commits/attachments/20160609/63b23d00/attachment.bin>
More information about the llvm-commits
mailing list