[PATCH] D49274: [CUDA] Provide integer SIMD functions for CUDA-9.2

Artem Belevich via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Thu Jul 19 16:47:42 PDT 2018


tra updated this revision to Diff 156386.
tra added a comment.

Fixed inline asm syntax.
Added a workaround for the bug in __vmaxs2() discovered during testing.

I've got a set of tests for these functions that I'll add to test-suite shortly. AFAICT this implementation matches NVIDIA's bit-for-bit.
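
For readers unfamiliar with these intrinsics, here is a minimal host-side sketch of the semantics CUDA documents for __vmaxs2(): a per-halfword signed maximum over two 32-bit words. The name ref_vmaxs2 is hypothetical and this is not the code from the patch; it is only the kind of plain reference a bit-for-bit comparison could be checked against.

```cpp
#include <cstdint>

// Hypothetical reference model (not taken from the patch): treat each
// 32-bit word as two signed 16-bit lanes and take the per-lane maximum.
static uint32_t ref_vmaxs2(uint32_t a, uint32_t b) {
  uint32_t result = 0;
  for (int lane = 0; lane < 2; ++lane) {
    const int shift = lane * 16;
    // Extract the 16-bit lane, then sign-extend it portably.
    int32_t x = static_cast<int32_t>((a >> shift) & 0xFFFFu);
    int32_t y = static_cast<int32_t>((b >> shift) & 0xFFFFu);
    x = (x ^ 0x8000) - 0x8000;
    y = (y ^ 0x8000) - 0x8000;
    const int32_t m = x > y ? x : y;
    // Pack the low 16 bits of the result back into its lane.
    result |= (static_cast<uint32_t>(m) & 0xFFFFu) << shift;
  }
  return result;
}
```

Lane boundary values such as 0x7FFF and 0x8000 are the inputs most likely to expose a signed/unsigned mix-up in an implementation.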


https://reviews.llvm.org/D49274

Files:
  clang/lib/Headers/__clang_cuda_device_functions.h
  clang/lib/Headers/__clang_cuda_libdevice_declares.h

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D49274.156386.patch
Type: text/x-patch
Size: 17204 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20180719/f23d56a6/attachment-0001.bin>
