[PATCH] D49274: [CUDA] Provide integer SIMD functions for CUDA-9.2

Benjamin Kramer via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Wed Jul 18 08:02:37 PDT 2018


bkramer accepted this revision.
bkramer added inline comments.
This revision is now accepted and ready to land.


================
Comment at: clang/lib/Headers/__clang_cuda_device_functions.h:1080
+  unsigned int r;
+  asm("vabsdiff2.u32.u32.u32.sat %0,%1,%2,0;" : "=r"(r) : "r"(__a), "r"(__b));
+  return r;
----------------
Should this really saturate?
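
For context on the question above, here is a minimal host-side sketch of a per-halfword absolute difference. It is not part of the patch: the names `ref_absdiff_u16x2` and `ref_absdiff_s16x2` are hypothetical, and the clamp models one reading of `.sat` (clamp each lane to the range of the destination type), which is an assumption and should be checked against the PTX ISA. Under that reading, saturation cannot change an unsigned result, since |a - b| of two 16-bit unsigned lanes always fits in 16 bits. It only matters when the destination lanes are signed.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical reference model, not the header's implementation.
// Each 16-bit lane of a and b is treated as unsigned; |a - b| goes to the same lane.
static uint32_t ref_absdiff_u16x2(uint32_t a, uint32_t b) {
  uint32_t r = 0;
  for (int lane = 0; lane < 2; ++lane) {
    int32_t x = static_cast<int32_t>((a >> (16 * lane)) & 0xFFFF);
    int32_t y = static_cast<int32_t>((b >> (16 * lane)) & 0xFFFF);
    int32_t d = x > y ? x - y : y - x;  // always in 0..65535, so no clamp is needed
    r |= static_cast<uint32_t>(d) << (16 * lane);
  }
  return r;
}

// Same operation with signed 16-bit lanes. When `saturate` is true, each lane
// is clamped to INT16_MAX (assumed meaning of .sat for a signed destination).
static uint32_t ref_absdiff_s16x2(uint32_t a, uint32_t b, bool saturate) {
  uint32_t r = 0;
  for (int lane = 0; lane < 2; ++lane) {
    int32_t x = static_cast<int16_t>(static_cast<uint16_t>(a >> (16 * lane)));
    int32_t y = static_cast<int16_t>(static_cast<uint16_t>(b >> (16 * lane)));
    int32_t d = x > y ? x - y : y - x;  // can reach 65535, which exceeds INT16_MAX
    if (saturate && d > INT16_MAX)
      d = INT16_MAX;
    r |= (static_cast<uint32_t>(d) & 0xFFFF) << (16 * lane);
  }
  return r;
}
```

In this model the saturating and wrapping signed variants differ only when a lane's difference exceeds 32767, for example when one lane holds -32768 and the other 32767.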


================
Comment at: clang/lib/Headers/__clang_cuda_device_functions.h:1095
+  unsigned int r;
+  asm("vabsdiff2.s32.s32.s32.sat %0,%1,0,0;" : "=r"(r) : "r"(__a));
+  return r;
----------------
Should this be vabsdiff4 rather than vabsdiff2?


https://reviews.llvm.org/D49274




