[PATCH] D49274: [CUDA] Provide integer SIMD functions for CUDA-9.2
Benjamin Kramer via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Jul 18 08:02:37 PDT 2018
bkramer accepted this revision.
bkramer added inline comments.
This revision is now accepted and ready to land.
================
Comment at: clang/lib/Headers/__clang_cuda_device_functions.h:1080
+ unsigned int r;
+ asm("vabsdiff2.u32.u32.u32.sat %0,%1,%2,0;" : "=r"(r) : "r"(__a), "r"(__b));
+ return r;
----------------
Should this really saturate?
================
Comment at: clang/lib/Headers/__clang_cuda_device_functions.h:1095
+ unsigned int r;
+ asm("vabsdiff2.s32.s32.s32.sat %0,%1,0,0;" : "=r"(r) : "r"(__a));
+ return r;
----------------
vabsdiff4?
https://reviews.llvm.org/D49274
More information about the cfe-commits
mailing list