[PATCH] D116673: [Clang][NVPTX]Add NVPTX intrinsics and builtins for CUDA PTX cvt sm80 instructions

Artem Belevich via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Thu Jan 6 11:29:52 PST 2022


tra accepted this revision.
tra added a comment.
This revision is now accepted and ready to land.

LGTM.



================
Comment at: clang/test/CodeGen/builtins-nvptx.c:760
+// CHECK-LABEL: nvvm_cvt_sm80
+__device__ void nvvm_cvt_sm80() {
+#if __CUDA_ARCH__ >= 800
----------------
Can you try compiling this test file all the way to .o so we're sure that ptxas does accept the PTX we end up generating.



CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D116673/new/

https://reviews.llvm.org/D116673



More information about the cfe-commits mailing list