[Openmp-commits] [PATCH] D95752: [OpenMP][DeviceRTL] Extract shuffle idiom and port it to declare variant

Johannes Doerfert via Phabricator via Openmp-commits openmp-commits at lists.llvm.org
Sun Jan 31 00:20:04 PST 2021


jdoerfert planned changes to this revision.
jdoerfert added a comment.

I'll update this tomorrow; certain parts are not great.



================
Comment at: openmp/libomptarget/deviceRTLs/common/include/shuffle.h:88
+                                     int32_t SrcLane) {
+// In Cuda 9.0, the *_sync() version takes an extra argument 'mask'.
+#if CUDA_VERSION >= 9000
----------------
JonChesterfield wrote:
> Seems bad, both because it's a macro instead of a variant, and because I thought we'd already got rid of that macro.
We will, via PTX selection. I'll actually update this patch.
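
For readers following the macro-versus-variant point, here is a minimal sketch of how a `declare variant` could take the place of a `CUDA_VERSION` preprocessor check. It is not the code from this patch. The names `shuffle` and `shuffleNVPTX` are hypothetical, and the host fallback body is only a placeholder so that the snippet compiles without a device toolchain.

```cpp
#include <cstdint>

// Hypothetical device-side variant (name is an assumption, not from the patch).
// It is only declared here; its definition would live in NVPTX-only code and
// wrap the warp shuffle intrinsic.
int32_t shuffleNVPTX(int32_t Var, int32_t SrcLane);

// Base version. When OpenMP is enabled and the code is compiled for an NVPTX
// device, calls to shuffle() are redirected to shuffleNVPTX() by the compiler.
// Everywhere else (or with OpenMP disabled, where the pragma is ignored) this
// placeholder body is used: with no other lanes to read from, it returns the
// caller's own value.
#pragma omp declare variant(shuffleNVPTX) match(device = {arch(nvptx, nvptx64)})
int32_t shuffle(int32_t Var, int32_t SrcLane) {
  (void)SrcLane; // unused in the placeholder fallback
  return Var;
}
```

The selection happens through the OpenMP context selector rather than the preprocessor, so no version macro needs to be defined by the build system for this dispatch.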


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D95752/new/

https://reviews.llvm.org/D95752


