[PATCH] D129536: [CUDA][FIX] Make shfl[_sync] for unsigned long long non-recursive

Johannes Doerfert via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Mon Jul 11 19:44:02 PDT 2022


jdoerfert created this revision.
jdoerfert added a reviewer: tra.
Herald added subscribers: mattd, bollu, yaxunl.
Herald added a project: All.
jdoerfert requested review of this revision.
Herald added a project: clang.
Herald added a subscriber: cfe-commits.

A copy-paste error made the unsigned long long versions of the shfl and
shfl_sync intrinsics call themselves instead of forwarding to another
overload. The resulting unbounded recursion is UB. Reported and diagnosed
by @trws.
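
For illustration only, here is a host-side C++ sketch of the forwarding
pattern involved. It is not the header's actual code: shuffle_model and
check_shuffle_model are hypothetical names, the 32-bit "shuffle" is an
identity stand-in for the real warp intrinsic, and the sketch assumes a
64-bit long long and a 32-bit int. The point is the cast in the last
overload: casting to long long selects the signed overload, whereas casting
to unsigned long long would select the same function again and never
terminate.

```cpp
#include <cassert>
#include <cstring>

// Hypothetical stand-in for a 32-bit warp shuffle. On the host it just
// returns its argument so the overload forwarding can be exercised.
inline int shuffle_model(int val) { return val; }

// Signed 64-bit overload: split into two 32-bit halves, shuffle each
// half with the 32-bit overload, then reassemble.
inline long long shuffle_model(long long val) {
  static_assert(sizeof(long long) == 2 * sizeof(int),
                "sketch assumes 64-bit long long and 32-bit int");
  int halves[2];
  std::memcpy(halves, &val, sizeof(val));
  halves[0] = shuffle_model(halves[0]);
  halves[1] = shuffle_model(halves[1]);
  long long result;
  std::memcpy(&result, halves, sizeof(result));
  return result;
}

// Unsigned 64-bit overload: forward to the signed overload.
// Writing static_cast<unsigned long long>(val) as the argument instead
// would pick this same overload again and recurse without bound.
inline unsigned long long shuffle_model(unsigned long long val) {
  return static_cast<unsigned long long>(
      shuffle_model(static_cast<long long>(val)));
}

// Small self-check: with an identity shuffle the value must round-trip.
inline void check_shuffle_model() {
  assert(shuffle_model(0x0123456789ABCDEFULL) == 0x0123456789ABCDEFULL);
  assert(shuffle_model(0xFFFFFFFF00000001ULL) == 0xFFFFFFFF00000001ULL);
}
```

Calling check_shuffle_model() terminates only because the unsigned overload
forwards to a different overload; the patch itself should be consulted for
the real definitions.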


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D129536

Files:
  clang/lib/Headers/__clang_cuda_intrinsics.h
  clang/test/CodeGenCUDA/shuffle_long_long.cu

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D129536.443820.patch
Type: text/x-patch
Size: 5102 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20220712/95c37c63/attachment.bin>

