[PATCH] D100394: [Clang][NVPTX] Add NVPTX intrinsics and builtins for CUDA PTX cp.async instructions
Artem Belevich via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed May 17 14:05:49 PDT 2023
tra added a comment.
Hi. It looks like CUDA-11+ headers need a variant of cm.async intrinsics which provides the optional src_size argument.
I'm planning to add it to the existing intrinsics in NVPTX. It's just a heads-up in case you may have existing uses of them that may need to be updated.
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D100394/new/
https://reviews.llvm.org/D100394
More information about the cfe-commits
mailing list