[PATCH] D98607: [NVPTX] CUDA provides a memcpy and memset
Artem Belevich via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 15 10:43:17 PDT 2021
tra added a comment.
It would be good to add a test.
Both NVCC and clang currently lower memcpy to an explicit loop. I'm not sure what effect, if any, allowing memcpy/memset libcalls would have on performance, so we may want to benchmark it before landing.
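As a rough illustration of the kind of benchmark meant here, the sketch below times a hand-written byte loop against a device-side memcpy call. It is not part of the patch. The kernel names, the chunk size, and the launch configuration are hypothetical, and error checking is omitted. Whether the second kernel really ends up as a library call rather than an inlined loop depends on the compiler and on the change under review, so the more useful comparison may be the same file built with and without the patch.

```cuda
#include <cstdio>
#include <cstring>
#include <cuda_runtime.h>

// Form 1: explicit byte loop, the shape memcpy is lowered to today.
__global__ void copy_loop(char *dst, const char *src, size_t chunk) {
  size_t tid = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
  size_t base = tid * chunk;
  for (size_t i = 0; i < chunk; ++i)
    dst[base + i] = src[base + i];
}

// Form 2: device-side memcpy with a size only known at run time,
// so the compiler cannot expand it as a fixed-size copy.
__global__ void copy_call(char *dst, const char *src, size_t chunk) {
  size_t tid = blockIdx.x * (size_t)blockDim.x + threadIdx.x;
  size_t base = tid * chunk;
  memcpy(dst + base, src + base, chunk);
}

using Kernel = void (*)(char *, const char *, size_t);

// Time a single launch with CUDA events; returns milliseconds.
static float time_ms(Kernel k, char *dst, const char *src, size_t chunk,
                     int blocks, int threads) {
  cudaEvent_t start, stop;
  cudaEventCreate(&start);
  cudaEventCreate(&stop);
  cudaEventRecord(start);
  k<<<blocks, threads>>>(dst, src, chunk);
  cudaEventRecord(stop);
  cudaEventSynchronize(stop);
  float ms = 0.0f;
  cudaEventElapsedTime(&ms, start, stop);
  cudaEventDestroy(start);
  cudaEventDestroy(stop);
  return ms;
}

int main() {
  // Hypothetical sizes, chosen only for illustration.
  const int blocks = 256, threads = 128;
  const size_t chunk = 256;
  const size_t bytes = (size_t)blocks * threads * chunk;

  char *src = nullptr, *dst = nullptr;
  cudaMalloc(&src, bytes);
  cudaMalloc(&dst, bytes);
  cudaMemset(src, 0x5a, bytes);

  // Warm-up launches so the timed runs exclude one-time setup cost.
  time_ms(copy_loop, dst, src, chunk, blocks, threads);
  time_ms(copy_call, dst, src, chunk, blocks, threads);

  float loop_ms = time_ms(copy_loop, dst, src, chunk, blocks, threads);
  float call_ms = time_ms(copy_call, dst, src, chunk, blocks, threads);
  std::printf("explicit loop: %.3f ms\nmemcpy call:   %.3f ms\n",
              loop_ms, call_ms);

  cudaFree(src);
  cudaFree(dst);
  return 0;
}
```

A single timed launch is noisy. A real measurement would repeat each kernel many times and sweep the chunk size, since small and large copies may behave differently.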
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D98607/new/
https://reviews.llvm.org/D98607