[PATCH] D98607: [NVPTX] CUDA provides a memcpy and memset

Artem Belevich via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 15 10:43:17 PDT 2021


tra added a comment.

It would be good to add a test.

Both NVCC and clang currently lower memcpy to an explicit loop. I'm not sure what effect (if any) allowing memcpy/memset libcalls would have on performance. We may want to benchmark the change before landing it.
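
For illustration only, the two strategies look roughly like the host-side C++ sketch below. The helper names `copy_with_loop` and `copy_with_libcall` are hypothetical and are not part of the patch. The actual lowering happens inside the compiler, and a host compiler is free to transform either form, so this shows only the shape of the two options and says nothing about their performance on a GPU.

```cpp
#include <cstddef>
#include <cstring>

// Hypothetical helper: copies one byte per iteration, analogous to the
// explicit loop that memcpy is currently lowered to.
void copy_with_loop(unsigned char *dst, const unsigned char *src,
                    std::size_t n) {
  for (std::size_t i = 0; i < n; ++i)
    dst[i] = src[i];
}

// Hypothetical helper: delegates to the library memcpy, analogous to
// emitting a libcall instead of an inline loop.
void copy_with_libcall(unsigned char *dst, const unsigned char *src,
                       std::size_t n) {
  std::memcpy(dst, src, n);
}
```

Both helpers must produce identical results. Any benchmark would compare only their cost, ideally across several copy sizes.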


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D98607/new/

https://reviews.llvm.org/D98607


