[Parallel_libs-commits] [PATCH] D24043: [StreamExecutor] Simplify Kernel classes

James Price via Parallel_libs-commits parallel_libs-commits at lists.llvm.org
Tue Aug 30 13:32:33 PDT 2016


jprice added inline comments.

================
Comment at: streamexecutor/include/streamexecutor/Kernel.h:47
@@ -49,3 +46,3 @@
 ///     using SaxpyKernel =
-///         streamexecutor::TypedKernel<float *, float *, float *>;
+///         streamexecutor::Kernel<float *, float *, float *>;
 ///     } // namespace compiler_cuda_namespace
----------------
Maybe I'm missing something, but should these parameter types actually be `GlobalDeviceMemory<float>`? Otherwise I'm not sure how such a kernel object could be passed to a `thenLaunch()` function with the correct arguments.
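
To illustrate the concern, here is a minimal sketch. It is not the StreamExecutor API: `GlobalDeviceMemory`, `Kernel`, `thenLaunch`, `saxpyBody`, `canLaunch`, and `launchExample` are all hypothetical host-only stand-ins. It assumes that the launch function deduces its argument types from the kernel's parameter list. Under that assumption, a kernel declared with `float *` parameters cannot be launched with `GlobalDeviceMemory<float>` arguments.

```cpp
#include <cstddef>
#include <type_traits>
#include <utility>
#include <vector>

// Hypothetical stand-in for a typed device buffer (host memory here).
template <typename T> struct GlobalDeviceMemory {
  std::vector<T> Data;
};

// Hypothetical typed kernel: the parameter pack fixes the launch signature.
template <typename... ParameterTs> struct Kernel {
  void (*Body)(ParameterTs &...); // host function standing in for device code
};

// Hypothetical launch: the pack is deduced from both the kernel and the
// arguments, so any disagreement between the two is a compile-time error.
template <typename... ParameterTs>
void thenLaunch(const Kernel<ParameterTs...> &K, ParameterTs &...Args) {
  K.Body(Args...);
}

// Detects at compile time whether thenLaunch(K, Args...) is well-formed.
template <typename K, typename... Args>
auto canLaunch(int)
    -> decltype(thenLaunch(std::declval<const K &>(), std::declval<Args &>()...),
                std::true_type{});
template <typename K, typename... Args> std::false_type canLaunch(...);

using DevF = GlobalDeviceMemory<float>;
using SaxpyKernel = Kernel<DevF, DevF, DevF>;

// A kernel typed on device memory accepts device-memory arguments...
static_assert(decltype(canLaunch<SaxpyKernel, DevF, DevF, DevF>(0))::value,
              "matching types launch");
// ...while one typed on raw pointers does not.
static_assert(!decltype(canLaunch<Kernel<float *, float *, float *>, DevF,
                                  DevF, DevF>(0))::value,
              "mismatched types are rejected");

// Y[i] += A[0] * X[i]; A is a one-element buffer holding the scalar.
void saxpyBody(DevF &A, DevF &X, DevF &Y) {
  for (std::size_t I = 0; I < Y.Data.size(); ++I)
    Y.Data[I] += A.Data[0] * X.Data[I];
}

// Usage: build the buffers, launch, and return the updated Y.
std::vector<float> launchExample() {
  DevF A{{2.0f}}, X{{1.0f, 2.0f}}, Y{{10.0f, 20.0f}};
  SaxpyKernel K{&saxpyBody};
  thenLaunch(K, A, X, Y);
  return Y.Data;
}
```

If the real `thenLaunch()` converts or packs its arguments differently, the constraint may not apply in this form.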

================
Comment at: streamexecutor/include/streamexecutor/Kernel.h:59
@@ -61,2 +58,3 @@
 ///     // MultiKernelLoaderSpec instance like SaxpyKernelLoaderSpec below.
 ///     Expected<ccn::SaxpyKernel> MaybeKernel =
+///         Device->createKernel<cnn::SaxpyKernel>(cnn::SaxpyKernelLoaderSpec);
----------------
Is a `std::unique_ptr<>` missing from the `Expected<>` type here?

================
Comment at: streamexecutor/include/streamexecutor/Kernel.h:60
@@ -62,3 +59,3 @@
 ///     Expected<ccn::SaxpyKernel> MaybeKernel =
-///         ccn::SaxpyKernel::create(Device, ccn::SaxpyKernelLoaderSpec);
+///         Device->createKernel<cnn::SaxpyKernel>(cnn::SaxpyKernelLoaderSpec);
 ///     if (!MaybeKernel) { /* Handle error */ }
----------------
Typo: `cnn` should be `ccn`.


https://reviews.llvm.org/D24043