[Parallel_libs-commits] [PATCH] D24043: [StreamExecutor] Simplify Kernel classes

Jason Henline via Parallel_libs-commits parallel_libs-commits at lists.llvm.org
Tue Aug 30 14:04:27 PDT 2016


jhen added inline comments.

================
Comment at: streamexecutor/include/streamexecutor/Kernel.h:47
@@ -49,3 +46,3 @@
 ///     using SaxpyKernel =
-///         streamexecutor::TypedKernel<float *, float *, float *>;
+///         streamexecutor::Kernel<float *, float *, float *>;
 ///     } // namespace compiler_cuda_namespace
----------------
jprice wrote:
> Maybe I'm missing something, but should these parameter types actually be `GlobalDeviceMemory<float>`? Otherwise I'm not sure how such a kernel object could be passed to a `thenLaunch()` function with the correct arguments.
You're right. Plus the `A` parameter is usually passed as a `float` not a `float *`. Now the signature should be fixed.


https://reviews.llvm.org/D24043





More information about the Parallel_libs-commits mailing list