[PATCH] D43735: [AMDGPU] Remove use of OpenCL triple environment and replace with function attribute for AMDGPU

Tony Tye via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 19 10:57:26 PDT 2018


t-tye added inline comments.


================
Comment at: lib/CodeGen/TargetInfo.cpp:7661
+  if (M.getLangOpts().OpenCL && FD->hasAttr<OpenCLKernelAttr>() && (M.getTriple().getOS() == llvm::Triple::AMDHSA))
+    F->addFnAttr("amdgpu-implicitarg-num-bytes", "32");
+
----------------
yaxunl wrote:
> Now we support enqueue_kernel, there are two extra hidden arguments. Totally 6 hidden arguments. The implicitarg-num-bytes should be 48 now.
Can you give the ABI for these implicit arguments so can update the AMDGUUsage document?


https://reviews.llvm.org/D43735





More information about the llvm-commits mailing list