[PATCH] D43735: [AMDGPU] Remove use of OpenCL triple environment and replace with function attribute for AMDGPU

Yaxun Liu via Phabricator via llvm-commits llvm-commits at lists.llvm.org
Mon Mar 19 10:43:34 PDT 2018


yaxunl added inline comments.


================
Comment at: lib/CodeGen/TargetInfo.cpp:7661
+  if (M.getLangOpts().OpenCL && FD->hasAttr<OpenCLKernelAttr>() && (M.getTriple().getOS() == llvm::Triple::AMDHSA))
+    F->addFnAttr("amdgpu-implicitarg-num-bytes", "32");
+
----------------
Now that we support enqueue_kernel, there are two extra hidden arguments, for a total of 6 hidden arguments. The implicitarg-num-bytes value should therefore be 48 now, not 32.
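
The arithmetic behind that number can be sketched as follows. This is only an illustration, not code from the patch: it assumes each hidden argument occupies 8 bytes, which is consistent with the earlier value of 32 for four hidden arguments. The names kHiddenArgSizeInBytes, implicitArgNumBytes and implicitArgAttrValue are hypothetical helpers introduced here.

```cpp
#include <string>

// Assumption: every hidden kernel argument takes 8 bytes. This matches the
// previous attribute value of 32 bytes for four hidden arguments.
constexpr unsigned kHiddenArgSizeInBytes = 8;

// Hypothetical helper: total size of the implicit argument area.
constexpr unsigned implicitArgNumBytes(unsigned NumHiddenArgs) {
  return NumHiddenArgs * kHiddenArgSizeInBytes;
}

// Four hidden arguments before enqueue_kernel support, six after.
static_assert(implicitArgNumBytes(4) == 32, "old value in the patch");
static_assert(implicitArgNumBytes(6) == 48, "value requested in this comment");

// Hypothetical helper: the string that would be passed as the value of the
// "amdgpu-implicitarg-num-bytes" function attribute.
inline std::string implicitArgAttrValue(unsigned NumHiddenArgs) {
  return std::to_string(implicitArgNumBytes(NumHiddenArgs));
}
```

Under that assumption, implicitArgAttrValue(6) yields the string "48", which is what the addFnAttr call above would need in place of "32".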


https://reviews.llvm.org/D43735




