[PATCH] D43735: [AMDGPU] Remove use of OpenCL triple environment and replace with function attribute for AMDGPU
Yaxun Liu via Phabricator via llvm-commits
llvm-commits at lists.llvm.org
Mon Mar 19 10:43:34 PDT 2018
yaxunl added inline comments.
================
Comment at: lib/CodeGen/TargetInfo.cpp:7661
+ if (M.getLangOpts().OpenCL && FD->hasAttr<OpenCLKernelAttr>() && (M.getTriple().getOS() == llvm::Triple::AMDHSA))
+ F->addFnAttr("amdgpu-implicitarg-num-bytes", "32");
+
----------------
Now we support enqueue_kernel, there are two extra hidden arguments. Totally 6 hidden arguments. The implicitarg-num-bytes should be 48 now.
https://reviews.llvm.org/D43735
More information about the llvm-commits
mailing list