[PATCH] D142393: [OpenMP] Add 'amdgpu-flat-work-group-size' to OpenMP kernels
Joseph Huber via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Mon Jan 23 11:26:06 PST 2023
jhuber6 created this revision.
jhuber6 added reviewers: JonChesterfield, arsenm, tra, yaxunl, jdoerfert.
Herald added subscribers: kosarev, kerbowa, guansong, tpr, dstuttard, jvesely, kzhuravl.
Herald added a project: All.
jhuber6 requested review of this revision.
Herald added subscribers: cfe-commits, sstefan1, MaskRay, wdng.
Herald added a project: clang.
This patch adds the `amdgpu-flat-work-group-size=1,1024` attribute to
OpenMP kernels targeting AMDGPU. It also lets OpenMP offloading use
`--gpu-max-threads-per-block`, which is no longer a HIP-only option.
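As a rough illustration of the attribute value described above, the sketch below formats a "<min>,<max>" string. The helper name `flatWorkGroupSizeValue` is hypothetical and is not taken from the patch; the sketch also assumes that the upper bound comes from `--gpu-max-threads-per-block`, with 1024 used when the option is not given. The attached patch is the authority on how the value is actually computed.

```cpp
#include <cassert>
#include <string>

// Hypothetical helper, NOT code from D142393: builds the value of the
// "amdgpu-flat-work-group-size" function attribute as "<min>,<max>".
// Assumption: the maximum is the --gpu-max-threads-per-block setting and
// the minimum is 1, matching the "1,1024" value quoted in the summary.
std::string flatWorkGroupSizeValue(unsigned maxThreadsPerBlock,
                                   unsigned minThreads = 1) {
  // A flat work-group needs at least one thread, and min must not exceed max.
  assert(minThreads >= 1 && minThreads <= maxThreadsPerBlock);
  return std::to_string(minThreads) + "," +
         std::to_string(maxThreadsPerBlock);
}
```

Under these assumptions, `flatWorkGroupSizeValue(1024)` gives the value from the summary, and a smaller limit such as 256 would narrow the range accordingly.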
Repository:
rG LLVM Github Monorepo
https://reviews.llvm.org/D142393
Files:
clang/include/clang/Basic/LangOptions.def
clang/include/clang/Driver/Options.td
clang/lib/CodeGen/TargetInfo.cpp
clang/lib/Driver/ToolChains/AMDGPU.cpp
clang/lib/Driver/ToolChains/AMDGPUOpenMP.cpp
clang/lib/Driver/ToolChains/HIPAMD.cpp
clang/lib/Frontend/CompilerInvocation.cpp
clang/test/Driver/openmp-offload-gpu.c
clang/test/OpenMP/amdgcn-attributes.cpp
-------------- next part --------------
A non-text attachment was scrubbed...
Name: D142393.491465.patch
Type: text/x-patch
Size: 11071 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20230123/3b05ce33/attachment-0001.bin>