[PATCH] D142393: [OpenMP] Add 'amdgpu-flat-work-group-size' to OpenMP kernels

Joseph Huber via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Mon Jan 23 11:26:06 PST 2023


jhuber6 created this revision.
jhuber6 added reviewers: JonChesterfield, arsenm, tra, yaxunl, jdoerfert.
Herald added subscribers: kosarev, kerbowa, guansong, tpr, dstuttard, jvesely, kzhuravl.
Herald added a project: All.
jhuber6 requested review of this revision.
Herald added subscribers: cfe-commits, sstefan1, MaskRay, wdng.
Herald added a project: clang.

This patch adds the `amdgpu-flat-work-group-size=1,1024` attribute to
OpenMP kernels targeting AMDGPU. It also lets OpenMP offloading use
`--gpu-max-threads-per-block`, which is no longer restricted to being a
HIP-only option.
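
For illustration only, here is a minimal sketch of the kind of OpenMP
target region that is outlined into an AMDGPU kernel and would therefore
carry the attribute. The function name `scale`, the `gfx90a` architecture,
and the value `512` are assumptions chosen for the example; none of them
come from the patch, and the build line in the comment is a guess at
typical usage rather than a command taken from the tests.

```c
#include <stddef.h>

/* Hypothetical example kernel; not taken from the patch.
 * Possible build line (an assumption; adjust the arch for your GPU):
 *   clang -fopenmp --offload-arch=gfx90a \
 *         --gpu-max-threads-per-block=512 scale.c
 * Without -fopenmp the pragma is ignored and the loop runs on the host. */
void scale(float *x, size_t n, float a) {
  /* The target region below is what gets outlined into a device kernel;
   * per the description above, that kernel function is where the
   * amdgpu-flat-work-group-size attribute would be attached. */
#pragma omp target teams distribute parallel for map(tofrom: x[0:n])
  for (size_t i = 0; i < n; ++i)
    x[i] *= a;
}
```

One way to check the result would be to inspect the device-side LLVM IR
for the kernel and look for `"amdgpu-flat-work-group-size"` among its
function attributes.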


Repository:
  rG LLVM Github Monorepo

https://reviews.llvm.org/D142393

Files:
  clang/include/clang/Basic/LangOptions.def
  clang/include/clang/Driver/Options.td
  clang/lib/CodeGen/TargetInfo.cpp
  clang/lib/Driver/ToolChains/AMDGPU.cpp
  clang/lib/Driver/ToolChains/AMDGPUOpenMP.cpp
  clang/lib/Driver/ToolChains/HIPAMD.cpp
  clang/lib/Frontend/CompilerInvocation.cpp
  clang/test/Driver/openmp-offload-gpu.c
  clang/test/OpenMP/amdgcn-attributes.cpp

-------------- next part --------------
A non-text attachment was scrubbed...
Name: D142393.491465.patch
Type: text/x-patch
Size: 11071 bytes
Desc: not available
URL: <http://lists.llvm.org/pipermail/cfe-commits/attachments/20230123/3b05ce33/attachment-0001.bin>

