[PATCH] D142393: [OpenMP] Add 'amdgpu-flat-work-group-size' to OpenMP kernels

Matt Arsenault via Phabricator via cfe-commits cfe-commits at lists.llvm.org
Wed Apr 26 10:43:27 PDT 2023


arsenm added inline comments.


================
Comment at: clang/include/clang/Basic/LangOptions.def:271
 LANGOPT(GPUAllowDeviceInit, 1, 0, "allowing device side global init functions for HIP")
-LANGOPT(GPUMaxThreadsPerBlock, 32, 1024, "default max threads per block for kernel launch bounds for HIP")
+LANGOPT(GPUMaxThreadsPerBlock, 32, 1024, "default max threads per block for kernel launch bounds for OpenMP/HIP")
 LANGOPT(GPUDeferDiag, 1, 0, "defer host/device related diagnostic messages for CUDA/HIP")
----------------
yaxunl wrote:
> arsenm wrote:
> > probably should drop the language and describe what it is
> CUDA does not use it. Drop the language may cause confusion.
If CUDA doesn't respect the flag, that's just broken. The concept is common among all the languages, and all the unnecessary language qualifications are a plague. Controls should be expressed as a generic concept that languages selectively enable 


Repository:
  rG LLVM Github Monorepo

CHANGES SINCE LAST ACTION
  https://reviews.llvm.org/D142393/new/

https://reviews.llvm.org/D142393



More information about the cfe-commits mailing list