[PATCH] D142393: [OpenMP] Add 'amdgpu-flat-work-group-size' to OpenMP kernels
Matt Arsenault via Phabricator via cfe-commits
cfe-commits at lists.llvm.org
Wed Apr 26 10:43:27 PDT 2023
arsenm added inline comments.
================
Comment at: clang/include/clang/Basic/LangOptions.def:271
LANGOPT(GPUAllowDeviceInit, 1, 0, "allowing device side global init functions for HIP")
-LANGOPT(GPUMaxThreadsPerBlock, 32, 1024, "default max threads per block for kernel launch bounds for HIP")
+LANGOPT(GPUMaxThreadsPerBlock, 32, 1024, "default max threads per block for kernel launch bounds for OpenMP/HIP")
LANGOPT(GPUDeferDiag, 1, 0, "defer host/device related diagnostic messages for CUDA/HIP")
----------------
yaxunl wrote:
> arsenm wrote:
> > probably should drop the language and describe what it is
> CUDA does not use it. Drop the language may cause confusion.
If CUDA doesn't respect the flag, that's just broken. The concept is common among all the languages, and all the unnecessary language qualifications are a plague. Controls should be expressed as a generic concept that languages selectively enable
Repository:
rG LLVM Github Monorepo
CHANGES SINCE LAST ACTION
https://reviews.llvm.org/D142393/new/
https://reviews.llvm.org/D142393
More information about the cfe-commits
mailing list