[all-commits] [llvm/llvm-project] fb2c42: [OpenMP] Improve AMDGPU Plugin
Johannes Doerfert via All-commits
all-commits at lists.llvm.org
Mon Dec 19 19:11:19 PST 2022
Branch: refs/heads/main
Home: https://github.com/llvm/llvm-project
Commit: fb2c42df41cb01e1122fd4e9c81e1f4bc5592b12
https://github.com/llvm/llvm-project/commit/fb2c42df41cb01e1122fd4e9c81e1f4bc5592b12
Author: Johannes Doerfert <johannes at jdoerfert.de>
Date: 2022-12-19 (Mon, 19 Dec 2022)
Changed paths:
M llvm/include/llvm/Frontend/OpenMP/OMPGridValues.h
M openmp/libomptarget/plugins-nextgen/amdgpu/src/rtl.cpp
M openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.cpp
M openmp/libomptarget/plugins-nextgen/common/PluginInterface/PluginInterface.h
Log Message:
-----------
[OpenMP] Improve AMDGPU Plugin
With this patch we:
- pick more sensible defaults for the number of teams, inspired by the
  old plugin and configurable via LIBOMPTARGET_AMDGPU_TEAMS_PER_CU.
- check the input signal of a kernel launch late, after the queue lock
  is taken, so that a barrier packet can be avoided more often.
- copy the kernel arguments into the appropriate memory in one go.
- manually specialize the callbacks to avoid potential indirect calls.
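
The first item can be illustrated with a minimal sketch. It assumes the
default team count is the teams-per-CU setting multiplied by the number of
compute units, which is what the variable name suggests but is not spelled
out in the log message. The helper names (getEnvOr, defaultNumTeams) and the
fallback parameter are hypothetical and are not taken from rtl.cpp.

```cpp
#include <cctype>
#include <cstdint>
#include <cstdlib>

// Hypothetical helper: read a positive integer from an environment variable,
// returning Default when the variable is unset, malformed, or zero.
static uint32_t getEnvOr(const char *Name, uint32_t Default) {
  const char *Str = std::getenv(Name);
  if (!Str || !std::isdigit(static_cast<unsigned char>(*Str)))
    return Default;
  char *End = nullptr;
  unsigned long Value = std::strtoul(Str, &End, 10);
  if (*End != '\0' || Value == 0 || Value > UINT32_MAX)
    return Default;
  return static_cast<uint32_t>(Value);
}

// Assumed policy: default teams = teams per CU * number of compute units.
// FallbackTeamsPerCU is an illustrative parameter, not the plugin's actual
// built-in default.
uint32_t defaultNumTeams(uint32_t NumComputeUnits,
                         uint32_t FallbackTeamsPerCU) {
  uint32_t TeamsPerCU =
      getEnvOr("LIBOMPTARGET_AMDGPU_TEAMS_PER_CU", FallbackTeamsPerCU);
  return TeamsPerCU * NumComputeUnits;
}
```

Under these assumptions, setting LIBOMPTARGET_AMDGPU_TEAMS_PER_CU=6 on a
device with 10 compute units would give a default of 60 teams.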
Commit: e3d9a448c57c46600db635244792205f7a0ea58b
https://github.com/llvm/llvm-project/commit/e3d9a448c57c46600db635244792205f7a0ea58b
Author: Johannes Doerfert <johannes at jdoerfert.de>
Date: 2022-12-19 (Mon, 19 Dec 2022)
Changed paths:
M openmp/libomptarget/plugins-nextgen/amdgpu/src/rtl.cpp
Log Message:
-----------
[OpenMP] Account for dynamic shared memory in the AMDGPU nextgen plugin
Compare: https://github.com/llvm/llvm-project/compare/ab17a08d0f6f...e3d9a448c57c